glam/schemas/20251121/linkml/modules/classes/ExtractionSourceInfo.yaml
kempersc 66adec257e Add scripts for normalizing LinkML schemas and validating schema integrity
- Implement `normalize_linkml_alt_descriptions.py` to convert structured alt_descriptions to the expected scalar form.
- Implement `normalize_linkml_structured_aliases.py` to flatten language-keyed structured_aliases into a standard list-of-objects format.
- Implement `validate_linkml_schema_integrity.py` to validate the integrity of LinkML schema bundles, checking for import resolution, YAML parsing, and reference existence.
2026-02-16 10:16:51 +01:00

37 lines
1.5 KiB
YAML

id: https://nde.nl/ontology/hc/classes/ExtractionSourceInfo
name: ExtractionSourceInfo
title: Extraction Source Info Class
prefixes:
linkml: https://w3id.org/linkml/
hc: https://nde.nl/ontology/hc/
schema: http://schema.org/
prov: http://www.w3.org/ns/prov#
default_prefix: hc
imports:
- linkml:types
- ../slots/has_field
- ../slots/has_text
- ../slots/has_method
classes:
ExtractionSourceInfo:
class_uri: prov:Entity
description: Source-level extraction trace containing extracted field, text, and method.
exact_mappings:
- prov:Entity
close_mappings:
- schema:PropertyValue
related_mappings:
- prov:Derivation
slots:
- has_field
- has_text
- has_method
notes:
- |
Preserved from prior description (commit ee5e8e5a):
"Information about extraction source (can be structured or freeform). Documents the field extracted, the extracted text content, and the method used for extraction (pattern matching, LLM, etc.).\nOntology mapping rationale: - class_uri is prov:Entity because extraction source info is a\n traceable entity recording what was extracted and how\n- close_mappings includes pav:SourceAccessedAt as this documents\n a source that was accessed during extraction\n- related_mappings includes prov:Derivation (the extraction is\n derived from source) and schema:PropertyValue (field/value pair)"
annotations:
specificity_score: 0.35
specificity_rationale: Low-level extraction trace entity used for audit and debugging.
custodian_types: '["*"]'