glam/schemas/20251121/linkml/modules/classes/ExtractionMethod.yaml
kempersc 3da90b940e
All checks were successful
Deploy Frontend / build-and-deploy (push) Successful in 2m4s
feat(schema): complete multiple slot_fixes.yaml migrations
Session 2026-01-19: Completed remaining migrations per Rules 53/56/60.

Major migrations:
1. claim_type → has_or_had_type + ClaimType/ClaimTypes (60+ concrete types in 11 categories)
2. circumstances_of_death → is_deceased + DeceasedStatus + CauseOfDeath
3. claims_count → has_or_had_quantity + Quantity (with based_on_claim for provenance)
4. classification_status → has_or_had_type + ClassificationStatusType

Created files:
- ClaimType.yaml, ClaimTypes.yaml (abstract base + 60+ concrete subclasses)
- DeceasedStatus.yaml, CauseOfDeath.yaml, CauseOfDeathTypeEnum.yaml
- ClassificationStatus.yaml, ClassificationStatusType.yaml, ClassificationStatusTypes.yaml
- CITESAppendix.yaml, City.yaml, CertaintyLevel.yaml
- is_deceased.yaml, is_or_was_caused_by.yaml, based_on_claim.yaml

Archived slots:
- claim_type, circumstances_of_death, claims_count, classification_status

Added Rule 60 to AGENTS.md: No Migration Deferral - agents MUST execute all migrations.

All 527 slot_fixes.yaml entries now complete (100%).
2026-01-19 13:05:53 +01:00

74 lines
2.7 KiB
YAML

id: https://nde.nl/ontology/hc/class/ExtractionMethod
name: ExtractionMethod
title: ExtractionMethod Class - Methods for Data Extraction
prefixes:
linkml: https://w3id.org/linkml/
hc: https://nde.nl/ontology/hc/
schema: http://schema.org/
prov: http://www.w3.org/ns/prov#
nif: http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core#
imports:
- linkml:types
- ../slots/has_or_had_label
- ./Label
default_prefix: hc
classes:
ExtractionMethod:
class_uri: prov:SoftwareAgent
description: |
A method or technique used to extract data from a source.
Extraction methods define HOW data was obtained, providing
transparency and reproducibility for claim provenance.
**Common Extraction Methods**:
- `xpath_exact_match`: XPath query with exact text match
- `xpath_fuzzy_match`: XPath query with fuzzy text matching
- `text_search`: Full-text search within document
- `css_selector`: CSS selector for element location
- `json_ld_parse`: Parsing structured JSON-LD data
- `regex_pattern`: Regular expression pattern matching
- `nlp_ner`: Named Entity Recognition via NLP
- `manual_annotation`: Human annotator extraction
**Ontology Mapping Rationale**:
- class_uri is prov:SoftwareAgent because extraction methods
are typically software-based agents that perform extraction
- close_mappings includes nif:Context as NIF models text
extraction contexts and methods
**MIGRATION NOTE (2026-01-19)**:
Created per slot_fixes.yaml revision for claim_extraction_method
slot migration (Rule 53/56).
exact_mappings:
- prov:SoftwareAgent
close_mappings:
- nif:Context
- schema:HowTo
slots:
- has_or_had_label
slot_usage:
has_or_had_label:
description: |
The name/identifier of the extraction method.
Examples: "xpath_exact_match", "nlp_ner", "json_ld_parse"
range: Label
inlined: true
required: true
comments:
- 'CREATED 2026-01-19: Per slot_fixes.yaml revision (Rule 53/56)'
- Replaces string-valued claim_extraction_method slot
- Enables structured representation of extraction techniques
examples:
- value:
has_or_had_label:
has_or_had_label: xpath_exact_match
description: XPath extraction with exact text matching
- value:
has_or_had_label:
has_or_had_label: nlp_ner
description: Named Entity Recognition extraction
- value:
has_or_had_label:
has_or_had_label: json_ld_parse
description: Structured data extraction from JSON-LD