glam/schemas/20251121/linkml/modules/classes/ExtractionMethod.yaml

id: https://nde.nl/ontology/hc/class/ExtractionMethod
name: ExtractionMethod
title: Extraction Method Class
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  prov: http://www.w3.org/ns/prov#
  schema: http://schema.org/
default_prefix: hc
imports:
- linkml:types
- ../slots/has_label
classes:
  ExtractionMethod:
    class_uri: prov:Plan
    description: >-
      Documented technique or procedure employed to derive structured data from unstructured or semi-structured sources.
    alt_descriptions:
      nl: >-
        Gedocumenteerde techniek of procedure die wordt gebruikt om gestructureerde gegevens af te leiden uit ongestructureerde of semi-gestructureerde bronnen.
      de: >-
        Dokumentierte Technik oder Prozedur zur Ableitung strukturierter Daten aus unstrukturierten oder semi-strukturierten Quellen.
      fr: >-
        Technique ou procédure documentée employée pour dériver des données structurées à partir de sources non structurées ou semi-structurées.
      es: >-
        Técnica o procedimiento documentado empleado para derivar datos estructurados de fuentes no estructuradas o semiestructuradas.
      ar: >-
        تقنية أو إجراء موثق يستخدم لاستخلاص بيانات منظمة من مصادر غير منظمة أو شبه منظمة.
      id: >-
        Teknik atau prosedur terdokumentasi yang digunakan untuk menurunkan data terstruktur dari sumber tidak terstruktur atau semi-terstruktur.
      zh: >-
        用于从非结构化或半结构化源中提取结构化数据的记录技术或过程。
    structured_aliases:
      nl: Extractiemethode
      de: Extraktionsmethode
      fr: Méthode d'extraction
      es: Método de extracción
      ar: طريقة الاستخراج
      id: Metode Ekstraksi
      zh: 提取方法
    broad_mappings:
    - prov:Plan
    close_mappings:
    - schema:HowTo
    slots:
    - has_label
    slot_usage:
      has_label:
        required: true
    notes:
    - |
      Preserved from prior description (commit ee5e8e5a):
      "A method or technique used to extract data from a source.\n\nExtraction methods define HOW data was obtained, providing\ntransparency and reproducibility for claim provenance.\n\n**Common Extraction Methods**:\n- `xpath_exact_match`: XPath query with exact text match\n- `xpath_fuzzy_match`: XPath query with fuzzy text matching\n- `text_search`: Full-text search within document\n- `css_selector`: CSS selector for element location\n- `json_ld_parse`: Parsing structured JSON-LD data\n- `regex_pattern`: Regular expression pattern matching\n- `nlp_ner`: Named Entity Recognition via NLP\n- `manual_annotation`: Human annotator extraction\n\n**Ontology Mapping Rationale**:\n- class_uri is prov:SoftwareAgent because extraction methods\n are typically software-based agents that perform extraction\n- close_mappings includes nif:Context as NIF models text\n extraction contexts and methods\n\n**MIGRATION NOTE (2026-01-19)**:\nCreated per slot_fixes.yaml revision for claim_extraction_method\n\
    annotations:
      specificity_score: 0.35
      specificity_rationale: Method vocabulary term for reproducible extraction workflows.
      custodian_types: '["*"]'