glam/schemas/20251121/linkml/modules/classes/ExtractionMethod.yaml

id: https://nde.nl/ontology/hc/class/ExtractionMethod
name: ExtractionMethod
title: Extraction Method Class
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  prov: http://www.w3.org/ns/prov#
  schema: http://schema.org/
default_prefix: hc
imports:
  - linkml:types
  - ../slots/has_label
classes:
  ExtractionMethod:
    class_uri: prov:Plan
    description: >-
      Documented technique or procedure employed to derive structured data from unstructured or semi-structured sources.
    alt_descriptions:
      nl: >-
        Gedocumenteerde techniek of procedure die wordt gebruikt om gestructureerde gegevens af te leiden uit ongestructureerde of semi-gestructureerde bronnen.
      de: >-
        Dokumentierte Technik oder Prozedur zur Ableitung strukturierter Daten aus unstrukturierten oder semi-strukturierten Quellen.
      fr: >-
        Technique ou procédure documentée employée pour dériver des données structurées à partir de sources non structurées ou semi-structurées.
      es: >-
        Técnica o procedimiento documentado empleado para derivar datos estructurados de fuentes no estructuradas o semiestructuradas.
      ar: >-
        تقنية أو إجراء موثق يستخدم لاستخلاص بيانات منظمة من مصادر غير منظمة أو شبه منظمة.
      id: >-
        Teknik atau prosedur terdokumentasi yang digunakan untuk menurunkan data terstruktur dari sumber tidak terstruktur atau semi-terstruktur.
      zh: >-
        用于从非结构化或半结构化源中提取结构化数据的记录技术或过程。
    structured_aliases:
      nl: Extractiemethode
      de: Extraktionsmethode
      fr: Méthode d'extraction
      es: Método de extracción
      ar: طريقة الاستخراج
      id: Metode Ekstraksi
      zh: 提取方法
    broad_mappings:
      - prov:Plan
    close_mappings:
      - schema:HowTo
    slots:
      - has_label
    slot_usage:
      has_label:
        required: true
    notes:
      - |
        Preserved from prior description (commit ee5e8e5a):

        Preserved from prior description (commit ee5e8e5a):

        "A method or technique used to extract data from a source.\n\nExtraction methods define HOW data was obtained, providing\ntransparency and reproducibility for claim provenance.\n\n**Common Extraction Methods**:\n- `xpath_exact_match`: XPath query with exact text match\n- `xpath_fuzzy_match`: XPath query with fuzzy text matching\n- `text_search`: Full-text search within document\n- `css_selector`: CSS selector for element location\n- `json_ld_parse`: Parsing structured JSON-LD data\n- `regex_pattern`: Regular expression pattern matching\n- `nlp_ner`: Named Entity Recognition via NLP\n- `manual_annotation`: Human annotator extraction\n\n**Ontology Mapping Rationale**:\n- class_uri is prov:SoftwareAgent because extraction methods\n  are typically software-based agents that perform extraction\n- close_mappings includes nif:Context as NIF models text\n  extraction contexts and methods\n\n**MIGRATION NOTE (2026-01-19)**:\nCreated per slot_fixes.yaml revision for claim_extraction_method\n\
    annotations:
      specificity_score: 0.35
      specificity_rationale: Method vocabulary term for reproducible extraction workflows.
      custodian_types: '["*"]'