glam/frontend/public/schemas/20251121/linkml/modules/classes/Provenance.yaml
kempersc 53c6dbc2d9 feat(schema): Migrate temporal slots and introduce new pattern classes
Major slot migrations following slot_fixes.yaml revisions:
- TimeSpan: begin_of_the_begin, begin_of_the_end, end_of_the_begin, end_of_the_end
- Quantity: has_or_had_measurement_unit with MeasureUnit class
- Description: has_or_had_description with Description class
- URL, WikiData, Timestamp, Location, Provenance pattern classes

New slots for RiC-O compliance:
- Temporal: has_or_had_time_interval, calendar_system
- Transfer: is_or_was_transferred, has_or_had_policy
- Location: starts/ends_or_started/ended_at_location
- Provenance: has_or_had_provenance_path, is_or_was_webarchived_at

Archive deprecated slots per Rule 53 workflow.
2026-01-14 20:01:55 +01:00

118 lines
3.6 KiB
YAML

# Provenance - A single provenance statement tracking the origin and lineage of data
# Distinct from ProvenanceBlock which bundles multiple provenance sources
# Created per slot_fixes.yaml migration for: binding_provenance, xpath, type_hypothesis
# Creation date: 2026-01-14
id: https://nde.nl/ontology/hc/classes/Provenance
name: Provenance
title: Provenance
prefixes:
linkml: https://w3id.org/linkml/
hc: https://nde.nl/ontology/hc/
prov: http://www.w3.org/ns/prov#
dct: http://purl.org/dc/terms/
schema: http://schema.org/
xsd: http://www.w3.org/2001/XMLSchema#
imports:
- linkml:types
- ../slots/has_or_had_agent
- ../slots/temporal_extent
default_range: string
classes:
Provenance:
description: >-
A single provenance statement describing the origin, lineage, or derivation
of a piece of data.
**DISTINCTION FROM ProvenanceBlock**:
- `Provenance` represents a SINGLE provenance statement (one source, one activity)
- `ProvenanceBlock` bundles MULTIPLE provenance sources together
Use `Provenance` when:
- Tracking provenance of individual data elements
- Recording specific extraction or transformation activities
- Linking data to its source agent/entity
**ONTOLOGY MAPPING**:
- class_uri: prov:Activity (the extraction/derivation activity)
- Links to prov:Agent via has_or_had_agent
- Links to prov:Entity via used (source) and generated (result)
class_uri: prov:Activity
close_mappings:
- dct:ProvenanceStatement
- schema:Action
related_mappings:
- prov:Entity
- prov:Derivation
slots:
- has_or_had_agent
- temporal_extent
attributes:
source_entity:
range: uriorcurie
description: >-
URI of the source entity from which data was derived (prov:used).
slot_uri: prov:used
generated_entity:
range: uriorcurie
description: >-
URI of the entity generated by this provenance activity (prov:generated).
slot_uri: prov:generated
extraction_method:
range: string
description: >-
Method or process used to extract/derive the data.
Examples: "xpath_extraction", "api_query", "manual_entry"
confidence_score:
range: float
minimum_value: 0.0
maximum_value: 1.0
description: >-
Confidence score (0.0 to 1.0) indicating reliability of this provenance.
note:
range: string
description: >-
Human-readable note about this provenance statement.
slot_uri: prov:value
annotations:
custodian_types: '["*"]'
custodian_types_rationale: >-
Provenance tracking is universal across all custodian types.
custodian_types_primary: "*"
specificity_score: 0.15
specificity_rationale: >-
Very low specificity - provenance metadata applies universally.
examples:
- value: |
Provenance:
extraction_method: "xpath_extraction"
source_entity: "https://example.org/webpage/12345"
confidence_score: 0.95
note: "Extracted from archived HTML using XPath"
description: >-
Provenance for an XPath-extracted value from an archived webpage.
- value: |
Provenance:
has_or_had_agent:
agent_type: "software"
name: "glam-extractor-v2.1"
extraction_method: "api_scraping"
confidence_score: 0.85
description: >-
Provenance for API-scraped data with agent identification.