glam/schemas/20251121/linkml/modules/classes/FindingAidProvenance.yaml
kempersc 174a420c08
refactor(schema): centralize 1515 inline slot definitions per Rule 48
- Remove inline slot definitions from 144 class files
- Create 7 new centralized slot files in modules/slots/:
  - custodian_type_broader.yaml
  - custodian_type_narrower.yaml
  - custodian_type_related.yaml
  - definition.yaml
  - finding_aid_access_restriction.yaml
  - finding_aid_description.yaml
  - finding_aid_temporal_coverage.yaml
- Add centralize_inline_slots.py automation script
- Update manifest with new timestamp

Rule 48: Class files must NOT define inline slots - all slots
must be imported from modules/slots/ directory.

Note: Pre-existing IdentifierFormat duplicate class definition
(in Standard.yaml and IdentifierFormat.yaml) not addressed in
this commit - requires separate schema refactor.
2026-01-11 22:02:14 +01:00


id: https://nde.nl/ontology/hc/class/FindingAidProvenance
name: finding_aid_provenance_class
title: FindingAidProvenance Class
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  prov: http://www.w3.org/ns/prov#
  schema: http://schema.org/
default_prefix: hc
default_range: string
imports:
- linkml:types
- ../metadata
- ./SpecificityAnnotation
- ./TemplateSpecificityScores
- ../slots/extraction_method
- ../slots/extraction_note
- ../slots/retrieval_agent
- ../slots/source_url
- ../slots/note
- ../slots/claims_count
- ../slots/crawler_version
- ../slots/date_retrieved
- ../slots/html_snapshot_path
- ../slots/specificity_annotation
- ../slots/template_specificity
- ../slots/validation_status
classes:
  FindingAidProvenance:
    class_uri: prov:Activity
    description: 'Provenance metadata for web-harvested finding aid information.
      Tracks when and how the metadata was retrieved, enabling data freshness
      assessment and re-crawling decisions.
      '
    slots:
    - claims_count
    - crawler_version
    - date_retrieved
    - extraction_method
    - extraction_note
    - html_snapshot_path
    - note
    - retrieval_agent
    - source_url
    - specificity_annotation
    - template_specificity
    - validation_status
    slot_usage:
      date_retrieved:
        description: ISO 8601 date when data was retrieved
        range: date
        required: true
      retrieval_agent:
        description: Tool/method used for retrieval
        required: true
      source_url:
        description: Original source URL
        range: uri
      extraction_note:
        description: Notes about extraction process or quality
      extraction_method:
        description: Method used (manual, automated, hybrid)
      crawler_version:
        description: Version of crawling tool used
      html_snapshot_path:
        description: Relative path to archived HTML snapshot
      claims_count:
        description: Number of web claims extracted
        range: integer
      validation_status:
        description: Status of validation (validated, pending, failed)
      note:
        description: Additional provenance notes or remarks
      specificity_annotation:
        range: SpecificityAnnotation
        inlined: true
      template_specificity:
        range: TemplateSpecificityScores
        inlined: true
    exact_mappings:
    - prov:Activity
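
# Illustrative instance (a sketch only, not part of the schema).
# It shows the shape of a FindingAidProvenance record as implied by the
# slot_usage block above. Every value below is hypothetical, including the
# agent name, URL, version string, and snapshot path. Only date_retrieved
# and retrieval_agent are marked required; the remaining slots are optional.
#
# date_retrieved: 2026-01-11              # range: date (ISO 8601)
# retrieval_agent: example-crawler        # hypothetical tool name
# source_url: https://example.org/finding-aids/123   # range: uri
# extraction_method: automated            # manual, automated, or hybrid
# crawler_version: 0.0.0                  # hypothetical version string
# html_snapshot_path: snapshots/example.html         # hypothetical relative path
# claims_count: 12                        # range: integer
# validation_status: pending              # validated, pending, or failed
# note: Example record for documentation purposes only.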