glam/schemas/20251121/linkml/modules/slots/source_url.yaml
2025-12-03 17:38:46 +01:00

42 lines
1.3 KiB
YAML

# source_url slot - URL of source web page for provenance tracking
id: https://nde.nl/ontology/hc/slot/source_url
name: source_url
title: Source URL Slot
prefixes:
pav: http://purl.org/pav/
schema: http://schema.org/
prov: http://www.w3.org/ns/prov#
description: |
URL of the web page from which data was retrieved or extracted.
Used for provenance tracking in:
- WebClaim: Source page for extracted claims
- WebObservation: Source page for web observations
- FundingOpportunity: Source page for funding information
slots:
source_url:
slot_uri: pav:retrievedFrom
range: uri
required: false
multivalued: false
description: |
URL of the web page this data was retrieved from.
pav:retrievedFrom - "The URI from which the resource was retrieved."
This is a key provenance field that enables verification of extracted data
by linking back to the original source web page.
Examples:
- https://historischeverenigingnijeveen.nl/ (institution homepage)
- https://ec.europa.eu/info/funding-tenders/... (EU funding portal)
- https://www.heritagefund.org.uk/funding/... (UK heritage fund)
comments:
- "Maps to pav:retrievedFrom for provenance tracking"
- "Essential for web claim verification workflows"
- "Should be the exact URL that was accessed, not normalized"