glam/schemas/20251121/linkml/modules/classes/RawSource.yaml

196 lines
4.6 KiB
YAML

id: https://nde.nl/ontology/hc/classes/RawSource
name: RawSource
title: RawSource
prefixes:
linkml: https://w3id.org/linkml/
hc: https://nde.nl/ontology/hc/
schema: http://schema.org/
prov: http://www.w3.org/ns/prov#
xsd: http://www.w3.org/2001/XMLSchema#
pav: http://purl.org/pav/
imports:
- linkml:types
- ../slots/has_type
# default_range: string
classes:
RawSource:
description: >-
Unprocessed web resource capture containing original content, retrieval metadata, and extraction artifacts for enrichment pipelines.
alt_descriptions:
nl: >-
Onverwerkte webresource-opname met originele inhoud, ophaal-metadata en extractie-artefacten voor verrijkingspijplijnen.
de: >-
Unverarbeitete Webressourcenerfassung mit Originalinhalt, Abrufmetadaten und Extraktionsartefakten für Anreicherungspipelines.
fr: >-
Capture de ressource web non traitée contenant le contenu original, les métadonnées de récupération et les artefacts d'extraction pour les pipelines d'enrichissement.
es: >-
Captura de recurso web sin procesar que contiene contenido original, metadatos de recuperación y artefactos de extracción para tuberías de enriquecimiento.
ar: >-
التقاط مورد ويب غير معالج يحتوي على المحتوى الأصلي وبيانات الاسترجاع الوصفية ومصنوعات الاستخراج لخطوط إثراء البيانات.
id: >-
Tangkapan sumber daya web yang tidak diproses berisi konten asli, metadata pengambilan, dan artefak ekstraksi untuk pipeline pengayaan.
zh: >-
包含原始内容、检索元数据和提取工件用于丰富流程的未处理网络资源捕获。
structured_aliases:
- literal_form: ruwe bron
predicate: EXACT_SYNONYM
in_language: nl
- literal_form: onverwerkte opname
predicate: EXACT_SYNONYM
in_language: nl
- literal_form: Rohquelle
predicate: EXACT_SYNONYM
in_language: de
- literal_form: unbearbeitete Erfassung
predicate: EXACT_SYNONYM
in_language: de
- literal_form: source brute
predicate: EXACT_SYNONYM
in_language: fr
- literal_form: capture non traitée
predicate: EXACT_SYNONYM
in_language: fr
- literal_form: fuente sin procesar
predicate: EXACT_SYNONYM
in_language: es
- literal_form: captura sin procesar
predicate: EXACT_SYNONYM
in_language: es
- literal_form: المصدر الخام
predicate: EXACT_SYNONYM
in_language: ar
- literal_form: الالتقاط غير المعالج
predicate: EXACT_SYNONYM
in_language: ar
- literal_form: sumber mentah
predicate: EXACT_SYNONYM
in_language: id
- literal_form: tangkapan tidak diproses
predicate: EXACT_SYNONYM
in_language: id
- literal_form: 原始来源
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: 未处理捕获
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: raw source
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: web capture
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: unprocessed
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: extraction
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: ingestion
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: prov:PrimarySource
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: pav:RetrievedFrom
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: schema:WebPage
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: prov:Entity
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: "Typically captures: URL, fetch timestamp, HTTP status, content hash, and extracted highlights/snippets."
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: "Ontology alignment: prov:PrimarySource for original fetched material; pav:RetrievedFrom for retrieval provenance; schema:WebPage for web-resource framing."
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: 'Preserved from prior description: Raw source record captured from the web for enrichment and provenance.'
predicate: EXACT_SYNONYM
in_language: zh
- literal_form: has_type
predicate: EXACT_SYNONYM
in_language: zh