glam/schemas/20251121/linkml/modules/classes/VideoTextContent.yaml
kempersc 66adec257e Add scripts for normalizing LinkML schemas and validating schema integrity
- Implement `normalize_linkml_alt_descriptions.py` to convert structured alt_descriptions to the expected scalar form.
- Implement `normalize_linkml_structured_aliases.py` to flatten language-keyed structured_aliases into a standard list-of-objects format.
- Implement `validate_linkml_schema_integrity.py` to validate the integrity of LinkML schema bundles, checking for import resolution, YAML parsing, and reference existence.
2026-02-16 10:16:51 +01:00

id: https://nde.nl/ontology/hc/class/VideoTextContent
name: video_text_content_class
title: Video Text Content Class
imports:
- linkml:types
- ../enums/GenerationMethodEnum
- ../slots/has_title
- ../slots/has_language
- ../slots/has_quantity
- ../slots/has_score
- ../slots/generated_by
- ../slots/verified_by
- ../slots/provided_by
- ../slots/has_confidence_measure
- ../slots/has_time_interval
- ../slots/has_source
- ../slots/temporal_extent
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  schema: http://schema.org/
  dcterms: http://purl.org/dc/terms/
  prov: http://www.w3.org/ns/prov#
  crm: http://www.cidoc-crm.org/cidoc-crm/
  skos: http://www.w3.org/2004/02/skos/core#
  oa: http://www.w3.org/ns/oa#
default_prefix: hc
classes:
  VideoTextContent:
    class_uri: crm:E73_Information_Object
    abstract: true
    alt_descriptions:
      # Text values are quoted: an unquoted comma inside a flow mapping ends the scalar.
      nl: {text: "Abstracte basisklasse voor uit video afgeleide tekstuele inhoud (transcript, ondertitels, annotaties).", language: nl}
      de: {text: "Abstrakte Basisklasse fuer aus Videos abgeleitete textuelle Inhalte (Transkripte, Untertitel, Annotationen).", language: de}
      fr: {text: "Classe de base abstraite pour le contenu textuel derive de la video (transcriptions, sous-titres, annotations).", language: fr}
      es: {text: "Clase base abstracta para contenido textual derivado de video (transcripciones, subtitulos, anotaciones).", language: es}
      ar: {text: "فئة أساس مجردة للمحتوى النصي المشتق من الفيديو (تفريغ، ترجمات، تعليقات).", language: ar}
      id: {text: "Kelas dasar abstrak untuk konten teks turunan video (transkrip, subtitle, anotasi).", language: id}
      zh: {text: "视频衍生文本内容(转录、字幕、注释)的抽象基类。", language: zh}
    structured_aliases:
      nl: [{literal_form: videotekstinhoud, language: nl}]
      de: [{literal_form: Video-Textinhalt, language: de}]
      fr: [{literal_form: contenu texte video, language: fr}]
      es: [{literal_form: contenido textual de video, language: es}]
      ar: [{literal_form: محتوى نص الفيديو, language: ar}]
      id: [{literal_form: konten teks video, language: id}]
      zh: [{literal_form: 视频文本内容, language: zh}]
    description: >-
      Abstract base class for textual content derived from video, such as
      transcripts, subtitles, and annotations.
    exact_mappings:
    - crm:E73_Information_Object
    close_mappings:
    - prov:Entity
    related_mappings:
    - schema:CreativeWork
    - dcterms:Text
    slots:
    - has_language
    - has_title
    - generated_by
    - temporal_extent
    - verified_by
    - provided_by
    - has_confidence_measure
    - has_time_interval
    - has_source
    - has_score
    - has_quantity
    slot_usage:
      has_source:
        range: string
        required: false
        examples:
        - value: FbIoC-Owy-M
        - value: https://www.youtube.com/watch?v=FbIoC-Owy-M
      has_language:
        # range: string
        required: true
        inlined: false # Fixed invalid inline for primitive type
        multivalued: true
        examples:
        - value:
            language_name: "Dutch"
        - value:
            language_name: "English"
      has_title:
        # range: string
        required: false
        examples:
        - value: De Vrijheidsroute Ep.3 - Dutch Transcript
        - value: YouTube Auto-captions
        - value: manual:curator@rijksmuseum.nl
      generated_by:
        range: GenerationEvent
        required: true
        inlined: true
        examples:
        - value:
            has_methodology:
              methodology_type: ASR_AUTOMATIC
              has_label: Automatic Speech Recognition
        - value:
            has_methodology:
              methodology_type: MANUAL_TRANSCRIPTION
              has_label: Manual Transcription
      temporal_extent:
        range: TimeSpan
        inlined: true
        required: false
        examples:
        - value:
            begin_of_the_begin: '2025-12-02T15:00:00Z'
      provided_by:
        # range: string
        required: false
        examples:
        - value: OpenAI
        - value: Google Cloud
      has_confidence_measure:
        range: float
        required: false
        minimum_value: 0.0
        maximum_value: 1.0
        examples:
        - value: 0.92
      verified_by:
        range: Verifier
        required: false
        inlined: true
        examples:
        - value:
            has_name: curator@rijksmuseum.nl
      has_time_interval:
        range: float
        required: false
        minimum_value: 0.0
        examples:
        - value: 45.3
      has_quantity:
        range: integer
        required: false
        multivalued: true
        inlined: false # Fixed invalid inline for primitive type
        inlined_as_list: false # Fixed invalid inline for primitive type
        examples:
        - value: 1523
        - value: 8742
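    # Illustrative only: the per-slot examples above combined into one
    # instance record. This is a sketch, not data from the repository.
    # VideoTextContent is abstract, so a real record would be typed as a
    # concrete subclass; how that type is declared is not shown here.
    # has_language mirrors the example form above, even though the slot
    # sets inlined: false.
    #
    #   has_source: FbIoC-Owy-M
    #   has_language:
    #   - language_name: "Dutch"
    #   has_title: De Vrijheidsroute Ep.3 - Dutch Transcript
    #   generated_by:
    #     has_methodology:
    #       methodology_type: ASR_AUTOMATIC
    #       has_label: Automatic Speech Recognition
    #   temporal_extent:
    #     begin_of_the_begin: '2025-12-02T15:00:00Z'
    #   provided_by: OpenAI
    #   has_confidence_measure: 0.92   # must lie within 0.0 to 1.0
    #   verified_by:
    #     has_name: curator@rijksmuseum.nl
    #   has_time_interval: 45.3        # must be >= 0.0
    #   has_quantity:
    #   - 1523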
    comments:
    - Abstract base for all video-derived text content
    - Comprehensive PROV-O provenance tracking
    - Confidence scoring for AI-generated content
    - Verification workflow support
    - Critical for heritage accessibility and discovery
    see_also:
    - https://www.w3.org/TR/prov-o/
    - http://www.cidoc-crm.org/cidoc-crm/E73_Information_Object
    annotations:
      specificity_score: 0.1
      specificity_rationale: Generic utility class/slot created during migration
      custodian_types: "['*']"
      modeling_notes: |
        Typical subclasses
        - VideoTranscript (audio-derived text)
        - VideoSubtitle (time-coded caption tracks)
        - VideoAnnotation (visual/multimodal derived descriptions)
        Provenance expectations
        Derived text should record source, generation method/agent, timestamp,
        and confidence when available.
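# Illustrative only: how a concrete subclass named in modeling_notes might
# extend this abstract class from its own module. The class body below is an
# assumption for illustration, not the repository's actual VideoTranscript
# definition; the stricter has_confidence_measure rule is hypothetical.
#
# classes:
#   VideoTranscript:
#     is_a: VideoTextContent          # inherits the slots and slot_usage above
#     description: Audio-derived text for a video.
#     slot_usage:
#       has_confidence_measure:
#         required: true              # hypothetical tightening of the inherited slot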