- Removed obsolete slots: `has_or_had_custodian_observation`, `provider`, and `specificity_annotation`.
- Updated the `has_or_had_score` slot to use the `SpecificityScore` class and revised its description and examples.
- Added new slots: `end_seconds`, `end_time`, `has_archive_path`, `has_or_had_custodian_name`, `protocol_name`, and `protocol_version`.
- Introduced a script, `check_annotation_types.py`, to validate the presence and structure of `custodian_types` in YAML files.
- Added a script, `update_specificity.py`, to automate the migration from `SpecificityAnnotation` to `SpecificityScore`.
179 lines
8.3 KiB
YAML
id: https://nde.nl/ontology/hc/class/VideoSubtitle
name: video_subtitle_class
title: Video Subtitle Class
imports:
- linkml:types
- ../enums/SubtitleFormatEnum
- ../enums/SubtitlePositionEnum
- ../slots/has_or_had_alignment
- ../slots/has_or_had_caption
- ../slots/has_or_had_format
- ../slots/has_or_had_identifier
- ../slots/has_or_had_label
- ../slots/has_or_had_mean
- ../slots/has_or_had_quantity
- ../slots/has_or_had_score
- ../slots/has_or_had_segment
- ../slots/has_or_had_unit
- ../slots/includes_music_description
- ../slots/includes_sound_description
- ../slots/includes_speaker_identification
- ../slots/includes_timestamp
- ../slots/is_closed_caption
- ../slots/is_or_was_created_through
- ../slots/is_sdh
- ../slots/raw_subtitle_content
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  schema: http://schema.org/
  dcterms: http://purl.org/dc/terms/
  prov: http://www.w3.org/ns/prov#
  crm: http://www.cidoc-crm.org/cidoc-crm/
  skos: http://www.w3.org/2004/02/skos/core#
  ma: http://www.w3.org/ns/ma-ont#
  rdfs: http://www.w3.org/2000/01/rdf-schema#
  org: http://www.w3.org/ns/org#
  xsd: http://www.w3.org/2001/XMLSchema#
default_prefix: hc
classes:
  VideoSubtitle:
    is_a: VideoTranscript
    class_uri: hc:VideoSubtitle
    abstract: false
    description: |
      Time-coded caption/subtitle content for video.

      **DEFINITION**:

      VideoSubtitle represents caption/subtitle tracks that provide time-coded
      text synchronized with video playback. It extends VideoTranscript because
      subtitles contain complete transcription PLUS temporal synchronization.

      **INHERITANCE FROM VideoTranscript**:

      VideoSubtitle inherits all transcript capabilities:
      - `full_text`: Complete subtitle text concatenated
      - `has_or_had_segment`: Time-coded entries (REQUIRED for subtitles)
      - `includes_timestamp`: Always true for subtitles
      - `content_language`: Language of subtitle text
      - All provenance from VideoTextContent

      And adds subtitle-specific properties:
      - `has_or_had_format`: SRT, VTT, TTML, SBV, ASS
      - `is_closed_caption`: CC vs regular subtitles
      - `is_sdh`: Subtitles for Deaf/Hard-of-Hearing
      - `includes_sound_description`: Non-speech audio descriptions

      **SCHEMA.ORG ALIGNMENT**:

      Maps to the `schema:caption` property:
      > "For downloadable machine formats (closed caption, subtitles etc.)
      > use the MediaObject.encodingFormat property."

      **SUBTITLE vs CAPTION vs TRANSCRIPT**:

      | Type | Time-coded | Purpose | Audience |
      |------|------------|---------|----------|
      | Transcript | Optional | Reading, search | Everyone |
      | Subtitle | Required | Language translation | Hearing viewers |
      | Caption (CC) | Required | Accessibility | Deaf/HoH viewers |
      | SDH | Required | Full accessibility | Deaf viewers, noisy environments |

      **SDH (Subtitles for Deaf/Hard-of-Hearing)**:

      SDH differs from regular subtitles by including:
      - Speaker identification: "(John) Hello"
      - Sound effects: "[door slams]", "[music playing]"
      - Music descriptions: "♪ upbeat jazz ♪"
      - Emotional cues: "[laughing]", "[whispering]"

      **SUBTITLE FORMATS**:

      | Format | Extension | Features | Use Case |
      |--------|-----------|----------|----------|
      | SRT | .srt | Simple, universal | Most video players |
      | VTT | .vtt | W3C standard, styling | HTML5 video, web |
      | TTML | .ttml/.dfxp | XML, rich styling | Broadcast, streaming |
      | SBV | .sbv | YouTube native | YouTube uploads |
      | ASS | .ass | Advanced styling | Anime, complex layouts |

      **SRT FORMAT EXAMPLE**:

      ```
      1
      00:00:00,000 --> 00:00:03,500
      Welcome to the Rijksmuseum.

      2
      00:00:03,500 --> 00:00:08,200
      Today we'll explore the Night Watch gallery.
      ```

      **VTT FORMAT EXAMPLE**:

      ```
      WEBVTT

      00:00:00.000 --> 00:00:03.500
      Welcome to the Rijksmuseum.

      00:00:03.500 --> 00:00:08.200
      Today we'll explore the Night Watch gallery.
      ```

      **HERITAGE INSTITUTION CONTEXT**:

      Subtitles are critical for heritage video accessibility:

      1. **Accessibility Compliance**: WCAG 2.1, Section 508
      2. **Multilingual Access**: Translate for international audiences
      3. **Silent Viewing**: Social media, public displays, quiet spaces
      4. **Search Discovery**: Subtitle text is indexed by platforms
      5. **Preservation**: Text outlasts video format obsolescence

      **YOUTUBE API INTEGRATION**:

      Subtitle tracks from the YouTube API populate:
      - `has_or_had_format`: Typically VTT or SRT
      - `generation_method`: PLATFORM_PROVIDED or ASR_AUTOMATIC
      - `content_language`: From track language code
      - `is_or_was_created_through`: YouTube auto-caption flag

      **SEGMENTS ARE REQUIRED**:

      Unlike VideoTranscript, where segments are optional, VideoSubtitle
      REQUIRES the `has_or_had_segment` slot to be populated with VideoTimeSegment
      entries that include start_seconds, end_seconds, and segment_text.
    close_mappings:
    - schema:caption
    - ma:CaptioningFormat
    related_mappings:
    - schema:transcript
    slots:
    - has_or_had_mean
    - has_or_had_unit
    - has_or_had_caption
    - has_or_had_alignment
    - has_or_had_quantity
    - includes_music_description
    - includes_sound_description
    - includes_speaker_identification
    - is_or_was_created_through
    - is_closed_caption
    - is_sdh
    - raw_subtitle_content
    - has_or_had_format
    - has_or_had_score
    - has_or_had_identifier
    - has_or_had_label
    slot_usage:
      has_or_had_segment:
        required: true
      includes_timestamp:
        ifabsent: 'true'
      has_or_had_format:
        range: SubtitleFormatEnum
        required: true
        examples:
        - value: VTT
        - value: SRT
      raw_subtitle_content:
        range: string
        required: false
        examples:
        - value: |
            WEBVTT

            00:00:00.000 --> 00:00:03.500
            Welcome to the museum.
      is_closed_caption:
        range: boolean
        required: false
        ifabsent: 'false'
        examples:
        - value: true
      is_sdh:
        range: boolean
        required: false
        ifabsent: 'false'
        examples:
        - value: true
      includes_sound_description:
        range: boolean
        required: false
        ifabsent: 'false'
        examples:
        - value: true
      includes_music_description:
        range: boolean
        required: false
        ifabsent: 'false'
        examples:
        - value: true
      includes_speaker_identification:
        range: boolean
        required: false
        ifabsent: 'false'
        examples:
        - value: true
      is_or_was_created_through:
        range: boolean
        required: false
        ifabsent: 'false'
        examples:
        - value: true
      has_or_had_label:
        range: string
        required: false
        deprecated: Use has_or_had_identifier with TrackIdentifier range instead
        examples:
        - value: English (auto-generated)
      has_or_had_identifier:
        range: TrackIdentifier
        required: false
        inlined: true
        examples:
        - value: '{"platform": "YouTube", "has_or_had_code": "en.3OWxR1w4QfE"}'
      has_or_had_caption:
        range: Caption
        inlined: true
        required: false
      has_or_had_alignment:
        range: Alignment
        inlined: true
        required: false
      has_or_had_quantity:
        range: integer
        required: false
        inlined: true
        examples:
        - value:
            has_or_had_unit:
              has_or_had_label: entries
      has_or_had_mean:
        range: MeanValue
        inlined: true
        examples:
        - value:
            has_or_had_value: 3.2
            has_or_had_unit:
              has_or_had_label: seconds
    comments:
    - Time-coded caption/subtitle content
    - Extends VideoTranscript - subtitles ARE transcripts plus time codes
    - 'Supports multiple formats: SRT, VTT, TTML, SBV, ASS'
    - 'Accessibility metadata: CC, SDH, sound/music descriptions'
    - Critical for heritage video accessibility compliance
    see_also:
    - https://schema.org/caption
    - https://www.w3.org/TR/webvtt1/
    - https://developer.mozilla.org/en-US/docs/Web/API/WebVTT_API
    - https://www.3playmedia.com/learn/popular-topics/closed-captioning/
    annotations:
      specificity_score: 0.1
      specificity_rationale: Generic utility class/slot created during migration
      custodian_types: "['*']"
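# Illustrative sketch only: a hypothetical VideoSubtitle instance, not part of the schema.
# It shows how the two required slots (has_or_had_format, has_or_had_segment) might be
# populated. Slot names come from this file; the segment fields (start_seconds,
# end_seconds, segment_text) follow the class description, and the values mirror the
# SRT example there. It assumes has_or_had_segment holds a list of inlined
# VideoTimeSegment entries, which this file does not confirm.
#
# has_or_had_format: VTT
# is_closed_caption: false
# is_sdh: false
# has_or_had_segment:
# - start_seconds: 0.0
#   end_seconds: 3.5
#   segment_text: Welcome to the Rijksmuseum.
# - start_seconds: 3.5
#   end_seconds: 8.2
#   segment_text: Today we'll explore the Night Watch gallery.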