glam/schemas/20251121/linkml/modules/classes/YoutubeTranscript.yaml
kempersc 7cf10084b4 Implement scripts for schema modifications and ontology verification
- Added `fix_dual_class_link.py` to remove dual class link references from specified YAML files.
- Created `fix_specific_ghosts.py` to apply specific replacements in YAML files based on defined mappings.
- Introduced `migrate_staff_count.py` to migrate staff count references to a new structure in specified YAML files.
- Developed `migrate_type_slots.py` to replace type-related slots with new identifiers across YAML files.
- Implemented `scan_ghost_references.py` to identify and report ghost references to archived slots and classes in YAML files.
- Added `verify_ontology_terms.py` to verify the presence of ontology terms in specified ontology files against schema definitions.
2026-01-29 17:10:25 +01:00

id: https://nde.nl/ontology/hc/classes/YoutubeTranscript
name: YoutubeTranscript
title: YoutubeTranscript
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  schema: http://schema.org/
  prov: http://www.w3.org/ns/prov#
  xsd: http://www.w3.org/2001/XMLSchema#
  oa: http://www.w3.org/ns/oa#
imports:
  - linkml:types
default_range: string
classes:
  YoutubeTranscript:
    description: |-
      YouTube video transcript including video ID, language, transcript type
      (manual or auto-generated), full text, and extraction metadata.

      Ontology mapping rationale:
      - class_uri is schema:Transcript because this represents a textual
        transcript of video/audio content.
      - close_mappings includes oa:TextualBody as the transcript text
        forms annotation body content.
      - related_mappings includes prov:Entity for provenance tracking and
        schema:MediaObject as transcripts derive from media.
    class_uri: schema:Transcript
    close_mappings:
      - oa:TextualBody
    related_mappings:
      - prov:Entity
      - schema:MediaObject
    attributes:
      video_id:
        range: string
      language:
        range: string
        description: Transcript language code
      transcript_type:
        range: string
        description: manual or auto-generated
      transcript_text:
        range: string
        description: Full transcript text
      extraction_method:
        range: string
        description: Method used to extract transcript
      transcript_length_chars:
        range: integer
        description: Length of transcript in characters
    annotations:
      specificity_score: 0.1
      specificity_rationale: Generic utility class/slot created during migration
      custodian_types: "['*']"
      custodian_types_rationale: Universal utility concept
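
# Illustrative instance (a sketch, not part of the schema): assuming instance
# data is keyed by the attribute names defined above, a YoutubeTranscript
# record might look like the following. Every value is a hypothetical
# placeholder; transcript_length_chars is the character count of the sample
# transcript_text.
#
#   video_id: EXAMPLE_ID0
#   language: en
#   transcript_type: auto-generated
#   transcript_text: Welcome to the museum tour.
#   extraction_method: example-extractor
#   transcript_length_chars: 27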