- Added `fix_dual_class_link.py` to remove dual class link references from specified YAML files. - Created `fix_specific_ghosts.py` to apply specific replacements in YAML files based on defined mappings. - Introduced `migrate_staff_count.py` to migrate staff count references to a new structure in specified YAML files. - Developed `migrate_type_slots.py` to replace type-related slots with new identifiers across YAML files. - Implemented `scan_ghost_references.py` to identify and report ghost references to archived slots and classes in YAML files. - Added `verify_ontology_terms.py` to verify the presence of ontology terms in specified ontology files against schema definitions.
45 lines
1.7 KiB
YAML
45 lines
1.7 KiB
YAML
id: https://nde.nl/ontology/hc/classes/YoutubeTranscript
|
|
name: YoutubeTranscript
|
|
title: YoutubeTranscript
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
hc: https://nde.nl/ontology/hc/
|
|
schema: http://schema.org/
|
|
prov: http://www.w3.org/ns/prov#
|
|
xsd: http://www.w3.org/2001/XMLSchema#
|
|
oa: http://www.w3.org/ns/oa#
|
|
imports:
|
|
- linkml:types
|
|
default_range: string
|
|
classes:
|
|
YoutubeTranscript:
|
|
description: "YouTube video transcript including video ID, language, transcript type (manual or auto-generated), full text, and extraction metadata.\nOntology mapping rationale: - class_uri is schema:Transcript because this represents a textual\n transcript of video/audio content.\n- close_mappings includes oa:TextualBody as the transcript text\n forms annotation body content.\n- related_mappings includes prov:Entity for tracking and schema:MediaObject\n as transcripts derive from media."
|
|
class_uri: schema:Transcript
|
|
exact_mappings:
|
|
- oa:TextualBody
|
|
related_mappings:
|
|
- prov:Entity
|
|
- schema:MediaObject
|
|
attributes:
|
|
video_id:
|
|
range: string
|
|
language:
|
|
range: string
|
|
description: Transcript language code
|
|
transcript_type:
|
|
range: string
|
|
description: manual or auto-generated
|
|
transcript_text:
|
|
range: string
|
|
description: Full transcript text
|
|
extraction_method:
|
|
range: string
|
|
description: Method used to extract transcript
|
|
transcript_length_chars:
|
|
range: integer
|
|
description: Length of transcript in characters
|
|
annotations:
|
|
specificity_score: 0.1
|
|
specificity_rationale: Generic utility class/slot created during migration
|
|
custodian_types: "['*']"
|
|
custodian_types_rationale: Universal utility concept
|