glam/schemas/20251121/linkml/modules/classes/SocialMediaPost.yaml
kempersc fc405445c6 Refactor and update schema definitions
- Removed obsolete slots: `has_or_had_custodian_observation`, `provider`, and `specificity_annotation`.
- Updated `has_or_had_score` slot to use `SpecificityScore` class and modified its description and examples.
- Added new slots: `end_seconds`, `end_time`, `has_archive_path`, `has_or_had_custodian_name`, `protocol_name`, and `protocol_version`.
- Introduced a script `check_annotation_types.py` to validate the presence and structure of `custodian_types` in YAML files.
- Added a script `update_specificity.py` to automate the migration from `SpecificityAnnotation` to `SpecificityScore`.
2026-02-01 19:55:38 +01:00

id: https://nde.nl/ontology/hc/class/SocialMediaPost
name: social_media_post_class
title: Social Media Post Class
imports:
- linkml:types
- ../slots/content_category
- ../slots/has_or_had_description
- ../slots/has_or_had_endpoint
- ../slots/has_or_had_label
- ../slots/has_or_had_score
- ../slots/has_or_had_url
- ../slots/is_official_content
- ../slots/is_or_was_categorized_as
- ../slots/is_or_was_last_updated_at
- ../slots/is_or_was_published_at
- ../slots/language
- ../slots/platform_type
- ../slots/post_id
- ../slots/post_url
- ../slots/posted_by_profile
- ../slots/retrieval_timestamp
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  schema: http://schema.org/
  foaf: http://xmlns.com/foaf/0.1/
  dcterms: http://purl.org/dc/terms/
  prov: http://www.w3.org/ns/prov#
  crm: http://www.cidoc-crm.org/cidoc-crm/
  skos: http://www.w3.org/2004/02/skos/core#
  as: https://www.w3.org/ns/activitystreams#
default_prefix: hc
classes:
  SocialMediaPost:
    class_uri: as:Object
    abstract: false
    description: "Concrete class for social media posts/content items.\n\n**DEFINITION**:\n\nSocialMediaPost represents a discrete piece of content published on a social media\nplatform. This includes videos, images, text posts, stories, carousels, threads,\nand other content types. Each post is published by a SocialMediaProfile (account/channel).\n\n**CRITICAL: MULTIVALUED POST TYPES**:\n\nA single SocialMediaPost can have **multiple** post types via the `post_types` slot.\nThis reflects the reality that social media content often combines formats:\n\n| Post Example | Primary Type | Secondary Types |\n|--------------|--------------|-----------------|\n| Instagram carousel with videos | CarouselPost | ImagePost, VideoPost |\n| Twitter thread with images | ThreadPost | TextPost, ImagePost |\n| YouTube Live archived as video | LiveStreamPost | VideoPost |\n| Podcast with video recording | AudioPost | VideoPost |\n| Story highlight (saved) | StoryPost\
      \ | ImagePost, VideoPost |\n\n**TYPE ORDERING CONVENTION**:\n- First type in list = primary/dominant format\n- Subsequent types = secondary characteristics\n- Order matters for display and categorization\n\n**CRITICAL: POST vs PROFILE vs PLATFORM**:\n\n| Class | Represents | Example | Cardinality |\n|-------|------------|---------|-------------|\n| SocialMediaPlatformType | Platform category | YouTube | ~25 types |\n| SocialMediaProfile | Account/channel | @rijksmuseum | Thousands |\n| **SocialMediaPost** | Individual content | A video, post | Millions |\n\n**ACTIVITY STREAMS 2.0 ALIGNMENT**:\n\nMaps to `as:Object` from W3C Activity Streams 2.0 vocabulary:\n- as:Object is the base type for content in social media\n- Individual posts may also map to as:Video, as:Image, as:Note based on post_types\n- Enables federation with ActivityPub-based platforms (Mastodon, PeerTube)\n\n**HERITAGE INSTITUTION CONTEXT**:\n\nSocial media content from heritage institutions includes:\n\n1. **Official\
      \ content** (posted by the institution):\n - Collection highlights (ImagePost)\n - Exhibition announcements (TextPost)\n - Educational videos (VideoPost)\n - Behind-the-scenes content (ShortVideoPost, StoryPost)\n - Event recordings (LiveStreamPost \u2192 VideoPost)\n - Curator interviews (VideoPost, AudioPost)\n - Multi-image collection stories (CarouselPost)\n\n2. **Third-party content** (about the institution):\n - Visitor videos/photos\n - News coverage\n - Academic discussions\n - Review content\n\n3. **User-generated content** (mentions):\n - Tagged posts\n - Check-ins\n - Comments/discussions\n\n**PROVENANCE TRACKING**:\n\nContent items are OBSERVATIONAL data retrieved via:\n- Platform APIs (YouTube Data API, Twitter API, etc.)\n- Web scraping (when API unavailable)\n- Manual archival\n\nEach content record includes:\n- `retrieval_timestamp`: When content was fetched\n- `has_or_had_endpoint`: Which API/method was used\n\
      - `metrics_observed_date`: When engagement metrics were recorded\n\n**TEMPORAL CONSIDERATIONS**:\n\nContent has multiple temporal dimensions:\n- `is_or_was_published_at`: When originally posted\n- `is_or_was_last_updated_at`: Last modification by author\n- `retrieval_timestamp`: When we fetched it\n- `metrics_observed_date`: When metrics (views, likes) were recorded\n\nEngagement metrics change constantly; always record the observation timestamp.\n\n**SUBCLASSES FOR SPECIALIZED CONTENT**:\n\nWhile SocialMediaPost can represent any content with post_types,\nspecialized subclasses provide additional platform-specific slots:\n\n- **VideoPost**: YouTube, TikTok, Vimeo videos (duration, definition, captions)\n- **ImagePost**: Instagram, Pinterest posts (dimensions, alt_text) [future]\n- **TextPost**: Twitter/X, Mastodon posts (character_count) [future]\n- **StoryPost**: Instagram/Facebook stories (ephemeral, segments) [future]\n"
    exact_mappings:
    - as:Object
    close_mappings:
    - schema:CreativeWork
    - crm:E73_Information_Object
    related_mappings:
    - schema:SocialMediaPosting
    - dcterms:BibliographicResource
    slots:
    - has_or_had_endpoint
    - content_category
    - has_or_had_description
    - is_official_content
    - language
    - platform_type
    - post_id
    - post_url
    - posted_by_profile
    - is_or_was_published_at
    - retrieval_timestamp
    - is_or_was_categorized_as
    - has_or_had_score
    - has_or_had_url
    - has_or_had_label
    - is_or_was_last_updated_at
    slot_usage:
      post_id:
        range: string
        required: true
        identifier: true
        examples:
        - value: FbIoC-Owy-M
        - value: '1234567890123456789'
      post_url:
        range: uri
        required: true
        pattern: ^https?://
        examples:
        - value: https://www.youtube.com/watch?v=FbIoC-Owy-M
      platform_type:
        range: uriorcurie
        required: true
        inlined: false
        examples:
        - value: YouTube
      posted_by_profile:
        range: SocialMediaProfile
        required: false
        inlined: false
        examples:
        - value: https://nde.nl/ontology/hc/social-media/nationaal-onderduikmuseum-youtube
      has_or_had_label:
        range: string
        required: false
        examples:
        - value: "De Vrijheidsroute (aflevering 3) \u2013 Zevenaar, Duiven, Westervoort"
      is_or_was_published_at:
        range: PublicationEvent
        required: true
        inlined: true
        examples:
        - value:
            temporal_extent:
              begin_of_the_begin: '2025-07-30T18:05:15Z'
              end_of_the_begin: '2025-07-30T18:05:15Z'
              begin_of_the_end: '2025-07-30T18:05:15Z'
              end_of_the_end: '2025-07-30T18:05:15Z'
      is_or_was_last_updated_at:
        range: datetime
        required: false
        examples:
        - value: '2025-08-01T10:30:00Z'
      language:
        range: string
        required: false
        examples:
        - value: nl
      is_or_was_categorized_as:
        range: Tag
        multivalued: true
        inlined_as_list: true
        required: false
        examples:
        - value:
          - tag_value: 80 jaar vrijheid
          - tag_value: wo2
          - tag_value: vrijheidsroute
      has_or_had_url:
        range: uri
        required: false
        examples:
        - value: https://i.ytimg.com/vi/FbIoC-Owy-M/hqdefault.jpg
      is_official_content:
        range: boolean
        required: false
        ifabsent: 'true'
        examples:
        - value: true
      content_category:
        range: string
        required: false
        examples:
        - value: '22'
      retrieval_timestamp:
        range: datetime
        required: true
        examples:
        - value: '2025-12-01T23:16:22.294232+00:00'
      has_or_had_endpoint:
        range: APIEndpoint
        inlined: true
        examples:
        - value:
            has_or_had_url: https://www.googleapis.com/youtube/v3
            has_or_had_description: YouTube Data API v3
    comments:
    - Concrete class for social media posts
    - post_types is MULTIVALUED - a post can have multiple content types
    - First type in post_types list is the primary format
    - Use specialized subclasses (VideoPost) for platform-specific properties
    - Activity Streams 2.0 alignment enables ActivityPub federation
    - Metrics are observational - always include retrieval_timestamp
    see_also:
    - https://www.w3.org/ns/activitystreams#Object
    - https://schema.org/CreativeWork
    - https://schema.org/SocialMediaPosting
    annotations:
      specificity_score: 0.1
      specificity_rationale: Generic utility class/slot created during migration
      custodian_types: "['*']"
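# Illustrative sketch only (not part of the schema): roughly how one
# SocialMediaPost instance might look under the slot_usage defined above.
# Every value is copied from the slot examples in this file. The instance
# layout is an assumption and has not been validated against the schema.
# It is kept as comments so that this schema file remains valid YAML.
#
# post_id: FbIoC-Owy-M
# post_url: https://www.youtube.com/watch?v=FbIoC-Owy-M
# platform_type: YouTube
# posted_by_profile: https://nde.nl/ontology/hc/social-media/nationaal-onderduikmuseum-youtube
# has_or_had_label: "De Vrijheidsroute (aflevering 3) \u2013 Zevenaar, Duiven, Westervoort"
# language: nl
# is_official_content: true
# content_category: '22'
# has_or_had_url: https://i.ytimg.com/vi/FbIoC-Owy-M/hqdefault.jpg
# # Publication time is an inlined PublicationEvent, not a bare datetime.
# is_or_was_published_at:
#   temporal_extent:
#     begin_of_the_begin: '2025-07-30T18:05:15Z'
#     end_of_the_begin: '2025-07-30T18:05:15Z'
#     begin_of_the_end: '2025-07-30T18:05:15Z'
#     end_of_the_end: '2025-07-30T18:05:15Z'
# is_or_was_last_updated_at: '2025-08-01T10:30:00Z'
# # Tags are multivalued and inlined as a list.
# is_or_was_categorized_as:
# - tag_value: 80 jaar vrijheid
# - tag_value: wo2
# - tag_value: vrijheidsroute
# # Provenance: when the record was fetched and through which endpoint.
# retrieval_timestamp: '2025-12-01T23:16:22.294232+00:00'
# has_or_had_endpoint:
#   has_or_had_url: https://www.googleapis.com/youtube/v3
#   has_or_had_description: YouTube Data API v3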