- Added detailed descriptions for slots: collecting_scope, collection_access, custody_history, education_level, membership_size, and publication_activity to improve clarity and usability. - Removed the publication_date slot due to migration to a new structure. - Updated slot fixes with migration notes and adjustments for various slots, ensuring alignment with new ontology standards. - Introduced new classes for DigitalPlatformV2, including DigitalPlatformV2DataQualityNotes, DigitalPlatformV2DataSource, DigitalPlatformV2KeyContact, DigitalPlatformV2OrganizationProfile, DigitalPlatformV2OrganizationStatus, DigitalPlatformV2PrimaryPlatform, DigitalPlatformV2Provenance, DigitalPlatformV2ServiceDetails, and DigitalPlatformV2TransformationMetadata, each with comprehensive attributes and descriptions. - Added classes for EnrichmentProvenance and EnrichmentProvenanceEntry to track provenance for enrichment sources, including detailed attributes for verification and source tracking. - Created LogoClaim, LogoEnrichment, and LogoEnrichmentSummary classes to manage logo and favicon data extracted from web scraping, with attributes for claims and summary statistics. - Archived the publication_date slot to maintain historical records.
57 lines
1.9 KiB
YAML
57 lines
1.9 KiB
YAML
# LogoEnrichment - Logo and favicon enrichment data from web scraping
|
|
# Created for CustodianSourceFile validation compatibility
|
|
# Creation date: 2026-01-18
|
|
|
|
id: https://nde.nl/ontology/hc/classes/LogoEnrichment
|
|
name: LogoEnrichment
|
|
title: LogoEnrichment
|
|
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
hc: https://nde.nl/ontology/hc/
|
|
schema: http://schema.org/
|
|
prov: http://www.w3.org/ns/prov#
|
|
xsd: http://www.w3.org/2001/XMLSchema#
|
|
|
|
imports:
|
|
- linkml:types
|
|
- ./LogoClaim
|
|
- ./LogoEnrichmentSummary
|
|
default_range: string
|
|
|
|
classes:
|
|
LogoEnrichment:
|
|
description: >-
|
|
Logo and favicon enrichment data extracted from institutional websites
|
|
via web scraping (crawl4ai). Captures logos, favicons, OpenGraph images,
|
|
and other visual brand assets with full provenance tracking.
|
|
|
|
Ontology mapping rationale:
|
|
- class_uri is prov:Entity because this represents enrichment data
|
|
with provenance (timestamps, methods, sources)
|
|
- close_mappings includes schema:ImageObject as the extracted data
|
|
relates to image/logo assets
|
|
class_uri: prov:Entity
|
|
close_mappings:
|
|
- schema:ImageObject
|
|
related_mappings:
|
|
- prov:Derivation
|
|
- schema:Brand
|
|
attributes:
|
|
enrichment_timestamp:
|
|
range: datetime
|
|
description: When logo enrichment was performed
|
|
source_url:
|
|
range: uri
|
|
description: URL of the webpage that was scraped for logos
|
|
extraction_method:
|
|
range: string
|
|
description: Method used to extract logos (crawl4ai, playwright, etc.)
|
|
claims:
|
|
range: LogoClaim
|
|
multivalued: true
|
|
inlined_as_list: true
|
|
description: Individual logo/favicon claims extracted from the page
|
|
summary:
|
|
range: LogoEnrichmentSummary
|
|
description: Summary statistics of logo extraction results
|