glam/frontend/public/schemas/20251121/linkml/modules/classes/WebClaimsBlock.yaml
kempersc 1fb924c412 feat: add ontology mappings to LinkML schema and enhance entity resolution
Schema enhancements (443 files):
- Add class_uri with proper ontology references (schema:, prov:, skos:, rico:)
- Add close_mappings, related_mappings per Rule 50 convention
- Replace stub hc: slot_uri with standard predicates (dcterms:identifier, skos:prefLabel)
- Improve descriptions with ontology mapping rationale
- Add prefixes blocks to all schema modules

Entity Resolution improvements:
- Add entity_resolution module with email semantics parsing
- Enhance build_entity_resolution.py with email-based matching signals
- Extend Entity Review API with filtering by signal types and count
- Add candidates caching and indexing for performance
- Add ReviewLoginPage component

New rules and documentation:
- Add Rule 51: No Hallucinated Ontology References
- Add .opencode/rules/no-hallucinated-ontology-references.md
- Add .opencode/rules/slot-ontology-mapping-reference.md
- Add adms.ttl and dqv.ttl ontology files

Frontend ontology support:
- Add RiC-O_1-1.rdf and schemaorg.owl to public/ontology
2026-01-13 13:51:02 +01:00


# WebClaimsBlock - Claims extracted from archived websites
# Extracted from custodian_source.yaml per Rule 38 (modular schema files)
# Extraction date: 2026-01-08
id: https://nde.nl/ontology/hc/classes/WebClaimsBlock
name: WebClaimsBlock
title: WebClaimsBlock
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  schema: http://schema.org/
  prov: http://www.w3.org/ns/prov#
  xsd: http://www.w3.org/2001/XMLSchema#
imports:
  - linkml:types
  - ./InvalidWebClaim
  - ./LayoutMetadata
  - ./ValidationMetadata
  - ./WebClaim
default_range: string
classes:
  WebClaimsBlock:
    description: >-
      Container for claims extracted from archived websites, including valid
      claims, removed invalid claims, validation metadata, and layout
      integration status. Serves as a collection of web observations with
      full provenance tracking.
      Ontology mapping rationale:
      - class_uri is prov:Collection because this aggregates multiple WebClaim
      entities extracted from the same source archive
      - close_mappings includes schema:Collection as a general collection type
      - related_mappings includes prov:Entity as this block is itself an entity
      with extraction timestamp and provenance
    class_uri: prov:Collection
    close_mappings:
      - schema:Collection
    related_mappings:
      - prov:Entity
    attributes:
      extraction_timestamp:
        range: datetime
      source_archive:
        range: string
        description: Path to source archive directory
      claims_count:
        range: integer
      claim:
        range: WebClaim
        multivalued: true
        inlined_as_list: true
      removed_invalid_claims:
        range: InvalidWebClaim
        multivalued: true
        inlined_as_list: true
        description: Claims removed due to validation failures
      validation_metadata:
        range: ValidationMetadata
        description: Metadata about the validation process
      layout_integrated:
        range: boolean
        description: Whether layout integration has been performed
      layout_integration_timestamp:
        range: datetime
        description: When layout integration was performed
      layout_metadata:
        range: LayoutMetadata
        description: Metadata about layout extraction
      claims_migrated:
        range: boolean
        description: Whether claims have been migrated to new format
      migration_timestamp:
        range: datetime
        description: When migration was performed
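# ---------------------------------------------------------------------------
# Illustrative instance (a sketch, not taken from real data). It shows how a
# WebClaimsBlock using the attributes above might look. All values are
# hypothetical, and the fields inside each `claim` entry (claim_type,
# claim_value) are assumptions: WebClaim is defined in ./WebClaim, not here.
# Keeping claims_count equal to the number of `claim` entries is likewise an
# assumed convention, not something this schema enforces.
#
# extraction_timestamp: "2026-01-08T10:15:00+01:00"
# source_archive: path/to/archive          # hypothetical path
# claims_count: 2
# claim:
#   - claim_type: email                    # hypothetical WebClaim field
#     claim_value: info@example.org        # hypothetical WebClaim field
#   - claim_type: phone                    # hypothetical WebClaim field
#     claim_value: "+31 20 000 0000"       # hypothetical WebClaim field
# removed_invalid_claims: []
# layout_integrated: false
# claims_migrated: false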