Schema enhancements (443 files): - Add class_uri with proper ontology references (schema:, prov:, skos:, rico:) - Add close_mappings, related_mappings per Rule 50 convention - Replace stub hc: slot_uri with standard predicates (dcterms:identifier, skos:prefLabel) - Improve descriptions with ontology mapping rationale - Add prefixes blocks to all schema modules Entity Resolution improvements: - Add entity_resolution module with email semantics parsing - Enhance build_entity_resolution.py with email-based matching signals - Extend Entity Review API with filtering by signal types and count - Add candidates caching and indexing for performance - Add ReviewLoginPage component New rules and documentation: - Add Rule 51: No Hallucinated Ontology References - Add .opencode/rules/no-hallucinated-ontology-references.md - Add .opencode/rules/slot-ontology-mapping-reference.md - Add adms.ttl and dqv.ttl ontology files Frontend ontology support: - Add RiC-O_1-1.rdf and schemaorg.owl to public/ontology
79 lines
2.6 KiB
YAML
79 lines
2.6 KiB
YAML
# WebClaimsBlock - Claims extracted from archived websites
|
|
# Extracted from custodian_source.yaml per Rule 38 (modular schema files)
|
|
# Extraction date: 2026-01-08
|
|
|
|
id: https://nde.nl/ontology/hc/classes/WebClaimsBlock
|
|
name: WebClaimsBlock
|
|
title: WebClaimsBlock
|
|
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
hc: https://nde.nl/ontology/hc/
|
|
schema: http://schema.org/
|
|
prov: http://www.w3.org/ns/prov#
|
|
xsd: http://www.w3.org/2001/XMLSchema#
|
|
|
|
imports:
|
|
- linkml:types
|
|
|
|
|
|
- ./InvalidWebClaim
|
|
- ./LayoutMetadata
|
|
- ./ValidationMetadata
|
|
- ./WebClaim
|
|
default_range: string
|
|
|
|
classes:
|
|
WebClaimsBlock:
|
|
description: >-
|
|
Container for claims extracted from archived websites, including valid
|
|
claims, removed invalid claims, validation metadata, and layout
|
|
integration status. Serves as a collection of web observations with
|
|
full provenance tracking.
|
|
|
|
Ontology mapping rationale:
|
|
- class_uri is prov:Collection because this aggregates multiple WebClaim
|
|
entities extracted from the same source archive
|
|
- close_mappings includes schema:Collection as a general collection type
|
|
- related_mappings includes prov:Entity as this block is itself an entity
|
|
with extraction timestamp and provenance
|
|
class_uri: prov:Collection
|
|
close_mappings:
|
|
- schema:Collection
|
|
related_mappings:
|
|
- prov:Entity
|
|
attributes:
|
|
extraction_timestamp:
|
|
range: datetime
|
|
source_archive:
|
|
range: string
|
|
description: Path to source archive directory
|
|
claims_count:
|
|
range: integer
|
|
claim:
|
|
range: WebClaim
|
|
multivalued: true
|
|
inlined_as_list: true
|
|
removed_invalid_claims:
|
|
range: InvalidWebClaim
|
|
multivalued: true
|
|
inlined_as_list: true
|
|
description: Claims removed due to validation failures
|
|
validation_metadata:
|
|
range: ValidationMetadata
|
|
description: Metadata about the validation process
|
|
layout_integrated:
|
|
range: boolean
|
|
description: Whether layout integration has been performed
|
|
layout_integration_timestamp:
|
|
range: datetime
|
|
description: When layout integration was performed
|
|
layout_metadata:
|
|
range: LayoutMetadata
|
|
description: Metadata about layout extraction
|
|
claims_migrated:
|
|
range: boolean
|
|
description: Whether claims have been migrated to new format
|
|
migration_timestamp:
|
|
range: datetime
|
|
description: When migration was performed
|