Schema enhancements (443 files): - Add class_uri with proper ontology references (schema:, prov:, skos:, rico:) - Add close_mappings, related_mappings per Rule 50 convention - Replace stub hc: slot_uri with standard predicates (dcterms:identifier, skos:prefLabel) - Improve descriptions with ontology mapping rationale - Add prefixes blocks to all schema modules Entity Resolution improvements: - Add entity_resolution module with email semantics parsing - Enhance build_entity_resolution.py with email-based matching signals - Extend Entity Review API with filtering by signal types and count - Add candidates caching and indexing for performance - Add ReviewLoginPage component New rules and documentation: - Add Rule 51: No Hallucinated Ontology References - Add .opencode/rules/no-hallucinated-ontology-references.md - Add .opencode/rules/slot-ontology-mapping-reference.md - Add adms.ttl and dqv.ttl ontology files Frontend ontology support: - Add RiC-O_1-1.rdf and schemaorg.owl to public/ontology
79 lines
2.5 KiB
YAML
79 lines
2.5 KiB
YAML
# InvalidWebClaim - Invalid claim with validation reason
|
|
# Extracted from custodian_source.yaml per Rule 38 (modular schema files)
|
|
# Extraction date: 2026-01-08
|
|
|
|
id: https://nde.nl/ontology/hc/classes/InvalidWebClaim
|
|
name: InvalidWebClaim
|
|
title: InvalidWebClaim
|
|
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
hc: https://nde.nl/ontology/hc/
|
|
schema: http://schema.org/
|
|
prov: http://www.w3.org/ns/prov#
|
|
xsd: http://www.w3.org/2001/XMLSchema#
|
|
rdf: http://www.w3.org/1999/02/22-rdf-syntax-ns#
|
|
dqv: http://www.w3.org/ns/dqv#
|
|
|
|
imports:
|
|
- linkml:types
|
|
|
|
|
|
default_range: string
|
|
|
|
classes:
|
|
InvalidWebClaim:
|
|
description: >-
|
|
A web claim that failed validation, preserving the original extracted
|
|
data along with the validation failure reason and rule violated.
|
|
Used for audit trails and quality assessment of extraction processes.
|
|
|
|
Ontology mapping rationale:
|
|
- class_uri is rdf:Statement because this represents a claim/assertion
|
|
that was made but found to be invalid - the statement structure is
|
|
preserved even though the content failed validation
|
|
- close_mappings includes dqv:QualityAnnotation as this documents a
|
|
quality issue with extracted data
|
|
- related_mappings includes prov:Entity for provenance tracking
|
|
class_uri: rdf:Statement
|
|
close_mappings:
|
|
- dqv:QualityAnnotation
|
|
related_mappings:
|
|
- prov:Entity
|
|
attributes:
|
|
claim_type:
|
|
range: string
|
|
description: Type of claim
|
|
claim_value:
|
|
range: string
|
|
description: Extracted value
|
|
raw_value:
|
|
range: string
|
|
description: Raw value before processing
|
|
source_url:
|
|
range: uri
|
|
description: URL claim was extracted from
|
|
retrieved_on:
|
|
range: datetime
|
|
description: When page was archived
|
|
xpath:
|
|
range: string
|
|
description: XPath to element
|
|
html_file:
|
|
range: string
|
|
description: Relative path to archived HTML
|
|
xpath_match_score:
|
|
range: float
|
|
description: Match score
|
|
extraction_method:
|
|
range: string
|
|
description: Extraction method
|
|
extraction_timestamp:
|
|
range: datetime
|
|
description: When claim was extracted
|
|
validation_reason:
|
|
range: string
|
|
description: Reason why claim was invalid
|
|
validation_rule:
|
|
range: string
|
|
description: Rule that was violated
|