- Deleted obsolete slot definitions: statement_summary, statement_text, statement_type, status_name, supersede_articles, supersede_condition, supersede_name, temporal_dynamics, total_amount, typical_contents, use_cases, was_acquired_through, was_fetched_at, was_retrieved_at. - Updated existing slot definitions for states_or_stated to enhance clarity and structure. - Introduced new classes: Article, ConditionofAccess, FinancialStatementType, MaximumQuantity, Series, Summary, Type, and their respective slots to improve schema organization and usability. - Added new slots: changes_or_changed_through, has_or_had_condition_of_access, has_or_had_heritage_type, is_or_was_part_of_series, is_or_was_retrieved_at, maximum_of_maximum to capture additional metadata and relationships.
129 lines
3.8 KiB
YAML
129 lines
3.8 KiB
YAML
id: https://nde.nl/ontology/hc/class/ConfidenceMethod
|
|
name: confidence_method_class
|
|
title: Confidence Method
|
|
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
hc: https://nde.nl/ontology/hc/
|
|
prov: http://www.w3.org/ns/prov#
|
|
schema: http://schema.org/
|
|
|
|
imports:
|
|
- linkml:types
|
|
- ../slots/has_or_had_type
|
|
- ../slots/has_or_had_description
|
|
- ../slots/has_or_had_identifier
|
|
|
|
default_prefix: hc
|
|
|
|
classes:
|
|
ConfidenceMethod:
|
|
description: >-
|
|
A method or algorithm used to calculate confidence scores.
|
|
|
|
**USAGE**:
|
|
Documents how confidence values were computed:
|
|
- Fuzzy string matching algorithms
|
|
- ML model predictions
|
|
- Rule-based validation
|
|
- XPath match verification
|
|
- Human assessment
|
|
|
|
**COMMON METHODS**:
|
|
| Method | Description |
|
|
|--------|-------------|
|
|
| fuzzy_matching | Levenshtein, Jaro-Winkler, etc. |
|
|
| xpath_validation | XPath match confidence |
|
|
| llm_classification | LLM-based entity classification |
|
|
| ml_prediction | Machine learning model output |
|
|
| human_assessment | Manual quality assessment |
|
|
| ensemble | Combined multiple methods |
|
|
|
|
class_uri: prov:Plan
|
|
|
|
exact_mappings:
|
|
- prov:Plan
|
|
|
|
close_mappings:
|
|
- schema:HowTo
|
|
|
|
slots:
|
|
- has_or_had_type
|
|
- has_or_had_description
|
|
- has_or_had_identifier
|
|
|
|
slot_usage:
|
|
has_or_had_type:
|
|
range: string
|
|
required: true
|
|
description: >-
|
|
Type of confidence calculation method.
|
|
examples:
|
|
- value: "fuzzy_matching"
|
|
description: String similarity algorithm
|
|
- value: "ml_prediction"
|
|
description: Machine learning model
|
|
- value: "human_assessment"
|
|
description: Manual human review
|
|
|
|
has_or_had_description:
|
|
range: string
|
|
required: false
|
|
description: >-
|
|
Detailed description of the method implementation.
|
|
examples:
|
|
- value: "Jaro-Winkler similarity with 0.7 threshold"
|
|
description: Fuzzy matching configuration
|
|
|
|
has_or_had_identifier:
|
|
range: string
|
|
required: false
|
|
description: >-
|
|
Identifier for the method (e.g., algorithm name, model version).
|
|
examples:
|
|
- value: "rapidfuzz-2.15.1"
|
|
description: Library version
|
|
|
|
attributes:
|
|
threshold:
|
|
range: float
|
|
description: >-
|
|
Confidence threshold used by this method for accept/reject decisions.
|
|
examples:
|
|
- value: 0.85
|
|
description: 85% threshold for acceptance
|
|
|
|
is_deterministic:
|
|
range: boolean
|
|
description: >-
|
|
Whether the method produces consistent results for same inputs.
|
|
|
|
annotations:
|
|
custodian_types: '["*"]'
|
|
custodian_types_rationale: >-
|
|
Confidence methods apply universally to data quality assessment.
|
|
custodian_types_primary: "*"
|
|
specificity_score: 0.25
|
|
specificity_rationale: >-
|
|
Low specificity - fundamental methodology documentation.
|
|
|
|
examples:
|
|
- value:
|
|
has_or_had_type: "fuzzy_matching"
|
|
has_or_had_description: "Levenshtein distance with ratio normalization"
|
|
has_or_had_identifier: "rapidfuzz-levenshtein"
|
|
threshold: 0.85
|
|
is_deterministic: true
|
|
description: Fuzzy string matching method
|
|
|
|
- value:
|
|
has_or_had_type: "llm_classification"
|
|
has_or_had_description: "GPT-4 based entity type classification"
|
|
has_or_had_identifier: "gpt-4-turbo-2024-04-09"
|
|
is_deterministic: false
|
|
description: LLM-based classification method
|
|
|
|
comments:
|
|
- Created from slot_fixes.yaml migration (2026-01-19)
|
|
- Documents confidence calculation methodology
|
|
- Used with ConfidenceScore class
|