glam/frontend/public/schemas/20251121/linkml/modules/classes/ConfidenceMethod.yaml

# ConfidenceMethod - Method used to calculate confidence scores
#
# Created per slot_fixes.yaml migration for: confidence_method
# Revision: has_or_had_method + ConfidenceMethod
# Creation date: 2026-01-19
id: https://nde.nl/ontology/hc/class/ConfidenceMethod
name: confidence_method_class
title: Confidence Method
prefixes:
  linkml: https://w3id.org/linkml/
  hc: https://nde.nl/ontology/hc/
  prov: http://www.w3.org/ns/prov#
  schema: http://schema.org/
imports:
- linkml:types
- ../slots/has_or_had_type
- ../slots/has_or_had_description
- ../slots/has_or_had_identifier
default_prefix: hc
classes:
  ConfidenceMethod:
    description: |-
      A method or algorithm used to calculate confidence scores.

      **USAGE**:

      Documents how confidence values were computed:

      - Fuzzy string matching algorithms
      - ML model predictions
      - Rule-based validation
      - XPath match verification
      - Human assessment

      **COMMON METHODS**:

      | Method | Description |
      |--------|-------------|
      | fuzzy_matching | Levenshtein, Jaro-Winkler, etc. |
      | xpath_validation | XPath match confidence |
      | llm_classification | LLM-based entity classification |
      | ml_prediction | Machine learning model output |
      | human_assessment | Manual quality assessment |
      | ensemble | Combination of multiple methods |
    class_uri: prov:Plan
    exact_mappings:
    - prov:Plan
    close_mappings:
    - schema:HowTo
    slots:
    - has_or_had_type
    - has_or_had_description
    - has_or_had_identifier
    slot_usage:
      has_or_had_type:
        range: string
        required: true
        description: >-
          Type of confidence calculation method.
        examples:
        - value: "fuzzy_matching"
          description: String similarity algorithm
        - value: "ml_prediction"
          description: Machine learning model
        - value: "human_assessment"
          description: Manual human review
      has_or_had_description:
        range: string
        required: false
        description: >-
          Detailed description of the method implementation.
        examples:
        - value: "Jaro-Winkler similarity with 0.7 threshold"
          description: Fuzzy matching configuration
      has_or_had_identifier:
        range: string
        required: false
        description: >-
          Identifier for the method (e.g., algorithm name, model version).
        examples:
        - value: "rapidfuzz-2.15.1"
          description: Library version
    attributes:
      threshold:
        range: float
        description: >-
          Confidence threshold used by this method for accept/reject decisions.
        examples:
        - value: 0.85
          description: 85% threshold for acceptance
      is_deterministic:
        range: boolean
        description: >-
          Whether the method produces the same result for the same inputs.
    annotations:
      custodian_types: '["*"]'
      custodian_types_rationale: >-
        Confidence methods apply universally to data quality assessment.
      custodian_types_primary: "*"
      specificity_score: 0.25
      specificity_rationale: >-
        Low specificity - fundamental methodology documentation.
    examples:
    - value:
        has_or_had_type: "fuzzy_matching"
        has_or_had_description: "Levenshtein distance with ratio normalization"
        has_or_had_identifier: "rapidfuzz-levenshtein"
        threshold: 0.85
        is_deterministic: true
      description: Fuzzy string matching method
    - value:
        has_or_had_type: "llm_classification"
        has_or_had_description: "GPT-4-based entity type classification"
        has_or_had_identifier: "gpt-4-turbo-2024-04-09"
        is_deterministic: false
      description: LLM-based classification method
    comments:
    - Created from slot_fixes.yaml migration (2026-01-19)
    - Documents confidence calculation methodology
    - Used with ConfidenceScore class
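
A consumer of this schema might mirror the class in code roughly as follows. This is a minimal sketch, not part of the schema: the Python dataclass, the `accepts` helper, the `>=` comparison, and the 0 to 1 range check are all assumptions. The schema itself states only that `has_or_had_type` is required and that `threshold` is used for accept/reject decisions.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ConfidenceMethod:
    """Hypothetical in-memory mirror of the ConfidenceMethod class."""

    has_or_had_type: str  # the only slot marked required in slot_usage
    has_or_had_description: Optional[str] = None
    has_or_had_identifier: Optional[str] = None
    threshold: Optional[float] = None
    is_deterministic: Optional[bool] = None

    def __post_init__(self) -> None:
        if not self.has_or_had_type:
            raise ValueError("has_or_had_type is required")
        # Assumption: thresholds lie on a 0..1 scale, as in the schema examples.
        if self.threshold is not None and not 0.0 <= self.threshold <= 1.0:
            raise ValueError("threshold must be between 0 and 1")

    def accepts(self, score: float) -> Optional[bool]:
        """Assumed rule: accept when score >= threshold.

        Returns None when the method declares no threshold.
        """
        if self.threshold is None:
            return None
        return score >= self.threshold


# Values copied from the first schema example.
fuzzy = ConfidenceMethod(
    has_or_had_type="fuzzy_matching",
    has_or_had_description="Levenshtein distance with ratio normalization",
    has_or_had_identifier="rapidfuzz-levenshtein",
    threshold=0.85,
    is_deterministic=True,
)
```

With this sketch, `fuzzy.accepts(0.9)` evaluates to `True` and `fuzzy.accepts(0.5)` to `False`, while a method declared without a threshold (such as the `llm_classification` example) yields `None`.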