- Added `id`, `name`, `title`, and `description` fields to multiple LinkML class YAML files. - Standardized prefixes across all class definitions. - Introduced a new script `fix_linkml_metadata.py` to automate the addition of metadata to class files. - Updated existing class files to ensure compliance with the new metadata structure.
116 lines
3.9 KiB
YAML
116 lines
3.9 KiB
YAML
id: https://nde.nl/ontology/hc/class/ConfidenceThreshold
|
|
name: ConfidenceThreshold
|
|
title: Confidence Threshold
|
|
description: LinkML class definition for Confidence Threshold
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
schema: http://schema.org/
|
|
skos: http://www.w3.org/2004/02/skos/core#
|
|
rico: https://www.ica.org/standards/RiC/ontology#
|
|
wd: http://www.wikidata.org/entity/
|
|
# ConfidenceThreshold - Class for structured threshold values
|
|
# Created 2026-01-22 per slot_fixes.yaml revision for confidence_threshold (Rule 53)
|
|
#
|
|
id: https://nde.nl/ontology/hc/class/ConfidenceThreshold
|
|
name: ConfidenceThreshold
|
|
title: Confidence Threshold
|
|
|
|
prefixes:
|
|
linkml: https://w3id.org/linkml/
|
|
hc: https://nde.nl/ontology/hc/
|
|
schema: http://schema.org/
|
|
|
|
imports:
|
|
- linkml:types
|
|
- ../slots/has_or_had_description
|
|
- ./Description
|
|
|
|
default_prefix: hc
|
|
|
|
classes:
|
|
ConfidenceThreshold:
|
|
class_uri: hc:ConfidenceThreshold
|
|
description: |
|
|
A threshold value used in confidence-based processing or validation.
|
|
|
|
**DEFINITION**:
|
|
|
|
ConfidenceThreshold captures the minimum (or maximum) confidence level
|
|
required for an action or classification to be accepted. Common in:
|
|
- NLP entity extraction pipelines
|
|
- ML classification workflows
|
|
- Automated data validation
|
|
- Quality assurance checks
|
|
|
|
**THRESHOLD TYPES**:
|
|
|
|
- **CONFIDENCE**: Minimum confidence score for acceptance (0.0-1.0)
|
|
- **PROBABILITY**: Probability threshold for classification
|
|
- **QUALITY**: Minimum quality score for data acceptance
|
|
- **SIMILARITY**: Minimum similarity score for matching
|
|
|
|
**EXAMPLE**:
|
|
```yaml
|
|
has_or_had_threshold:
|
|
- threshold_value: 0.85
|
|
threshold_type: CONFIDENCE
|
|
threshold_description: "Entity must have ≥85% confidence to be extracted"
|
|
- threshold_value: 0.70
|
|
threshold_type: SIMILARITY
|
|
threshold_description: "Name matches require ≥70% similarity"
|
|
```
|
|
|
|
**Migration (2026-01-22)**:
|
|
Created as part of confidence_threshold → has_or_had_threshold migration
|
|
per slot_fixes.yaml (Rule 53).
|
|
|
|
slots:
|
|
- has_or_had_description
|
|
|
|
attributes:
|
|
threshold_value:
|
|
description: The numeric threshold value (typically 0.0-1.0 for confidence)
|
|
range: float
|
|
required: true
|
|
threshold_type:
|
|
description: Type of threshold (CONFIDENCE, PROBABILITY, QUALITY, SIMILARITY)
|
|
range: string
|
|
threshold_operator:
|
|
description: Comparison operator (GTE, LTE, GT, LT, EQ)
|
|
range: string
|
|
threshold_description:
|
|
description: Human-readable description of what this threshold controls
|
|
range: string
|
|
|
|
slot_usage:
|
|
has_or_had_description:
|
|
range: Description
|
|
description: Detailed description of the threshold and its application
|
|
|
|
annotations:
|
|
custodian_types: '["*"]'
|
|
custodian_types_rationale: "Threshold-based processing applies across heritage types."
|
|
specificity_score: 0.6
|
|
specificity_rationale: "Specialized class for methodology/processing contexts."
|
|
|
|
examples:
|
|
- value:
|
|
threshold_value: 0.85
|
|
threshold_type: CONFIDENCE
|
|
threshold_operator: GTE
|
|
threshold_description: "Entity extraction requires ≥85% confidence"
|
|
description: NLP entity extraction threshold
|
|
|
|
- value:
|
|
threshold_value: 0.70
|
|
threshold_type: SIMILARITY
|
|
threshold_operator: GTE
|
|
threshold_description: "Name matching requires ≥70% Levenshtein similarity"
|
|
description: Fuzzy matching threshold
|
|
|
|
- value:
|
|
threshold_value: 0.95
|
|
threshold_type: QUALITY
|
|
threshold_operator: GTE
|
|
threshold_description: "Data must pass 95% quality checks"
|
|
description: Data quality threshold
|