glam/apps/archief-assistent/public
kempersc f2bc2d54cb feat(archief-assistent): integrate ontology-driven vocabulary into semantic cache
Implements Rule 46: Ontology-Driven Cache Segmentation

Semantic Cache Enhancements:
- Add institutionSubtype, recordSetType, wikidataEntity to ExtractedEntities
- Add extractionMethod field to track vocabulary vs regex extraction
- Implement async extractEntitiesWithVocabulary() using term log
- Maintain sync regex fallback for cache key generation (<5ms)

Build Pipeline:
- Add prebuild hook to regenerate types-vocab.json from LinkML schemas
- Extract vocabulary from *Type.yaml and *Types.yaml schema files
- Generate GLAMORCUBESFIXPHDNT code mappings automatically

New Script:
- scripts/extract-types-vocab.ts - Extracts vocabulary from LinkML schemas
- Supports --skip-embeddings flag for faster builds
- Outputs to apps/archief-assistent/public/types-vocab.json

This enables richer cache segmentation using ontology-derived subtypes
(e.g., 'MUNICIPAL_ARCHIVE', 'ART_MUSEUM') instead of just top-level
GLAMORCUBESFIXPHDNT codes.
2026-01-10 13:30:30 +01:00
..
ontology add sparql validator and RAG injector 2025-12-30 03:43:31 +01:00
schemas/20251121 enrich HC profiles 2026-01-02 02:11:04 +01:00
de-aa-logo.svg remove a,bihguous web-claims 2025-12-21 00:01:54 +01:00
favicon.svg remove a,bihguous web-claims 2025-12-21 00:01:54 +01:00
na-logo.svg remove a,bihguous web-claims 2025-12-21 00:01:54 +01:00
types-vocab.json feat(archief-assistent): integrate ontology-driven vocabulary into semantic cache 2026-01-10 13:30:30 +01:00