kempersc/glam - Forgejo: Beyond coding. We Forge.

Author	SHA1	Message	Date
kempersc	3b676f3ea5	fix: remove duplicate ontology mappings rendering in LinkML viewer Some checks are pending Deploy Frontend / build-and-deploy (push) Waiting to run Details The slot details section was rendering close_mappings, narrow_mappings, broad_mappings, and related_mappings twice each. This caused the mappings to appear duplicated on pages like /linkml?class=AcademicArchive. Removed 68 lines of duplicate JSX code.	2026-01-13 18:11:54 +01:00
kempersc	8a3c907f59	fix: resolve CIDOC-CRM relative URIs, add EDM ontology, render HTML in descriptions Some checks failed Deploy Frontend / build-and-deploy (push) Has been cancelled Details DSPy RAG Evaluation / Layer 1 - Unit Tests (push) Successful in 11m12s Details DSPy RAG Evaluation / Layer 2 - DSPy Module Tests (push) Successful in 12m55s Details DSPy RAG Evaluation / Layer 3 - Integration Tests (push) Successful in 10m24s Details DSPy RAG Evaluation / Layer 4 - Comprehensive Evaluation (push) Successful in 11m31s Details DSPy RAG Evaluation / Quality Gate (push) Successful in 2s Details - Fix resolveUri() to handle bare local names like 'E27_Site' used by CIDOC-CRM (previously only handled URIs starting with '#') - Add EDM (Europeana Data Model) ontology to frontend - Copy edm.owl to frontend/public/ontology/ - Register in ONTOLOGY_FILES array - Add 'edm' prefix to STANDARD_PREFIXES - Add EDM color to ONTOLOGY_COLORS - Render HTML content in ontology descriptions safely using DOMPurify - Sanitize HTML to allow only safe tags (a, br, em, strong, etc.) - Fix Schema.org relative links to absolute URLs - Add target='_blank' to external links	2026-01-13 16:50:40 +01:00
kempersc	f74513e8ef	feat: Enhance entity resolution with email semantics and review merging - Updated `entity_review.py` to map email semantic fields from JSON. - Expanded `email_semantics.py` with additional museum mappings. - Introduced a new rule in `.opencode/rules/no-duplicate-ontology-mappings.md` to prevent duplicate ontology mappings. - Added a backup JSON file for entity resolution candidates. - Created `enrich_email_semantics.py` to enrich candidates with email semantic signals. - Developed `merge_entity_reviews.py` to merge reviewed decisions from a backup into new candidates.	2026-01-13 16:43:56 +01:00
kempersc	21ed120ac2	fix: correct hallucinated RiC-O terms and add locn ontology RiC-O hallucinated terms fixed: - FindingAidType.yaml: rico:FindingAidType → rico:DocumentaryFormType - has_acquisition_method.yaml: rico:hasOrHadActivityType → prov:wasGeneratedBy - has_activity_type.yaml: rico:hasOrHadActivityType → dcterms:type - has_arrangement.yaml: rico:hasOrHadArrangement → dcterms:description - has_or_had_finding_aid.yaml: rico:isDescribedBy → rico:isOrWasDescribedBy The following terms do NOT exist in RiC-O 1.1: - rico:FindingAidType (use rico:DocumentaryFormType) - rico:hasOrHadActivityType (no equivalent) - rico:hasOrHadArrangement (no equivalent) - rico:isDescribedBy (correct form: rico:isOrWasDescribedBy) Added LOCN ontology support: - Copied locn.ttl to frontend/public/ontology/ - Added LOCN to ONTOLOGY_FILES in ontology-loader.ts - Added locn prefix to OntologyTermPopup.tsx - LOCN (http://www.w3.org/ns/locn#) is W3C Location Core Vocabulary for addresses and geometry (used by locn:Address)	2026-01-13 16:42:32 +01:00
kempersc	6781073d06	fix: add @base directive support for Turtle/RDF parsing The VCard ontology file (and 3 others) use @base directive with relative URIs like <#Address>. The Turtle parser was not extracting @base or resolving relative URIs against it. Changes: - Extract @base directive in first pass alongside @prefix - Add baseUri parameter to expandUri() function - Handle relative URIs starting with # (resolve against base) - Handle empty relative URI <> (returns base URI itself) - Pass baseUri through to processSubject() function This fixes the 'Term not found' error for vcard:Address and similar terms that use relative URI notation in their ontology definitions. Affected ontologies: vcard.rdf, prov.ttl, era_ontology.ttl, ebg-ontology.ttl	2026-01-13 15:54:29 +01:00
kempersc	f2b10fca19	fix: correct hallucinated PREMIS terms and Schema.org namespace mismatch All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 3m48s Details PREMIS ontology fixes (8 schema files): - Replace invalid premis:hasRepresentation with dcterms:hasFormat - Replace invalid premis:hasAccessRestriction with odrl:hasPolicy - Replace invalid premis:hasPreservationPolicy with dcterms:conformsTo - Replace invalid premis:hasAccessPolicy with dcterms:accessRights - Replace invalid premis:hasStoragePolicy with dcterms:conformsTo - Replace invalid premis:ProcessingStatus with skos:Concept - Add proper close_mappings to valid PREMIS classes (premis:Representation, etc.) - Document hallucinated terms in Rule 51 (AGENTS.md) for future prevention Schema.org namespace fixes (3 frontend files): - Update OntologyTermPopup.tsx: add normalizeSchemaOrgUri() function - Update ontology-loader.ts: change schema prefix to https://schema.org/ - Update linkml-schema-service.ts: change schema prefix to https://schema.org/ - The schemaorg.owl file uses https:// but code was using http:// These changes ensure ontology term lookups work correctly for Schema.org terms and that LinkML schema files only reference valid ontology predicates.	2026-01-13 14:16:33 +01:00
kempersc	1fb924c412	feat: add ontology mappings to LinkML schema and enhance entity resolution Schema enhancements (443 files): - Add class_uri with proper ontology references (schema:, prov:, skos:, rico:) - Add close_mappings, related_mappings per Rule 50 convention - Replace stub hc: slot_uri with standard predicates (dcterms:identifier, skos:prefLabel) - Improve descriptions with ontology mapping rationale - Add prefixes blocks to all schema modules Entity Resolution improvements: - Add entity_resolution module with email semantics parsing - Enhance build_entity_resolution.py with email-based matching signals - Extend Entity Review API with filtering by signal types and count - Add candidates caching and indexing for performance - Add ReviewLoginPage component New rules and documentation: - Add Rule 51: No Hallucinated Ontology References - Add .opencode/rules/no-hallucinated-ontology-references.md - Add .opencode/rules/slot-ontology-mapping-reference.md - Add adms.ttl and dqv.ttl ontology files Frontend ontology support: - Add RiC-O_1-1.rdf and schemaorg.owl to public/ontology	2026-01-13 13:51:02 +01:00
kempersc	c5fb9ec88e	feat: add route for Entity Review page with lazy loading	2026-01-13 01:49:43 +01:00
kempersc	8d7aca0f98	Refactor code structure for improved readability and maintainability	2026-01-12 19:13:35 +01:00
kempersc	3b35f4aea5	Refactor code structure for improved readability and maintainability	2026-01-12 18:31:31 +01:00
kempersc	846a6cdcec	Add new Record Set Types for various archival collections - Introduced SoundArchiveRecordSetType, SpecialCollectionRecordSetType, SpecializedArchiveRecordSetType, SpecializedArchivesCzechiaRecordSetType, StateArchivesRecordSetType, StateArchivesSectionRecordSetType, StateDistrictArchiveRecordSetType, StateRegionalArchiveCzechiaRecordSetType, TelevisionArchiveRecordSetType, TradeUnionArchiveRecordSetType, UniversityArchiveRecordSetType, VereinsarchivRecordSetType, VerlagsarchivRecordSetType, VerwaltungsarchivRecordSetType, WebArchiveRecordSetType, and WomensArchivesRecordSetType. - Each new type includes appropriate metadata, slots, and relationships to existing classes. - Implemented a script to detect and fix Type class violations in LinkML files.	2026-01-12 15:20:29 +01:00
kempersc	5807840bbc	fix: update generated timestamp in manifest.json Some checks failed Deploy Frontend / build-and-deploy (push) Successful in 4m2s Details DSPy RAG Evaluation / Layer 1 - Unit Tests (push) Failing after 6s Details DSPy RAG Evaluation / Layer 3 - Integration Tests (push) Has been skipped Details DSPy RAG Evaluation / Layer 2 - DSPy Module Tests (push) Has been skipped Details DSPy RAG Evaluation / Layer 4 - Comprehensive Evaluation (push) Has been skipped Details DSPy RAG Evaluation / Quality Gate (push) Failing after 2s Details	2026-01-12 14:46:05 +01:00
kempersc	355d8be51d	centralise slots	2026-01-12 14:33:56 +01:00
kempersc	f497be98d1	chore: update schema manifest after slot centralization All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 3m54s Details	2026-01-11 23:28:03 +01:00
kempersc	da5660cf4c	chore: update schema manifest timestamp	2026-01-11 23:19:52 +01:00
kempersc	5d3d8530b0	chore: trigger DSPy eval workflow All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 4m13s Details	2026-01-11 22:40:23 +01:00
kempersc	56c373bba8	Implement fast WCMS migration script with state file checkpointing and batch processing	2026-01-11 22:26:37 +01:00
kempersc	174a420c08	refactor(schema): centralize 1515 inline slot definitions per Rule 48 All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 3m57s Details - Remove inline slot definitions from 144 class files - Create 7 new centralized slot files in modules/slots/: - custodian_type_broader.yaml - custodian_type_narrower.yaml - custodian_type_related.yaml - definition.yaml - finding_aid_access_restriction.yaml - finding_aid_description.yaml - finding_aid_temporal_coverage.yaml - Add centralize_inline_slots.py automation script - Update manifest with new timestamp Rule 48: Class files must NOT define inline slots - all slots must be imported from modules/slots/ directory. Note: Pre-existing IdentifierFormat duplicate class definition (in Standard.yaml and IdentifierFormat.yaml) not addressed in this commit - requires separate schema refactor.	2026-01-11 22:02:14 +01:00
kempersc	3e6c2367ad	feat(linkml-viewer): UX improvements - entry counts, deep links, settings persistence All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 4m4s Details - Add entry count badge next to schema file name showing (xC, yE, zS) counts - Add tooltip explaining LinkML file names vs class names - Remove redundant section headers (Classes, Enums, Slots collapsible sections) - Add URL params for enum (?enum=) and slot (?slot=) deep linking - Persist category filters, dev tools visibility, and legend visibility to localStorage - Set 'Main Schema' filter to OFF by default (confusing for users) - Add Rule 48: Class files must not define inline slots	2026-01-11 21:42:35 +01:00
kempersc	eff3153f3f	feat(schema): add Environmental Zone Type slot definitions All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 3m56s Details Add 4 slot files for EnvironmentalZoneType class: - environmental_zone_type_id: URI identifier slot - environmental_zone_type_code: code slot for zone type codes - environmental_zone_type_label: human-readable label - environmental_zone_type_description: detailed description Update manifest.json with new slot count (2084 slots total)	2026-01-11 21:22:44 +01:00
kempersc	95d79d0078	fix: update manifest with new generated timestamp and file counts; add EnvironmentalZoneType classes and new slot requirements All checks were successful Deploy Frontend / build-and-deploy (push) Successful in 4m51s Details	2026-01-11 21:15:49 +01:00
kempersc	10bb5b69c5	Add Environmental Zone Type Enumeration and related slots - Introduced EnvironmentalZoneTypeEnum.yaml to classify climate-controlled storage zones with detailed descriptions and recommended conditions for various materials. - Created slots for environmental zone type code, description, ID, label, and HC preset URI to facilitate structured data representation. - Implemented boolean slots for specific environmental requirements including dark storage, dust-free environment, ESD protection, and UV filtering, referencing relevant ISO standards. - Enhanced documentation for each slot to clarify usage and preservation context.	2026-01-11 21:14:59 +01:00
kempersc	66ab2908d0	fix: remove deprecated AnnotationMotivationEnum, add European surname data Some checks failed Deploy Frontend / build-and-deploy (push) Failing after 3m21s Details - Move deprecated AnnotationMotivationEnum to archive-deprecated/ (outside served paths) - Add French, Italian, Polish, Spanish surname datasets for entity resolution - Update name_commonality.py with expanded European surname detection - Triggers GitOps workflow to test Forgejo Actions runner	2026-01-11 16:03:18 +01:00
kempersc	fd792fce2c	Refactor code structure for improved readability and maintainability Some checks failed Deploy Frontend / build-and-deploy (push) Has been cancelled Details	2026-01-11 15:27:14 +01:00
kempersc	0f7fbf1ca0	feat(ci): add Forgejo Actions workflow for auto-deploy on LinkML schema changes Some checks are pending Deploy Frontend / build-and-deploy (push) Waiting to run Details Infrastructure changes to enable automatic frontend deployment when schemas change: - Add .forgejo/workflows/deploy-frontend.yml workflow triggered by: - Changes to frontend/ or schemas/20251121/linkml/ - Manual workflow dispatch - Rewrite generate-schema-manifest.cjs to properly scan all schema directories - Recursively scans classes, enums, slots, modules directories - Uses singular category names (class, enum, slot) matching TypeScript types - Includes all 4 main schemas at root level - Skips archive directories and backup files - Update schema-loader.ts to match new manifest format - Add SchemaCategory interface - Update SchemaManifest to use categories as array - Add flattenCategories() helper function - Add getSchemaCategories() and getSchemaCategoriesSync() functions The workflow builds frontend with updated manifest and deploys to bronhouder.nl	2026-01-11 14:16:57 +01:00
kempersc	329b341bb1	refactor(schema): sync AnnotationMotivationType changes to frontend public schemas - Update VideoAnnotation class with new motivation type references - Add AnnotationMotivationType and AnnotationMotivationTypes class files - Add motivation_type slots (description, id, name) - Archive deprecated AnnotationMotivationEnum - Update slot references for derived_from_entity, has_observation, has_person_observation	2026-01-11 14:16:39 +01:00
kempersc	9726cc7917	feat(frontend): Add AnnotationMotivationType to LinkML schema manifest Add new AnnotationMotivationType and AnnotationMotivationTypes to the SCHEMA_FILES array so they appear in the /linkml viewer.	2026-01-11 13:56:11 +01:00
kempersc	0a888ec682	chore: add node_modules to .gitignore and remove from tracking - Add node_modules/ and .pnpm-store/ to .gitignore - Remove 76k node_modules files from git tracking - Update frontend manifest	2026-01-11 00:41:21 +01:00
kempersc	a4184cb805	feat(infra): add webhook-based schema deployment pipeline - Add FastAPI webhook receiver for Forgejo push events - Add setup script for server deployment - Add Caddy snippet for webhook endpoint - Add local sync-schemas.sh helper script - Sync frontend schemas with source (archived deprecated slots) Infrastructure scripts staged for optional webhook deployment. Current deployment uses: ./infrastructure/deploy.sh --frontend	2026-01-10 21:45:02 +01:00
kempersc	6c19ef8661	feat(rag): add Rule 46 epistemic provenance tracking Track full lineage of RAG responses: WHERE data comes from, WHEN it was retrieved, HOW it was processed (SPARQL/vector/LLM). Backend changes: - Add provenance.py with EpistemicProvenance, DataTier, SourceAttribution - Integrate provenance into MultiSourceRetriever.merge_results() - Return epistemic_provenance in DSPyQueryResponse Frontend changes: - Pass EpistemicProvenance through useMultiDatabaseRAG hook - Display provenance in ConversationPage (for cache transparency) Schema fixes: - Fix truncated example in has_observation.yaml slot definition References: - Pavlyshyn's Context Graphs and Data Traces paper - LinkML ProvenanceBlock schema pattern	2026-01-10 18:42:43 +01:00
kempersc	28c3aaf33f	enrich profiles	2026-01-10 17:31:02 +01:00
kempersc	13938c92ca	chore(schemas): sync LinkML schemas to frontend apps Copies authoritative schemas from schemas/20251121/ to: - frontend/public/schemas/20251121/ - apps/archief-assistent/public/schemas/20251121/ This ensures slot definitions with corrected ontology property references (commit `2808dad6cd`) are available to frontend apps.	2026-01-10 15:02:25 +01:00
kempersc	8a475d5c02	refactor(linkml): apply RiC-O slot naming conventions (Rule 39) Rename slots to follow Records in Contexts (RiC-O) style naming: - Add 'has_' prefix for possession predicates (has_acquisition_method) - Add 'is_or_was_' prefix for temporal relationships - Add 'has_or_had_' for bidirectional temporal relations Key changes across 496 schema files: - acquisition_method → has_acquisition_method - acquisition_date → has_acquisition_date - acquisition_source → has_acquisition_source - access_policy_ref → has_access_policy_reference - arrangement → has_arrangement - parent_custodian → is_or_was_suborganization_of (hierarchy) - parent_custodian → associated_custodian (event association) Also adds new slots following RiC-O patterns: - is_or_was_aggregated_by - is_or_was_allocated_by - is_or_was_archive_department_of - was_approved_by, was_archived_at, was_asserted_by This aligns with AGENTS.md Rule 39: Slot Naming Convention (RiC-O Style) for accurate temporal semantics in heritage custodian ontology. Net change: +2,063 lines (new slots added, old patterns consolidated)	2026-01-10 10:33:51 +01:00
kempersc	004d342935	chore: minor updates and evaluation results - auth.setup.ts: require env vars for test credentials (no hardcoded defaults) - manifest.json: update schema manifest - full_evaluation_results.json: add RAG evaluation results - petra-links.json: update birth date from web claim	2026-01-09 21:10:55 +01:00
kempersc	f7bd3e9edc	feat(linkml-viewer): add slot_usage side-by-side comparison view - Add 'Compare' toggle button next to slots with slot_usage overrides - Show generic slot definition vs class-specific override in 3-column grid - Highlight changed properties with green 'changed' badge - Display '(inherited)' when override matches generic definition - Display '(not defined)' when generic has no value for property - Compare: range, description, required, multivalued, slot_uri, pattern, identifier - Full i18n support (Dutch/English translations) - Responsive design: stacks vertically on mobile (<640px)	2026-01-09 21:02:14 +01:00
kempersc	9e67d0f967	enrich profiles	2026-01-09 20:35:19 +01:00
kempersc	932ec5438c	add person profiles with PPID	2026-01-09 18:26:58 +01:00
kempersc	1ad717767a	feat(linkml-viewer): add visual indicators for slot_usage overrides - Add green 'slot_usage' badge for slots with class-specific overrides - Add ✦ markers next to properties that are overridden vs inherited - Add green left border styling for slots with slot_usage - Add i18n translations (nl/en) for override indicators - Merge generic slot definitions with class-specific slot_usage properties This helps users understand which slot properties come from the generic slot definition vs which are overridden at the class level via slot_usage.	2026-01-09 18:23:21 +01:00
kempersc	35a057981c	chore(frontend): sync schema files with custodian_type → has_or_had_custodian_type refactor - Remove deprecated slots: custodian_type.yaml, custodian_types.yaml, custodian_type_broader/narrower/related.yaml, custodian_types_primary/rationale.yaml - Add new unified slot: has_or_had_custodian_type.yaml - Sync all 236+ class files with updated slot references - Update manifest.json	2026-01-09 12:15:32 +01:00
kempersc	c88fd3af70	Refactor code structure for improved readability and maintainability	2026-01-09 11:05:26 +01:00
kempersc	0393b321c9	refactor(schema): unify custodian_type slots into has_or_had_custodian_type (Rule 39, 43) - Migrate 236+ class files from custodian_types to has_or_had_custodian_type - Archive deprecated slots: custodian_type, custodian_types, custodian_type_broader/narrower/related - Update main schema and manifest imports - Fix Custodian.yaml class to use new slot - Fix annotation format (list→scalar) in has_or_had_custodian_type.yaml Rules applied: - Rule 39: RiC-O naming convention (hasOrHad pattern) - Rule 43: Slot nouns must be singular (multivalued:true for cardinality) - Rule 38: Slot centralization with semantic URI	2026-01-09 10:55:21 +01:00
kempersc	6608a207d4	update frontend	2026-01-08 15:56:28 +01:00
kempersc	0b0ea75070	feat(rag): add factual query fast path - skip LLM for count/list queries - Add ontology cache warming at startup in lifespan() function - Add is_factual_query() detection in template_sparql.py (12 templates) - Add factual_result and sparql_query fields to DSPyQueryResponse - Skip LLM generation for factual templates (count, list, compare) - Execute SPARQL directly and return results as table (~15s → ~2s latency) - Update ConversationPanel.tsx to render factual results table - Add CSS styling for factual results with green theme For queries like 'hoeveel archieven zijn er in Den Haag', the SPARQL results ARE the answer - no need for expensive LLM prose generation.	2026-01-08 13:34:23 +01:00
kempersc	9b769f1ca2	Update manifest timestamp and minor class fixes	2026-01-07 22:04:29 +01:00
kempersc	81da4ede50	Add comprehensive slot visualization to LinkML viewer - Add standalone Slots section in visual view alongside Classes and Enums - Display slot_uri, range, identifier badge, description, pattern - Show examples with value/description pairs - Color-coded SKOS mapping tags (exact/close/narrow/broad/related) - Yellow highlighted comments section - Custodian type filtering works with slots - Shared renderSlotDetails() function for consistency	2026-01-07 22:03:08 +01:00
kempersc	d19822f958	Remove redundant sections from class descriptions - Created cleanup_class_descriptions_v2.py script using text-based regex - Removed 134 class files' redundant sections: - dual_class_pattern: 80 occurrences - ontological_alignment: 35 occurrences - ontology_alignment_upper: 33 occurrences - multilingual_labels: 26 occurrences - glamorcubes_category: 6 occurrences - example_structure: 6 occurrences - Fixed ArchiveOrganizationType.yaml parse error after cleanup - Added 49 new slot definition files - All 395 class files validate as correct YAML - Deployed to bronhouder.nl/linkml	2026-01-07 13:50:14 +01:00
kempersc	dfa667c90f	Fix LinkML schema for valid RDF generation with proper slot_uri Summary: - Create 46 missing slot definition files with proper slot_uri values - Add slot imports to main schema (01_custodian_name_modular.yaml) - Fix YAML examples sections in 116+ class and slot files - Fix PersonObservation.yaml examples section (nested objects → string literals) Technical changes: - All slots now have explicit slot_uri mapping to base ontologies (RiC-O, Schema.org, SKOS) - Eliminates malformed URIs like 'custodian/:slot_name' in generated RDF - gen-owl now produces valid Turtle with 153,166 triples New slot files (46): - RiC-O slots: rico_note, rico_organizational_principle, rico_has_or_had_holder, etc. - Scope slots: scope_includes, scope_excludes, archive_scope - Organization slots: organization_type, governance_authority, area_served - Platform slots: platform_type_category, portal_type_category - Social media slots: social_media_platform_category, post_type_* - Type hierarchy slots: broader_type, narrower_types, custodian_type_broader - Wikidata slots: wikidata_equivalent, wikidata_mapping Generated output: - schemas/20251121/rdf/01_custodian_name_modular_20260107_134534_clean.owl.ttl (6.9MB) - Validated with rdflib: 153,166 triples, no malformed URIs	2026-01-07 13:48:03 +01:00
kempersc	98c42bf272	Fix LinkML URI conflicts and generate RDF outputs - Fix scope_note → finding_aid_scope_note in FindingAid.yaml - Remove duplicate wikidata_entity slot from CustodianType.yaml (import instead) - Remove duplicate rico_record_set_type from class_metadata_slots.yaml - Fix range types for equals_string compatibility (uriorcurie → string) - Move class names from close_mappings to see_also in 10 RecordSetTypes files - Generate all RDF formats: OWL, N-Triples, RDF/XML, N3, JSON-LD context - Sync schemas to frontend/public/schemas/ Files: 1,151 changed (includes prior CustodianType migration)	2026-01-07 12:32:59 +01:00
kempersc	6c6810fa43	Replace CustodianTypeCodeEnum with CustodianType class references - Remove deprecated CustodianTypeCodeEnum from class_metadata_slots.yaml - Update custodian_types slot to use uriorcurie range (references CustodianType subclasses) - Update custodian_types_primary slot similarly - Add migration note for legacy string format ['A'] vs new URI format Per Rule 9: Enum-to-Class Promotion - Single Source of Truth	2026-01-06 12:37:40 +01:00
kempersc	b34992b1d3	Migrate all 293 class files to ontology-aligned slots Extends migration to all class types (museums, libraries, galleries, etc.) New slots added to class_metadata_slots.yaml: - RiC-O: rico_record_set_type, rico_organizational_principle, rico_has_or_had_holder, rico_note - Multilingual: label_de, label_es, label_fr, label_nl, label_it, label_pt - Scope: scope_includes, scope_excludes, custodian_only, organizational_level, geographic_restriction - Notes: privacy_note, preservation_note, legal_note Migration script now handles 30+ annotation types. All migrated schemas pass linkml-validate. Total: 387 class files now use proper slots instead of annotations.	2026-01-06 12:24:54 +01:00
kempersc	aa763dab25	Migrate 94 archive class annotations to ontology-aligned slots - Add migration script: scripts/migrate_annotations_to_slots.py - Convert custodian_types, wikidata, skos_broader, specificity_* annotations - Replace with proper slots mapped to SKOS, PROV-O, RiC-O predicates - Add ../slots/class_metadata_slots import to all migrated files - Remove AcademicArchive_refactored.yaml (main file now migrated) - Sync changes to frontend/public/schemas/ Migration converts: - custodian_types → hc:custodianTypes slot - wikidata/wikidata_label → wikidata_alignment structured slot - skos_broader → skos:broader slot - specificity_* → specificity_annotation structured slot - dual_class_pattern → dual_class_link structured slot - template_specificity → template_specificity slot All 94 migrated schemas pass linkml-validate.	2026-01-06 11:25:37 +01:00
kempersc	f37f5208ca	Copy class metadata slots to frontend public folder for deployment	2026-01-06 11:17:12 +01:00
kempersc	11983014bb	Enhance specificity scoring system integration with existing infrastructure - Updated documentation to clarify integration points with existing components in the RAG pipeline and DSPy framework. - Added detailed mapping of SPARQL templates to context templates for improved specificity filtering. - Implemented wrapper patterns around existing classifiers to extend functionality without duplication. - Introduced new tests for the SpecificityAwareClassifier and SPARQLToContextMapper to ensure proper integration and functionality. - Enhanced the CustodianRDFConverter to include ISO country and subregion codes from GHCID for better geospatial data handling.	2026-01-05 17:37:49 +01:00
kempersc	41d8905661	Fix Turtle parser multi-line string handling for PiCo ontology - Fixed bug where closing triple-quotes (""") would incorrectly re-trigger multi-line string detection, causing subsequent class definitions to be skipped - Added lineToProcess variable to track which portion of line to process after closing a multi-line string, preventing re-detection of opening quotes - Moved UML large diagram confirmation logic from OntologyViewerPage to UMLVisualization component for better encapsulation - PiCo ontology now correctly shows all 8 classes instead of 2 Deployed and verified on https://bronhouder.nl/ontology?ontology=PiCo	2026-01-05 11:25:43 +01:00
kempersc	242bc8bb35	Add new slots for heritage custodian entities - Created deliverables_slot for expected or achieved deliverable outputs. - Introduced event_id_slot for persistent unique event identifiers. - Added follow_up_date_slot for scheduled follow-up action dates. - Implemented object_ref_slot for references to heritage objects. - Established price_slot for price information across entities. - Added price_currency_slot for currency codes in price information. - Created protocol_slot for API protocol specifications. - Introduced provenance_text_slot for full provenance entry text. - Added record_type_slot for classification of record types. - Implemented response_formats_slot for supported API response formats. - Established status_slot for current status of entities or activities. - Added FactualCountDisplay component for displaying count query results. - Introduced ReplyTypeIndicator component for visualizing reply types. - Created approval_date_slot for formal approval dates. - Added authentication_required_slot for API authentication status. - Implemented capacity_items_slot for maximum storage capacity. - Established conservation_lab_slot for conservation laboratory information. - Added cost_usd_slot for API operation costs in USD.	2026-01-05 00:49:05 +01:00
kempersc	89001fbc53	compact header controls on OntologyViewer and QueryBuilder pages	2026-01-04 17:29:34 +01:00
kempersc	eb61f45de2	compact UML controls toolbar to fit single line when sidebar collapsed	2026-01-04 17:21:53 +01:00
kempersc	2dca28d8c1	enrich CH entries with mission statements	2026-01-04 13:12:32 +01:00
kempersc	4f0cafe98a	enrich HC profiles	2026-01-02 02:11:04 +01:00
kempersc	349f31ae6f	enrich custodian profiles	2026-01-02 02:10:18 +01:00
kempersc	45e873ec0a	enrich JP BE AR profiles	2025-12-30 23:07:03 +01:00
kempersc	d64f857aa9	add sparql validator and RAG injector	2025-12-30 03:43:31 +01:00
kempersc	ca219340f2	enrich entries	2025-12-26 14:30:31 +01:00
kempersc	54869589d1	fix(linkml-viewer): 3D cube visualization bugs - prevent click-to-filter and parse JSON custodian_types - Remove onFaceClick prop from CustodianTypeIndicator3D in class/slot/enum detail views to prevent accidental filtering when clicking decorative cubes (Bug 3) - Add parseCustodianTypesAnnotation() helper to handle JSON-stringified arrays like '["A"]' in YAML annotations, fixing Bug 2 where all 19 letters appeared on every cube - Legend bar retains onTypeClick for intentional filtering functionality	2025-12-23 20:40:32 +01:00
kempersc	5e8a432ef0	enrich japanese and dutch custodians	2025-12-23 18:08:45 +01:00
kempersc	a1fb6344e7	enriching custodian data	2025-12-23 17:26:29 +01:00
kempersc	0c1d19e98b	enrich entries	2025-12-23 13:27:35 +01:00
kempersc	aca68ea47f	remove a,bihguous web-claims	2025-12-21 00:01:54 +01:00
kempersc	23b1d8ee5f	clean up GHCID	2025-12-17 11:58:40 +01:00
kempersc	99430c2a70	add new entries and semantic routing	2025-12-17 10:11:56 +01:00
kempersc	5fe692296d	Fix RDF visualization: correct SPARQL namespaces and show all node types - Update SPARQL CONSTRUCT query to use correct ontology namespaces: - hc: https://w3id.org/heritage/custodian/ (was nde) - Use nde:Custodian type (was crm:E39_Actor) - Use schema:location + geo:lat/long (was crm predicates) - Remove LIMIT 500 clause to fetch all results - Show all node types by default instead of random single type - Fixes issue where Knowledge Graph showed incomplete/random data	2025-12-17 08:55:26 +01:00
kempersc	e0dd847491	extend ontology	2025-12-16 20:27:39 +01:00
kempersc	b0416efc7d	enrich custodians and persons	2025-12-16 11:57:34 +01:00
kempersc	52ae711c56	add timespans	2025-12-16 09:02:52 +01:00
kempersc	b1340e30c8	add timespan	2025-12-15 22:35:35 +01:00
kempersc	cb56aa7e40	enrich all custodian timespan	2025-12-15 22:31:41 +01:00
kempersc	82aa655522	feat(conversation): Add resizable embedding projector panel with improved UX - Larger default size (700x550) for better readability - Resizable from all 8 edges/corners with visual SE grip indicator - Clearer button icons (18px, strokeWidth 2.5) - Draggable, minimizable, pinnable panel - Dark theme and mobile responsive support	2025-12-15 17:45:27 +01:00
kempersc	31bbce13e6	fix(types): Make genealogiewerkbalk nested fields optional Fixes TypeScript error where parseGenealogiewerkbalk returns optional fields but Institution interface expected required fields.	2025-12-15 09:04:09 +01:00
kempersc	0a38225b36	feat(frontend): Add multi-select filters, URL params, and UI improvements - Institution Browser: multi-select for types and countries - URL query param sync for shareable filter URLs - New utility: countryNames.ts with flag emoji support - New utility: imageProxy.ts for image URL handling - New component: SearchableMultiSelect dropdown - Career timeline CSS and component updates - Media gallery improvements - Lazy load error boundary component - Version check utility	2025-12-15 01:47:11 +01:00
kempersc	22709cc13e	feat(rag): Add per-message refresh, bypass cache toggle, and cache clear improvements - Add refresh button to assistant messages for re-running queries with fresh results - Highlight refresh button (amber) for cached responses to draw attention - Add spinning icon animation while refreshing - Fix cache clear to return detailed success/failure status for local vs shared cache - Add bypass cache toggle that forces fresh queries (one-shot, resets after query) - Add Dutch/English translations for new UI elements	2025-12-14 19:12:25 +01:00
kempersc	1d26cade66	correct person labels	2025-12-14 17:58:55 +01:00
kempersc	c6aee998db	correct person labels	2025-12-14 17:29:39 +01:00
kempersc	c50c35fd3a	enrich person custodian	2025-12-14 17:09:55 +01:00
kempersc	41aace785f	feat: Add SyncPanel component for database synchronization - Add SyncPanel component with bilingual (NL/EN) support - Add relative URL handling for production (bronhouder.nl) - Integrate SyncPanel into Database page - Show sync status for all 4 databases (DuckLake, PostgreSQL, Oxigraph, Qdrant) - Support dry-run mode and file limit options	2025-12-12 23:42:22 +01:00
kempersc	505c12601a	Add test script for PiCo extraction from Arabic waqf documents - Implemented a new script `test_pico_arabic_waqf.py` to test the GLM annotator's ability to extract person observations from Arabic historical documents. - The script includes environment variable handling for API token, structured prompts for the GLM API, and validation of extraction results. - Added comprehensive logging for API responses, extraction results, and validation errors. - Included a sample Arabic waqf text for testing purposes, following the PiCo ontology pattern.	2025-12-12 17:50:17 +01:00
kempersc	b1f93b6f22	enrich person profiles	2025-12-12 12:51:10 +01:00
kempersc	03263f67d6	moved web archives	2025-12-12 00:40:26 +01:00
kempersc	1b1cfbfca0	enrich custodians	2025-12-11 22:32:09 +01:00
kempersc	d4906abae4	update postgis data	2025-12-10 23:51:51 +01:00
kempersc	be3fbac601	enrich entries and persons	2025-12-10 18:04:25 +01:00
kempersc	41959f0766	correct HCID!	2025-12-10 13:01:13 +01:00
kempersc	3a6ead8fde	feat: Add legal form filtering rule for CustodianName - Introduced LEGAL-FORM-FILTER rule to standardize CustodianName by removing legal form designations. - Documented rationale, examples, and implementation guidelines for the filtering process. docs: Create README for value standardization rules - Established a comprehensive README outlining various value standardization rules applicable to Heritage Custodian classes. - Categorized rules into Name Standardization, Geographic Standardization, Web Observation, and Schema Evolution. feat: Implement transliteration standards for non-Latin scripts - Added TRANSLIT-ISO rule to ensure GHCID abbreviations are generated from emic names using ISO standards for transliteration. - Included detailed guidelines for various scripts and languages, along with implementation examples. feat: Define XPath provenance rules for web observations - Created XPATH-PROVENANCE rule mandating XPath pointers for claims extracted from web sources. - Established a workflow for archiving websites and verifying claims against archived HTML. chore: Update records lifecycle diagram - Generated a new Mermaid diagram illustrating the records lifecycle for heritage custodians. - Included phases for active records, inactive archives, and processed heritage collections with key relationships and classifications.	2025-12-09 16:58:41 +01:00
kempersc	a7321b1bb9	reconstruct location blocks	2025-12-09 12:25:16 +01:00
kempersc	cab712659d	recover location blocks	2025-12-09 11:34:56 +01:00
kempersc	62fdd35321	Refactor code structure for improved readability and maintainability	2025-12-09 11:15:51 +01:00
kempersc	131e3ca259	normalise custodian entries	2025-12-09 07:56:35 +01:00
kempersc	13f67bed19	feat(frontend): add graph visualization and data explorer features Database Panels: - Add D3.js force-directed graph visualization to Oxigraph and TypeDB panels - Add 'Explore' tab with class/entity browser, graph/table toggle, and search - Add data explorer to PostgreSQL panel with table browser, pagination, search, export - Fix SPARQL variable naming bug in Oxigraph getGraphData() function - Add node details panel showing selected entity attributes - Add zoom/pan controls and node coloring by entity type Map Features: - Add TimelineSlider component for temporal filtering of institutions - Support dual-handle range slider with decade histogram - Add quick presets (Ancient, Medieval, Modern, Contemporary) - Show institution density visualization by founding decade Hooks: - Extend useOxigraph with getGraphData() for graph visualization - Extend useTypeDB with getGraphData() for graph visualization - Extend usePostgreSQL with getTableData() and exportTableData() - Improve useDuckLakeInstitutions with temporal filtering support Styles: - Add HeritageDashboard.css with shared panel styling - Add TimelineSlider.css for timeline component styling	2025-12-08 14:56:17 +01:00
kempersc	7e3559f7e5	add new entries	2025-12-07 23:08:02 +01:00
kempersc	57c743b005	refactor(frontend): fetch NL municipalities from PostGIS API instead of static file Replace static netherlands_municipalities_simplified.geojson with dynamic PostGIS API call to /boundaries/countries/NL/admin2/geojson. Transform API response properties to expected format: - API: {code, name, name_local, admin1_code, admin1_name} - Expected: {code, naam, provincieCode, provincieNaam} This ensures NL boundary data comes from the authoritative PostGIS database rather than a static file that could become outdated.	2025-12-07 19:48:07 +01:00
kempersc	12965071be	fix(frontend): improve DuckLake connection detection in map page Wait for DuckLake loading to complete before deciding whether to use DuckLake data or fallback to static JSON. Prevents race conditions.	2025-12-07 19:23:12 +01:00
kempersc	1981dc28ed	fix(frontend): normalize org_type names to letter codes in DuckLake hook DuckLake stores full names like 'MUSEUM' but map expects single-letter codes like 'M' for color styling. Also includes CSS fixes for Database page.	2025-12-07 19:21:58 +01:00
kempersc	810022d524	feat(frontend): add search filter and claim page filter to DuckLakePanel - Add search bar to filter table data across all columns - Filter web archive claims by selected page - Include source_page in claim queries for filtering - Fix TypeScript unused parameter warning	2025-12-07 19:20:40 +01:00
kempersc	f82dd57903	feat(frontend): add useBoundariesAPI hook for PostGIS boundary fetching New React hook that fetches administrative boundaries from the PostGIS API: - Supports international boundaries (NL, JP, CZ, DE, BE, CH, AT, etc.) - Caches admin1, admin2, and GeoJSON data - Provides point-in-polygon lookup - Includes utility functions for filtering boundaries by code/name - Replaces static GeoJSON file loading pattern	2025-12-07 19:20:21 +01:00
kempersc	d9325c0bb5	feat: add web archives integration and improve enrichment scripts Backend: - Attach web_archives.duckdb as read-only database in DuckLake - Create views for web_archives, web_pages, web_claims in heritage schema Scripts: - enrich_cities_google.py: Add batch processing and retry logic - migrate_web_archives.py: Improve schema handling and error recovery Frontend: - DuckLakePanel: Add web archives query support - Database.css: Improve layout for query results display	2025-12-07 17:49:07 +01:00
kempersc	0b06af0fb6	chore: mark unused function and ignore ducklake databases	2025-12-07 14:28:12 +01:00
kempersc	9d15cce65c	docs: add enrichment reports and update manifest Add enrichment reports from city resolution: - Austrian, Belgian, Bulgarian, Czech, Swiss ISIL enrichment reports - GeoNames update reports - Custodian creation reports - Entry-to-GHCID mapping file	2025-12-07 14:27:36 +01:00
kempersc	4825f57951	feat(frontend): improve werkgebied display and database UI - Fix polygon rendering with static paint properties instead of data-driven - Add ensureSourceAndLayers() helper for reliable layer management - Use setPaintProperty() for historical vs modern styling distinction - Improve Database page layout with back buttons and cleaner navigation - Add ResizableNestedTable component for DuckLake data display - Optimize spacing and layout in Database.css - Update schema manifest	2025-12-07 14:26:37 +01:00
kempersc	ee4e57bc75	add new entries	2025-12-07 00:26:01 +01:00
kempersc	1635625032	added web annotations	2025-12-06 19:50:04 +01:00
kempersc	55e2cd2340	feat: implement LLM-based extraction for Archives Lab content - Introduced `llm_extract_archiveslab.py` script for entity and relationship extraction using LLMAnnotator with GLAM-NER v1.7.0. - Replaced regex-based extraction with generative LLM inference. - Added functions for loading markdown content, converting annotation sessions to dictionaries, and generating extraction statistics. - Implemented comprehensive logging of extraction results, including counts of entities, relationships, and specific types like heritage institutions and persons. - Results and statistics are saved in JSON format for further analysis.	2025-12-05 23:16:21 +01:00
kempersc	3a242370fc	annotation standards added	2025-12-05 15:30:23 +01:00
kempersc	d661947830	update enriched entries	2025-12-03 17:38:46 +01:00
kempersc	ef89b1213a	validate enrichments	2025-12-02 14:36:01 +01:00
kempersc	4b833d20b2	add pids	2025-12-01 23:55:55 +01:00
kempersc	7dce283c17	Add new enums for PersonalCollectionType, ResearchCenterType, and TasteScentHeritage classifications; implement validation script for custodian names against authoritative sources	2025-12-01 18:39:22 +01:00
kempersc	48a2b26f59	feat: Add script to generate Mermaid ER diagrams with instance data from LinkML schemas - Implemented `generate_mermaid_with_instances.py` to create ER diagrams that include all classes, relationships, enum values, and instance data. - Loaded instance data from YAML files and enriched enum definitions with meaningful annotations. - Configured output paths for generated diagrams in both frontend and schema directories. - Added support for excluding technical classes and limiting the number of displayed enum and instance values for readability.	2025-12-01 16:58:03 +01:00
kempersc	097d116b72	enrich entries	2025-12-01 16:06:34 +01:00
kempersc	2497e5913f	enrich entries	2025-12-01 00:37:24 +01:00
kempersc	f3c149b1bb	update entries	2025-11-30 23:30:29 +01:00
kempersc	ff92698c7a	Implement feature X to enhance user experience and fix bug Y in module Z	2025-11-30 23:25:05 +01:00
kempersc	d623f0af4a	store archived websites	2025-11-29 20:40:46 +01:00
kempersc	0ab8f24a6b	archive websites	2025-11-29 18:05:16 +01:00
kempersc	da1eae6597	Refactor code structure for improved readability and maintainability	2025-11-29 12:27:39 +01:00
kempersc	30162e6526	Add script to validate KB library entries and generate enrichment report - Implemented a Python script to validate KB library YAML files for required fields and data quality. - Analyzed enrichment coverage from Wikidata and Google Maps, generating statistics. - Created a comprehensive markdown report summarizing validation results and enrichment quality. - Included error handling for file loading and validation processes. - Generated JSON statistics for further analysis.	2025-11-28 14:48:33 +01:00
kempersc	5cdce584b2	Add complete schema for heritage custodian observation reconstruction - Introduced a comprehensive class diagram for the heritage custodian observation reconstruction schema. - Defined multiple classes including AllocationAgency, ArchiveOrganizationType, AuxiliaryDigitalPlatform, and others, with relevant attributes and relationships. - Established inheritance and associations among classes to represent complex relationships within the schema. - Generated on 2025-11-28, version 0.9.0, excluding the Container class.	2025-11-28 13:13:23 +01:00
kempersc	0d1741c55e	Refactor code structure for improved readability and maintainability	2025-11-28 11:44:21 +01:00
kempersc	37886f0433	Refactor code structure for improved readability and maintainability	2025-11-27 17:43:14 +01:00
kempersc	5ef8ccac51	Add script to enrich NDE Register NL entries with Wikidata data - Implemented a Python script that fetches and enriches entries from the NDE Register using data from Wikidata. - Utilized the Wikibase REST API and SPARQL endpoints for data retrieval. - Added logging for tracking progress and errors during the enrichment process. - Configured rate limiting based on authentication status for API requests. - Created a structured output in YAML format, including detailed enrichment data. - Generated a log file summarizing the enrichment process and results.	2025-11-27 13:30:00 +01:00
kempersc	cd0ff5b9c7	wrap up voorbeeld lijst	2025-11-27 10:58:53 +01:00
kempersc	a6cbce1749	feat: Implement intersection calculation for UML diagram node links	2025-11-27 10:58:45 +01:00
kempersc	e99b1e644e	feat: Add platform_description slot for detailed auxiliary platform information	2025-11-26 10:18:16 +01:00
kempersc	a5a66eb547	add classes	2025-11-25 12:48:07 +01:00
kempersc	3ff0e33bf9	Add UML diagrams and scripts for custodian schema - Created PlantUML diagrams for custodian types, full schema, legal status, and organizational structure. - Implemented a script to generate GraphViz DOT diagrams from OWL/RDF ontology files. - Developed a script to generate UML diagrams from modular LinkML schema, supporting both Mermaid and PlantUML formats. - Enhanced class definitions and relationships in UML diagrams to reflect the latest schema updates.	2025-11-23 23:05:33 +01:00
kempersc	67657c39b6	feat: Complete Country Class Implementation and Hypernyms Removal - Created the Country class with ISO 3166-1 alpha-2 and alpha-3 codes, ensuring minimal design without additional metadata. - Integrated the Country class into CustodianPlace and LegalForm schemas to support country-specific feature types and legal forms. - Removed duplicate keys in FeatureTypeEnum.yaml, resulting in 294 unique feature types. - Eliminated "Hypernyms:" text from FeatureTypeEnum descriptions, verifying that semantic relationships are now conveyed through ontology mappings. - Created example instance file demonstrating integration of Country with CustodianPlace and LegalForm. - Updated documentation to reflect the completion of the Country class implementation and hypernyms removal.	2025-11-23 13:09:38 +01:00
kempersc	2761857b0d	Add scripts for converting OWL/Turtle ontology to Mermaid and PlantUML diagrams - Implemented `owl_to_mermaid.py` to convert OWL/Turtle files into Mermaid class diagrams. - Implemented `owl_to_plantuml.py` to convert OWL/Turtle files into PlantUML class diagrams. - Added two new PlantUML files for custodian multi-aspect diagrams.	2025-11-22 23:01:13 +01:00
kempersc	8907aa6213	feat: Refactor Heritage Custodian Ontology to Multi-Aspect Model - Implemented three independent aspects for custodians: CustodianLegalStatus, CustodianName, and CustodianPlace. - Renamed CustodianReconstruction to CustodianLegalStatus and updated all references. - Created new components for CustodianPlace and PlaceSpecificityEnum. - Removed direct links from CustodianObservation to Custodian, aligning with PROV-O standards. - Generated comprehensive example instance demonstrating the new architecture. - Updated documentation to reflect changes and provide guidance on multi-aspect modeling. - Added React hook for managing IndexedDB operations, including storing and loading transformation results. - Created complete YAML example for Rijksmuseum, illustrating the integration of all three aspects.	2025-11-22 15:40:17 +01:00
kempersc	94d1054f4a	Refactor code structure for improved readability and maintainability; removed redundant code blocks and optimized function calls.	2025-11-22 15:35:35 +01:00

... 3 4 5 6 7

337 commits