glam/data
kempersc 3a6ead8fde feat: Add legal form filtering rule for CustodianName
- Introduced LEGAL-FORM-FILTER rule to standardize CustodianName by removing legal form designations.
- Documented rationale, examples, and implementation guidelines for the filtering process.

docs: Create README for value standardization rules

- Added a README outlining the value standardization rules that apply to Heritage Custodian classes.
- Categorized rules into Name Standardization, Geographic Standardization, Web Observation, and Schema Evolution.

feat: Implement transliteration standards for non-Latin scripts

- Added TRANSLIT-ISO rule to ensure GHCID abbreviations are generated from emic names using ISO standards for transliteration.
- Included detailed guidelines for various scripts and languages, along with implementation examples.

feat: Define XPath provenance rules for web observations

- Created XPATH-PROVENANCE rule mandating XPath pointers for claims extracted from web sources.
- Established a workflow for archiving websites and verifying claims against archived HTML.

chore: Update records lifecycle diagram

- Generated a new Mermaid diagram illustrating the records lifecycle for heritage custodians.
- Included phases for active records, inactive archives, and processed heritage collections with key relationships and classifications.
2025-12-09 16:58:41 +01:00
custodian geocode: add coordinates to CZ, BY, CH, FR, ES custodian files from GeoNames (1145 files) 2025-12-09 16:41:41 +01:00
entity_annotation
examples
extracted normalise custodian entries 2025-12-09 07:56:35 +01:00
google_maps_enrichment feat: Add legal form filtering rule for CustodianName 2025-12-09 16:58:41 +01:00
instances
intangible_heritage
isil
json
jsonld
manual_enrichment
museum_register_nl
nde
ontology
raw
rdf
reference
review
test enrich entries 2025-12-09 10:46:43 +01:00
unified
web/lap_gaza_report_2024 normalise custodian entries 2025-12-09 07:56:35 +01:00
wikidata feat: Add legal form filtering rule for CustodianName 2025-12-09 16:58:41 +01:00
wikpedia/Destruction_of_cultural_heritage_during_the_Israeli_invasion_of_the_Gaza_Strip normalise custodian entries 2025-12-09 07:56:35 +01:00
collision_edge_case_analysis.md
deduplication_improvement_summary.md
dutch_collision_report.txt
dutch_collision_stats.json
dutch_deduplication_report.txt
dutch_institutions_with_ghcids.yaml
extraction_checkpoint.json
ISIL-codes_2025-08-01.csv
mexican_geography_analysis.yaml
NDE-logo-RGB-basis-nl-blauw.png
temp_conv1_artifact2.md
temp_conv2_artifact1.md
temp_mexican_conv1.json
temp_mexican_conv2.json