glam/data
kempersc 85d9cee82f fix: mark 8 more Google Maps false matches detected via name mismatch
Additional Type I custodian files with obvious name mismatches between
KIEN registry entries and Google Maps results. These couldn't be
auto-detected via domain mismatch because they lack official websites.

Fixes:
- Dick Timmerman (person) → carpentry business
- Ria Bos (cigar maker) → money transfer agent
- Stichting Kracom (Krampuslauf) → Happy Caps retail
- Fed. Nederlandse Vertelorganisaties → NET Foundation
- Stichting dodenherdenking Alphen → wrong memorial
- Sao Joao Rotterdam → Heemraadsplein (location not org)
- sport en spel (heritage) → equipment rental
- Eiertikken Ommen → restaurant

Also adds detection and fix scripts for Google Maps false matches.
2026-01-08 13:26:53 +01:00
..
custodian fix: mark 8 more Google Maps false matches detected via name mismatch 2026-01-08 13:26:53 +01:00
custodian.backup.20251230 backup html content 2025-12-31 02:36:38 +01:00
custodian_sample
entity_annotation
examples
extracted
google_maps_enrichment
instances
intangible_heritage
isil
json
jsonld
manual_enrichment
museum_register_nl
nde
ontology
raw
rdf
reference
reports
review
test
training
unified
validation
web/lap_gaza_report_2024
wikidata
wikpedia/Destruction_of_cultural_heritage_during_the_Israeli_invasion_of_the_Gaza_Strip
collision_edge_case_analysis.md
deduplication_improvement_summary.md
dutch_collision_report.txt
dutch_collision_stats.json
dutch_deduplication_report.txt
dutch_institutions_with_ghcids.yaml
extraction_checkpoint.json
failed_crawl_urls.txt
failed_crawl_urls_round1_backup.txt
failed_crawl_urls_round3_backup.txt
failed_crawl_urls_round4.txt
ISIL-codes_2025-08-01.csv
linkedin_locations.json
mexican_geography_analysis.yaml
missing_annotations_checkpoint.json
NDE-logo-RGB-basis-nl-blauw.png
reenrich_queue.json
sparql_templates.yaml Fix LinkML URI conflicts and generate RDF outputs 2026-01-07 12:32:59 +01:00
temp_conv1_artifact2.md
temp_conv2_artifact1.md
temp_mexican_conv1.json
temp_mexican_conv2.json
unenriched_urls_round2.txt
xxx_matches.json