glam/data/instances/global/geocoding_validation_report.md
2025-11-19 23:25:22 +01:00

2.3 KiB

Geocoding Validation Report

Generated: validate_geocoding_results.py

Summary

  • Total Institutions: 13,396
  • With Location Data: 13,396 (100.0%)
  • Successfully Geocoded: 13,393 (100.0%)
  • Failed Geocoding: 3 (0.0%)
  • Invalid Coordinates: 0
  • With GeoNames IDs: 1 (0.0% of geocoded)

Coordinate Ranges

  • Latitude: -54.9356 to 53.4412
  • Longitude: -122.4193 to 145.5834

Coverage by Country

Country Total Geocoded Failed Invalid Coverage
JP 12065 12064 1 0 100.0%
NL 1017 1015 2 0 99.8%
MX 109 109 0 0 100.0%
BR 97 97 0 0 100.0%
CL 90 90 0 0 100.0%
BE 7 7 0 0 100.0%
US 7 7 0 0 100.0%
IT 2 2 0 0 100.0%
LU 1 1 0 0 100.0%
AR 1 1 0 0 100.0%

Coverage by Institution Type

Type Total Geocoded Failed Coverage
LIBRARY 7648 7647 1 100.0%
MUSEUM 4721 4721 0 100.0%
MIXED 543 542 1 99.8%
ARCHIVE 305 304 1 99.7%
COLLECTING_SOCIETY 66 66 0 100.0%
EDUCATION_PROVIDER 38 38 0 100.0%
OFFICIAL_INSTITUTION 37 37 0 100.0%
RESEARCH_CENTER 32 32 0 100.0%
BOTANICAL_ZOO 4 4 0 100.0%
UNDEFINED 2 2 0 100.0%

Quality Indicators

Excellent Coverage (≥95%)

No Invalid Coordinates

Low GeoNames Coverage (0.0%)

Recommendations

  1. Enhance GeoNames IDs:
    • GeoNames IDs enable better geographic linking
    • Consider querying GeoNames API directly
    • Use reverse geocoding to find GeoNames IDs

Generated: '2025-11-07T18:30:43.008016+00:00'

Source: data/instances/global/global_heritage_institutions.yaml