glam/data/instances/global/geocoding_validation_report.md
2025-11-19 23:25:22 +01:00

67 lines
2.3 KiB
Markdown

# Geocoding Validation Report
Generated: validate_geocoding_results.py
## Summary
- **Total Institutions**: 13,396
- **With Location Data**: 13,396 (100.0%)
- **Successfully Geocoded**: 13,393 (100.0%)
- **Failed Geocoding**: 3 (0.0%)
- **Invalid Coordinates**: 0
- **With GeoNames IDs**: 1 (0.0% of geocoded)
## Coordinate Ranges
- **Latitude**: -54.9356 to 53.4412
- **Longitude**: -122.4193 to 145.5834
## Coverage by Country
| Country | Total | Geocoded | Failed | Invalid | Coverage |
|---------|-------|----------|--------|---------|----------|
| JP | 12065 | 12064 | 1 | 0 | 100.0% |
| NL | 1017 | 1015 | 2 | 0 | 99.8% |
| MX | 109 | 109 | 0 | 0 | 100.0% |
| BR | 97 | 97 | 0 | 0 | 100.0% |
| CL | 90 | 90 | 0 | 0 | 100.0% |
| BE | 7 | 7 | 0 | 0 | 100.0% |
| US | 7 | 7 | 0 | 0 | 100.0% |
| IT | 2 | 2 | 0 | 0 | 100.0% |
| LU | 1 | 1 | 0 | 0 | 100.0% |
| AR | 1 | 1 | 0 | 0 | 100.0% |
## Coverage by Institution Type
| Type | Total | Geocoded | Failed | Coverage |
|------|-------|----------|--------|---------|
| LIBRARY | 7648 | 7647 | 1 | 100.0% |
| MUSEUM | 4721 | 4721 | 0 | 100.0% |
| MIXED | 543 | 542 | 1 | 99.8% |
| ARCHIVE | 305 | 304 | 1 | 99.7% |
| COLLECTING_SOCIETY | 66 | 66 | 0 | 100.0% |
| EDUCATION_PROVIDER | 38 | 38 | 0 | 100.0% |
| OFFICIAL_INSTITUTION | 37 | 37 | 0 | 100.0% |
| RESEARCH_CENTER | 32 | 32 | 0 | 100.0% |
| BOTANICAL_ZOO | 4 | 4 | 0 | 100.0% |
| UNDEFINED | 2 | 2 | 0 | 100.0% |
## Quality Indicators
**Excellent Coverage** (≥95%)
**No Invalid Coordinates**
**Low GeoNames Coverage** (0.0%)
## Recommendations
3. **Enhance GeoNames IDs**:
- GeoNames IDs enable better geographic linking
- Consider querying GeoNames API directly
- Use reverse geocoding to find GeoNames IDs
---
**Generated**: '2025-11-07T18:30:43.008016+00:00'
**Source**: `data/instances/global/global_heritage_institutions.yaml`