glam/data
kempersc 59963c8d3f Logo enrichment batch: JP+300, CZ-0 - 12,833 files (40.4%)
- JP: 4,496 processed (37.2% of 12,096) - COMPLETE
- CZ: 2,820 processed (33.4% of 8,432) - batch completed, slight decrease
- CH, NL, BE, AT, BR: 100% complete
- Total: 12,833 of 31,772 files (40.4%)
- Using crawl4ai favicon extraction
2025-12-26 13:42:21 +01:00
custodian Logo enrichment batch: JP+300, CZ-0 - 12,833 files (40.4%) 2025-12-26 13:42:21 +01:00
custodian_sample
entity_annotation enrich entries 2025-12-23 13:27:35 +01:00
examples
extracted
google_maps_enrichment
instances
intangible_heritage
isil
json
jsonld
manual_enrichment
museum_register_nl
nde enrich entries 2025-12-23 13:27:35 +01:00
ontology
raw
rdf
reference
reports
review
test
unified
web/lap_gaza_report_2024
wikidata
wikpedia/Destruction_of_cultural_heritage_during_the_Israeli_invasion_of_the_Gaza_Strip
collision_edge_case_analysis.md
deduplication_improvement_summary.md
dutch_collision_report.txt
dutch_collision_stats.json
dutch_deduplication_report.txt
dutch_institutions_with_ghcids.yaml
extraction_checkpoint.json
failed_crawl_urls.txt
failed_crawl_urls_round1_backup.txt
failed_crawl_urls_round3_backup.txt
failed_crawl_urls_round4.txt
ISIL-codes_2025-08-01.csv
linkedin_locations.json
mexican_geography_analysis.yaml
missing_annotations_checkpoint.json
NDE-logo-RGB-basis-nl-blauw.png
reenrich_queue.json
temp_conv1_artifact2.md
temp_conv2_artifact1.md
temp_mexican_conv1.json
temp_mexican_conv2.json
unenriched_urls_round2.txt
xxx_matches.json