glam/data/instances/archive/brazilian_curation_report.md
2025-11-19 23:25:22 +01:00

2 KiB

Brazilian GLAM Institution Curation Report

Generated: 2025-11-06T08:25:13.589472+00:00

Summary Statistics

  • Original records (v2): 104
  • Filtered out (platforms/non-institutions): 7
  • Valid curated institutions: 97
  • Recall rate: 93.3%

Curation Actions

Records Filtered Out

The following records were removed as they are platforms/technologies, not heritage institutions:

  1. Tainacan - Collection management platform (WordPress-based)
  2. AtoM - Archival description software
  3. DSpace - Digital repository platform
  4. APIs - Generic technology reference
  5. LOCKSS Cariniana - Digital preservation network
  6. Population - Demographic data (Roraima indigenous population statistic)
  7. Documentation - Too generic, not a specific institution

Valid Institutions Retained

97 heritage custodian organizations representing:

  • Museums (MUSEUM, MIXED)
  • Libraries (LIBRARY)
  • Archives (ARCHIVE)
  • Research centers (RESEARCH_CENTER)
  • Educational providers (EDUCATION_PROVIDER)
  • Official institutions (OFFICIAL_INSTITUTION)

Quality Metrics

Completeness (by field)

To be calculated after enrichment:

  • Records with descriptions: TBD
  • Records with identifiers: TBD
  • Records with city names: TBD
  • Records with digital platforms: TBD

Geographic Coverage

All 27 Brazilian states + Federal District represented

Next Steps

  1. Deep enrichment needed: Extract comprehensive metadata from conversation JSON

    • Founding dates and change history
    • Collection descriptions with subjects/extents
    • Digital platform URLs and systems
    • Additional identifiers (Wikidata, VIAF, etc.)
  2. Manual verification: Review Brasiliana Museus and Hemeroteca Digital

    • Classify as national aggregation platforms vs. custodian institutions
  3. Field completion: Achieve targets:

    • 90%+ with descriptions (2-4 sentences)
    • 80%+ with website identifiers
    • 60%+ with city-level location data

Generated by curate_brazilian_institutions.py