GLAM heritage institution data extraction and management
kempersc e5a532a8bc Add comprehensive tests for NLP institution extraction and RDF partnership integration
- Introduced `test_nlp_extractor.py` with unit tests for the InstitutionExtractor, covering various extraction patterns (ISIL, Wikidata, VIAF, city names) and ensuring proper classification of institutions (museum, library, archive).
- Added tests for extracted entities and result handling to validate the extraction process.
- Created `test_partnership_rdf_integration.py` to validate the end-to-end process of extracting partnerships from a conversation and exporting them to RDF format.
- Implemented tests for temporal properties in partnerships and ensured compliance with W3C Organization Ontology patterns.
- Verified that extracted partnerships are correctly linked with PROV-O provenance metadata.
2025-11-19 23:20:47 +01:00
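
The identifier patterns named in the commit message (ISIL, Wikidata, VIAF) and the museum/library/archive classification can be illustrated with a minimal sketch. This is not the repository's `InstitutionExtractor`: the regular expressions, the `extract_identifiers` and `classify_institution` helpers, and the type labels are all hypothetical, and the ISIL pattern is a simplification that assumes a two-letter country prefix.

```python
import re

# Hypothetical, simplified patterns (assumptions, not the project's actual rules).
# ISIL: assumed two-letter country prefix, a hyphen, then up to 11 characters.
# This naive pattern also matches ordinary words such as "US-based".
ISIL_RE = re.compile(r"\b[A-Z]{2}-[A-Za-z0-9:/-]{1,11}\b")
# Wikidata item IDs: "Q" followed by a positive integer.
WIKIDATA_RE = re.compile(r"\bQ[1-9][0-9]*\b")
# VIAF: capture the numeric cluster ID from a viaf.org URL.
VIAF_RE = re.compile(r"viaf\.org/viaf/([0-9]+)")

# Hypothetical keyword-to-type mapping for a coarse classification.
TYPE_KEYWORDS = {"museum": "MUSEUM", "library": "LIBRARY", "archive": "ARCHIVE"}


def extract_identifiers(text):
    """Return every identifier-like string found in text, grouped by scheme."""
    return {
        "isil": ISIL_RE.findall(text),
        "wikidata": WIKIDATA_RE.findall(text),
        "viaf": VIAF_RE.findall(text),
    }


def classify_institution(name):
    """Guess an institution type from keywords in its name."""
    lowered = name.lower()
    for keyword, label in TYPE_KEYWORDS.items():
        if keyword in lowered:
            return label
    return "UNKNOWN"
```

Calling `extract_identifiers` on a sentence that mentions a made-up ISIL code, a Q-number, and a VIAF URL returns one list per scheme. A unit test along the lines of those described in the commit would assert on those lists and on the label returned by `classify_institution`.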
| Name | Last commit | Date |
|---|---|---|
| data/instances/brazil | Deduplicate Brazilian institutions (212→121) | 2025-11-11 22:08:34 +01:00 |
| schemas | Add comprehensive tests for NLP institution extraction and RDF partnership integration | 2025-11-19 23:20:47 +01:00 |
| scripts | Add comprehensive tests for NLP institution extraction and RDF partnership integration | 2025-11-19 23:20:47 +01:00 |
| src/glam_extractor | Add comprehensive tests for NLP institution extraction and RDF partnership integration | 2025-11-19 23:20:47 +01:00 |
| tests | Add comprehensive tests for NLP institution extraction and RDF partnership integration | 2025-11-19 23:20:47 +01:00 |
| deduplicate_brazilian_institutions.py | Deduplicate Brazilian institutions (212→121) | 2025-11-11 22:08:34 +01:00 |