glam/src/glam_extractor/__init__.py
kempersc e5a532a8bc Add comprehensive tests for NLP institution extraction and RDF partnership integration
- Introduced `test_nlp_extractor.py` with unit tests for the InstitutionExtractor, covering various extraction patterns (ISIL, Wikidata, VIAF, city names) and ensuring proper classification of institutions (museum, library, archive).
- Added tests for extracted entities and result handling to validate the extraction process.
- Created `test_partnership_rdf_integration.py` to validate the end-to-end process of extracting partnerships from a conversation and exporting them to RDF format.
- Implemented tests for temporal properties in partnerships and ensured compliance with W3C Organization Ontology patterns.
- Verified that extracted partnerships are correctly linked with PROV-O provenance metadata.
2025-11-19 23:20:47 +01:00

21 lines
670 B
Python

"""
GLAM Extractor
A comprehensive toolkit for extracting and standardizing global GLAM (Galleries,
Libraries, Archives, Museums) institutional data from conversation transcripts and
authoritative registries.
"""
__version__ = "0.1.0"
__author__ = "GLAM Data Extraction Project"
# TODO: Import actual implementations when ready
# from glam_extractor.parsers import ConversationParser, CSVParser
# from glam_extractor.extractors import InstitutionExtractor, IdentifierExtractor
# from glam_extractor.validators import LinkMLValidator
# from glam_extractor.exporters import JSONLDExporter, RDFExporter, CSVExporter
__all__ = [
"__version__",
"__author__",
]