- Introduced `test_nlp_extractor.py` with unit tests for the InstitutionExtractor, covering various extraction patterns (ISIL, Wikidata, VIAF, city names) and ensuring proper classification of institutions (museum, library, archive). - Added tests for extracted entities and result handling to validate the extraction process. - Created `test_partnership_rdf_integration.py` to validate the end-to-end process of extracting partnerships from a conversation and exporting them to RDF format. - Implemented tests for temporal properties in partnerships and ensured compliance with W3C Organization Ontology patterns. - Verified that extracted partnerships are correctly linked with PROV-O provenance metadata.
21 lines
670 B
Python
21 lines
670 B
Python
"""
|
|
GLAM Extractor
|
|
|
|
A comprehensive toolkit for extracting and standardizing global GLAM (Galleries,
|
|
Libraries, Archives, Museums) institutional data from conversation transcripts and
|
|
authoritative registries.
|
|
"""
|
|
|
|
__version__ = "0.1.0"
|
|
__author__ = "GLAM Data Extraction Project"
|
|
|
|
# TODO: Import actual implementations when ready
|
|
# from glam_extractor.parsers import ConversationParser, CSVParser
|
|
# from glam_extractor.extractors import InstitutionExtractor, IdentifierExtractor
|
|
# from glam_extractor.validators import LinkMLValidator
|
|
# from glam_extractor.exporters import JSONLDExporter, RDFExporter, CSVExporter
|
|
|
|
__all__ = [
|
|
"__version__",
|
|
"__author__",
|
|
]
|