glam/data
kempersc 55e2cd2340 feat: implement LLM-based extraction for Archives Lab content
- Introduced `llm_extract_archiveslab.py` script for entity and relationship extraction using LLMAnnotator with GLAM-NER v1.7.0.
- Replaced regex-based extraction with generative LLM inference.
- Added functions for loading markdown content, converting annotation sessions to dictionaries, and generating extraction statistics.
- Implemented comprehensive logging of extraction results, including counts of entities, relationships, and specific types like heritage institutions and persons.
- Results and statistics are saved in JSON format for further analysis.
2025-12-05 23:16:21 +01:00
entity_annotation annotation standards added 2025-12-05 15:30:23 +01:00
examples
extracted feat: implement LLM-based extraction for Archives Lab content 2025-12-05 23:16:21 +01:00
instances add pids 2025-12-01 23:55:55 +01:00
intangible_heritage annotation standards added 2025-12-05 15:30:23 +01:00
isil annotation standards added 2025-12-05 15:30:23 +01:00
jsonld
manual_enrichment
museum_register_nl update entries 2025-11-30 23:30:29 +01:00
nde feat: implement LLM-based extraction for Archives Lab content 2025-12-05 23:16:21 +01:00
ontology validate enrichments 2025-12-02 14:36:01 +01:00
raw
rdf
reference annotation standards added 2025-12-05 15:30:23 +01:00
review
unified update entries 2025-11-30 23:30:29 +01:00
wikidata
collision_edge_case_analysis.md
deduplication_improvement_summary.md
dutch_collision_report.txt
dutch_collision_stats.json
dutch_deduplication_report.txt
dutch_institutions_with_ghcids.yaml
extraction_checkpoint.json feat: implement LLM-based extraction for Archives Lab content 2025-12-05 23:16:21 +01:00
ISIL-codes_2025-08-01.csv
mexican_geography_analysis.yaml
NDE-logo-RGB-basis-nl-blauw.png update entries 2025-11-30 23:30:29 +01:00
temp_conv1_artifact2.md
temp_conv2_artifact1.md
temp_mexican_conv1.json
temp_mexican_conv2.json