- Introduced `llm_extract_archiveslab.py` script for entity and relationship extraction using LLMAnnotator with GLAM-NER v1.7.0. - Replaced regex-based extraction with generative LLM inference. - Added functions for loading markdown content, converting annotation sessions to dictionaries, and generating extraction statistics. - Implemented comprehensive logging of extraction results, including counts of entities, relationships, and specific types like heritage institutions and persons. - Results and statistics are saved in JSON format for further analysis. |
||
|---|---|---|
| .. | ||
| mirror/www.heemkunderavenstein.nl | ||
| pages | ||
| annotations_v1.7.0.yaml | ||
| archive.cdx | ||
| metadata.yaml | ||