- Implemented `owl_to_mermaid.py` to convert OWL/Turtle files into Mermaid class diagrams. - Implemented `owl_to_plantuml.py` to convert OWL/Turtle files into PlantUML class diagrams. - Added two new PlantUML files for custodian multi-aspect diagrams.
140 lines
4.2 KiB
Markdown
140 lines
4.2 KiB
Markdown
# Type 'F' (FEATURES) Extraction Summary
|
|
|
|
## Overview
|
|
|
|
This document describes the extraction of all Wikidata entities with type 'F' (FEATURES - Physical landscape features with heritage significance) from the GLAMORCUBEPSXHFN taxonomy.
|
|
|
|
## Extraction Details
|
|
|
|
- **Source File**: `hyponyms_curated_full.yaml`
|
|
- **Output File**: `hyponyms_curated_full_f.yaml`
|
|
- **Extraction Date**: 2025-11-22
|
|
- **Total Entries Extracted**: 298
|
|
|
|
## Type 'F' Definition
|
|
|
|
According to the GLAMORCUBEPSXHFN taxonomy:
|
|
|
|
**FEATURES (F)**: Physical landscape features with heritage significance
|
|
- Examples: monuments, sculptures, statues, memorials, landmarks, cemeteries
|
|
- Not institutions maintaining collections, but the physical features themselves
|
|
- Heritage sites with cultural or historical significance
|
|
|
|
## Distribution by Hypernym Category
|
|
|
|
The 298 extracted entries are distributed across the following hypernym categories:
|
|
|
|
| Hypernym Category | Count | Percentage |
|
|
|-------------------------|-------|------------|
|
|
| heritage site | 144 | 48.3% |
|
|
| building | 33 | 11.1% |
|
|
| protected area | 23 | 7.7% |
|
|
| structure | 12 | 4.0% |
|
|
| museum | 8 | 2.7% |
|
|
| park | 7 | 2.3% |
|
|
| infrastructure | 6 | 2.0% |
|
|
| grave | 6 | 2.0% |
|
|
| space | 5 | 1.7% |
|
|
| memory space | 5 | 1.7% |
|
|
| information point | 5 | 1.7% |
|
|
| natural monument | 5 | 1.7% |
|
|
| tomb | 5 | 1.7% |
|
|
| object | 4 | 1.3% |
|
|
| geographical object | 4 | 1.3% |
|
|
| Other (27 categories) | 26 | 8.7% |
|
|
|
|
## Notable Examples
|
|
|
|
### Buildings
|
|
- Q1802963: mansion
|
|
- Q317557: parish church
|
|
- Q1021645: office building
|
|
|
|
### Structures
|
|
- Q336164: sewerage pumping station
|
|
- Q15710813: physical structure
|
|
- Q1411945: civil engineering construction
|
|
|
|
### Heritage Sites
|
|
- Q3694: vacation property
|
|
- Q2927789: buitenplaats (Dutch country estate)
|
|
- Q136396228: sacred shrine (Bali)
|
|
|
|
### Objects
|
|
- Q16686448: artificial object
|
|
- Q860861: sculpture
|
|
- Q223557: physical object
|
|
|
|
### Settlements
|
|
- Q124250988: urban settlement
|
|
- Q3957: town
|
|
|
|
## File Structure
|
|
|
|
The extracted YAML file maintains the same structure as the source file:
|
|
|
|
```yaml
|
|
metadata:
|
|
source_file: data/wikidata/GLAMORCUBEPSXHFN/hyponyms_curated_full.yaml
|
|
filter_criteria: 'type: F (FEATURES - Physical landscape features with heritage significance)'
|
|
extraction_date: '2025-11-20T18:28:12.123353+00:00'
|
|
count: 298
|
|
|
|
sources:
|
|
[original source metadata preserved]
|
|
|
|
hypernym:
|
|
- curated:
|
|
label: Q1802963
|
|
hypernym:
|
|
- building
|
|
type:
|
|
- F
|
|
wikidata:
|
|
[full Wikidata entity data]
|
|
enrichment_status: success
|
|
identifier: Q1802963
|
|
enrichment_date: '2025-11-20T18:28:12.600122+00:00'
|
|
[... 297 more entries]
|
|
```
|
|
|
|
## File Statistics
|
|
|
|
- **File Size**: 2.2 MB
|
|
- **Line Count**: 90,950 lines
|
|
- **Format**: YAML with UTF-8 encoding
|
|
|
|
## Usage
|
|
|
|
This file can be used for:
|
|
|
|
1. **Ontology mapping**: Understanding which Wikidata entities represent physical features
|
|
2. **Schema validation**: Ensuring FEATURES classification is correctly applied
|
|
3. **Geographic analysis**: Identifying heritage landscape features
|
|
4. **Data quality**: Cross-referencing with other heritage databases
|
|
|
|
## Schema Alignment
|
|
|
|
These entities should map to the following ontology classes in LinkML schema:
|
|
|
|
- **CIDOC-CRM**: `crm:E27_Site` (physical sites)
|
|
- **Schema.org**: `schema:Place`, `schema:LandmarksOrHistoricalBuildings`
|
|
- **RiC-O**: `rico:Place` (archival context)
|
|
|
|
## Next Steps
|
|
|
|
1. Map each Q-number to appropriate LinkML `class_uri`
|
|
2. Create `HeritageCustodian` records for institutions managing these features
|
|
3. Link physical features to custodian organizations via `manages_feature` relationship
|
|
4. Geocode locations for mapping visualization
|
|
|
|
## References
|
|
|
|
- Source taxonomy: GLAMORCUBEPSXHFN (19-type heritage classification)
|
|
- Base ontologies: `/data/ontology/` directory
|
|
- Schema definitions: `/schemas/20251121/linkml/01_custodian_name.yaml`
|
|
|
|
---
|
|
|
|
**Generated**: 2025-11-22
|
|
**Extraction Script**: Python YAML parser with type filter
|