backup CZ and JP
parent
45e873ec0a
commit
b42d6bf5d2
10000 changed files with 2462216 additions and 87 deletions
235
.opencode/CLAUDE_CONVERSATION_MIGRATION_SPEC.md
Normal file
|
|
@ -0,0 +1,235 @@
|
|||
# Migration Specification: `agent: claude-conversation` → Proper Provenance
|
||||
|
||||
**Created**: 2025-12-30
|
||||
**Status**: SPECIFICATION (Not Yet Implemented)
|
||||
**Related**: PROVENANCE_TIMESTAMP_RULES.md, WEB_OBSERVATION_PROVENANCE_RULES.md
|
||||
|
||||
## Problem Statement
|
||||
|
||||
24,328 custodian YAML files in `data/custodian/` have provenance statements with:
|
||||
- `agent: claude-conversation` (vague, non-specific agent identifier)
|
||||
- Single `timestamp` field (violates Rule 35: dual timestamp requirement)
|
||||
- No distinction between statement creation and source archival
|
||||
|
||||
## Affected Files
|
||||
|
||||
All files matching:
|
||||
```bash
|
||||
grep -l "agent: claude-conversation" data/custodian/*.yaml
|
||||
# Result: 24,328 files
|
||||
```
|
||||
|
||||
### Provenance Locations in Each File
|
||||
|
||||
1. **`ch_annotator.extraction_provenance.agent`** - Top-level extraction agent
|
||||
2. **`ch_annotator.entity_claims[].provenance.agent`** - Per-claim provenance (multiple instances)
|
||||
|
||||
## Source Data Categories
|
||||
|
||||
The 24,328 files come from different original sources, requiring different migration strategies:
|
||||
|
||||
### Category 1: ISIL Registry / CSV Sources (~18,000 files)
|
||||
|
||||
**Examples**: Japan, Austria, Switzerland, Czech, Bulgarian, Belgian ISIL registries
|
||||
|
||||
**Characteristics**:
|
||||
- `path: /files/{country}_complete.yaml`
|
||||
- Data originated from authoritative CSV registries
|
||||
- The CSV files are already archived in `data/instances/`
|
||||
|
||||
**Migration Strategy** (Scripted):
|
||||
```yaml
# BEFORE
extraction_provenance:
  path: /files/japan_complete.yaml
  timestamp: '2025-11-18T14:46:40.580095+00:00'
  agent: claude-conversation  # ← INVALID

# AFTER
extraction_provenance:
  source_type: isil_registry_csv
  source_path: /files/japan_complete.yaml
  source_archived_at: '2025-11-18T14:46:40.580095+00:00'  # When CSV was processed
  statement_created_at: '2025-12-06T21:13:31.304940+00:00'  # From annotation_date
  agent: batch-script-create-custodian-from-ch-annotator
  context_convention: ch_annotator-v1_7_0
```
|
||||
|
||||
### Category 2: Conversation-Extracted Data (~4,000 files)
|
||||
|
||||
**Examples**: Palestinian heritage custodians, some Latin American institutions
|
||||
|
||||
**Characteristics**:
|
||||
- `path: /conversations/{uuid}`
|
||||
- Data extracted from Claude conversation exports
|
||||
- Need to trace back to original sources mentioned IN the conversation
|
||||
|
||||
**Migration Strategy** (Requires GLM4.7 + Manual Review):
|
||||
1. Load the conversation JSON file
|
||||
2. Use GLM4.7 to identify the ACTUAL sources mentioned in conversation
|
||||
3. For each source type:
|
||||
- **Web sources**: Use web-reader to archive + extract with XPath
|
||||
- **Wikidata**: Add Wikidata entity provenance
|
||||
- **Academic sources**: Add DOI/citation provenance
|
||||
|
||||
### Category 3: Web-Enriched Data (~2,000 files)
|
||||
|
||||
**Examples**: Institutions with `web_enrichment`, `google_maps_enrichment`
|
||||
|
||||
**Characteristics**:
|
||||
- Have web-scraped data that needs XPath provenance
|
||||
- May have Google Maps or OSM enrichment
|
||||
|
||||
**Migration Strategy** (Requires web-reader + Playwright):
|
||||
1. Re-archive source websites using Playwright
|
||||
2. Use web-reader to extract claims with XPath provenance
|
||||
3. Generate dual timestamps from archival metadata
|
||||
|
||||
## Migration Pipeline Architecture
|
||||
|
||||
```
|
||||
┌─────────────────────────────────────────────────────────────────────┐
|
||||
│ MIGRATION PIPELINE │
|
||||
├─────────────────────────────────────────────────────────────────────┤
|
||||
│ │
|
||||
│ ┌─────────────┐ ┌──────────────────┐ ┌─────────────────┐ │
|
||||
│ │ Categorizer │ ──▶ │ Source Resolver │ ──▶ │ Provenance │ │
|
||||
│ │ │ │ │ │ Generator │ │
|
||||
│ │ - Detect │ │ - CSV Registry │ │ │ │
|
||||
│ │ source │ │ - Conversation │ │ - Dual │ │
|
||||
│ │ type │ │ - Web Archive │ │ timestamps │ │
|
||||
│ │ - Route to │ │ - Wikidata │ │ - Valid agent │ │
|
||||
│ │ handler │ │ │ │ - Source refs │ │
|
||||
│ └─────────────┘ └──────────────────┘ └─────────────────┘ │
|
||||
│ │ │ │ │
|
||||
│ ▼ ▼ ▼ │
|
||||
│ ┌─────────────────────────────────────────────────────────────┐ │
|
||||
│ │ Source-Specific Handlers │ │
|
||||
│ ├─────────────────────────────────────────────────────────────┤ │
|
||||
│ │ │ │
|
||||
│ │ ┌────────────────┐ ┌────────────────┐ ┌───────────────┐ │ │
|
||||
│ │ │ ISIL/CSV │ │ Conversation │ │ Web Archive │ │ │
|
||||
│ │ │ Handler │ │ Handler │ │ Handler │ │ │
|
||||
│ │ │ │ │ │ │ │ │ │
|
||||
│ │ │ - Read CSV │ │ - Parse JSON │ │ - Playwright │ │ │
|
||||
│ │ │ - Map to │ │ - GLM4.7 │ │ - web-reader │ │ │
|
||||
│ │ │ timestamps │ │ analysis │ │ - XPath │ │ │
|
||||
│ │ │ - Update │ │ - Source │ │ extraction │ │ │
|
||||
│ │ │ provenance │ │ tracing │ │ │ │ │
|
||||
│ │ └────────────────┘ └────────────────┘ └───────────────┘ │ │
|
||||
│ │ │ │
|
||||
│ └─────────────────────────────────────────────────────────────┘ │
|
||||
│ │
|
||||
│ ┌─────────────────────────────────────────────────────────────┐ │
|
||||
│ │ Validation Layer │ │
|
||||
│ ├─────────────────────────────────────────────────────────────┤ │
|
||||
│ │ - Dual timestamp check (Rule 35) │ │
|
||||
│ │ - Agent identifier validation │ │
|
||||
│ │ - source_archived_at <= statement_created_at │ │
|
||||
│ │ - XPath verification (where applicable) │ │
|
||||
│ └─────────────────────────────────────────────────────────────┘ │
|
||||
│ │
|
||||
└─────────────────────────────────────────────────────────────────────┘
|
||||
```
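The Categorizer stage in the diagram can be sketched as a small routing function. This is a minimal sketch, not the project's implementation: the function name `categorize`, the returned labels, and the precedence (enrichment keys checked before the `path` prefix) are assumptions; only the `path` prefixes and the `web_enrichment` / `google_maps_enrichment` keys come from the category descriptions.

```python
from typing import Any, Dict


def categorize(custodian: Dict[str, Any]) -> str:
    """Route a parsed custodian record to a handler (labels are hypothetical)."""
    prov = custodian.get("ch_annotator", {}).get("extraction_provenance", {})
    path = prov.get("path", "")
    # Assumption: web-enriched records take precedence over path-based routing.
    if "web_enrichment" in custodian or "google_maps_enrichment" in custodian:
        return "web_archive"
    if path.startswith("/conversations/"):
        return "conversation"
    if path.startswith("/files/"):
        return "isil_csv"
    return "unknown"  # nothing matched: flag for manual review
```

Returning `"unknown"` instead of raising keeps a batch run going and leaves unmatched files for manual review.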
|
||||
|
||||
## Implementation Phases
|
||||
|
||||
### Phase 1: Category 1 - ISIL/CSV Sources (Scripted, No LLM Required)
|
||||
|
||||
**Scope**: ~18,000 files
|
||||
**Effort**: 1-2 days scripting
|
||||
**Tools**: Python script only
|
||||
|
||||
Files where `path` matches `/files/*.yaml` or `/files/*.csv`:
|
||||
- Parse the annotation_date to get `statement_created_at`
|
||||
- Use the original file's processing timestamp for `source_archived_at`
|
||||
- Replace `agent: claude-conversation` with source-specific agent
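The three steps above can be sketched as a pure function over one provenance block. This is a hedged sketch: `migrate_csv_provenance` and `NEW_AGENT` are hypothetical names, and it assumes the old `timestamp` value is the processing time that becomes `source_archived_at`, as in the Category 1 example.

```python
from typing import Any, Dict

# Agent value taken from the Category 1 example in this spec.
NEW_AGENT = "batch-script-create-custodian-from-ch-annotator"


def migrate_csv_provenance(prov: Dict[str, Any], annotation_date: str) -> Dict[str, Any]:
    """Return a migrated copy of one Category 1 provenance block."""
    if prov.get("agent") != "claude-conversation":
        return dict(prov)  # not affected, or already migrated
    # Keep every existing field except the two being replaced.
    migrated = {k: v for k, v in prov.items() if k not in ("timestamp", "agent")}
    migrated["source_archived_at"] = prov["timestamp"]  # KeyError if absent: fail loudly
    migrated["statement_created_at"] = annotation_date
    migrated["agent"] = NEW_AGENT
    migrated["source_type"] = "isil_registry_csv"
    return migrated
```

Because the function returns a copy and skips blocks that no longer carry `claude-conversation`, running it twice over the same file leaves the result unchanged.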
|
||||
|
||||
### Phase 2: Category 2 - Conversation Sources (GLM4.7 Required)
|
||||
|
||||
**Scope**: ~4,000 files
|
||||
**Effort**: 3-5 days with LLM processing
|
||||
**Tools**: GLM4.7 API, conversation JSON parser
|
||||
|
||||
For each file with `path: /conversations/{uuid}`:
|
||||
1. Load conversation JSON from archive (if available)
|
||||
2. Send to GLM4.7 with prompt to identify actual data sources
|
||||
3. Update provenance based on source analysis
|
||||
|
||||
### Phase 3: Category 3 - Web Sources (web-reader + Playwright)
|
||||
|
||||
**Scope**: ~2,000 files
|
||||
**Effort**: 5-10 days with web archival
|
||||
**Tools**: Playwright, web-reader MCP, GLM4.7
|
||||
|
||||
For files with web-derived claims:
|
||||
1. Archive source URLs using Playwright
|
||||
2. Extract claims with XPath using web-reader
|
||||
3. Generate dual timestamps from archival metadata
|
||||
|
||||
## File Updates
|
||||
|
||||
### Per-File Changes
|
||||
|
||||
For each of the 24,328 files:
|
||||
|
||||
1. **Update `ch_annotator.extraction_provenance`**:
|
||||
   ```yaml
   extraction_provenance:
     # Existing fields retained
     namespace: glam
     path: /files/japan_complete.yaml
     context_convention: ch_annotator-v1_7_0

     # NEW: Dual timestamps
     source_archived_at: '2025-11-18T14:46:40.580095+00:00'
     statement_created_at: '2025-12-06T21:13:31.304940+00:00'

     # NEW: Valid agent identifier
     agent: batch-script-create-custodian-from-ch-annotator

     # NEW: Source classification
     source_type: isil_registry_csv

     # NEW: Migration tracking
     migration_note: 'Migrated from agent:claude-conversation on 2025-12-30'
   ```
|
||||
|
||||
2. **Update each `ch_annotator.entity_claims[].provenance`**:
|
||||
   ```yaml
   provenance:
     namespace: glam
     path: /files/japan_complete.yaml
     context_convention: ch_annotator-v1_7_0

     # NEW: Dual timestamps (inherited from parent)
     source_archived_at: '2025-11-18T14:46:40.580095+00:00'
     statement_created_at: '2025-12-06T21:13:31.304940+00:00'

     # NEW: Valid agent
     agent: batch-script-create-custodian-from-ch-annotator
   ```
|
||||
|
||||
## Validation Criteria
|
||||
|
||||
After migration, every provenance block MUST pass:
|
||||
|
||||
1. ✅ `statement_created_at` is present (ISO 8601)
|
||||
2. ✅ `source_archived_at` is present (ISO 8601)
|
||||
3. ✅ `source_archived_at <= statement_created_at`
|
||||
4. ✅ `agent` is NOT `claude-conversation`, `claude`, `ai`, `opencode`, or `llm`
|
||||
5. ✅ `agent` follows format `{tool}-{model}-{version}` or `{script-name}`
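Criteria 1 to 4 can be checked mechanically. The following is a minimal sketch under stated assumptions: `validate_provenance` and the error strings are hypothetical, and criterion 5 (the agent naming format) is deliberately not checked because the spec does not define it precisely enough to test.

```python
from datetime import datetime
from typing import Any, Dict, List

# Agent values rejected by validation criterion 4.
FORBIDDEN_AGENTS = {"claude-conversation", "claude", "ai", "opencode", "llm"}
TIMESTAMP_FIELDS = ("statement_created_at", "source_archived_at")


def validate_provenance(prov: Dict[str, Any]) -> List[str]:
    """Return a list of violations; an empty list means the block passes."""
    errors: List[str] = []
    parsed: Dict[str, datetime] = {}
    for field in TIMESTAMP_FIELDS:
        value = prov.get(field)
        if not value:
            errors.append(f"{field} missing")
            continue
        try:
            # fromisoformat() only accepts a trailing "Z" from Python 3.11 on.
            stamp = datetime.fromisoformat(str(value).replace("Z", "+00:00"))
        except ValueError:
            errors.append(f"{field} is not ISO 8601")
            continue
        if stamp.tzinfo is None:
            errors.append(f"{field} has no timezone")
            continue
        parsed[field] = stamp
    if len(parsed) == 2 and parsed["source_archived_at"] > parsed["statement_created_at"]:
        errors.append("source_archived_at is later than statement_created_at")
    agent = str(prov.get("agent", "")).strip().lower()
    if not agent or agent in FORBIDDEN_AGENTS:
        errors.append("agent missing or too vague")
    return errors
```

Collecting all violations rather than stopping at the first makes the output usable as input for the diff report.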
|
||||
|
||||
## Rollback Strategy
|
||||
|
||||
Before migration:
|
||||
1. Create timestamped backup: `data/custodian.backup.2025-12-30/`
|
||||
2. Store original provenance in `_migration_backup` field
|
||||
3. Generate diff report for manual review
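Step 2 can be sketched as a pair of helpers that attach and restore the original block. This is an assumption-laden sketch: the spec names the `_migration_backup` field but not where it lives, so placing it inside each migrated provenance block is a guess, and both function names are hypothetical.

```python
import copy
from typing import Any, Dict


def with_migration_backup(original: Dict[str, Any], migrated: Dict[str, Any]) -> Dict[str, Any]:
    """Return the migrated block with the original stored under `_migration_backup`."""
    result = dict(migrated)
    result["_migration_backup"] = copy.deepcopy(original)
    return result


def rollback(prov: Dict[str, Any]) -> Dict[str, Any]:
    """Restore the original block if a backup is present, else return a copy."""
    backup = prov.get("_migration_backup")
    return copy.deepcopy(backup) if backup is not None else dict(prov)
```

The in-file backup complements the timestamped directory copy rather than replacing it: it allows a single block to be reverted without restoring the whole file.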
|
||||
|
||||
## References
|
||||
|
||||
- Rule 35: `.opencode/PROVENANCE_TIMESTAMP_RULES.md`
|
||||
- Rule 6: `.opencode/WEB_OBSERVATION_PROVENANCE_RULES.md`
|
||||
- CH-Annotator: `data/entity_annotation/ch_annotator-v1_7_0.yaml`
|
||||
- web-reader script: `scripts/add_web_claim_provenance.py`
|
||||
477
.opencode/GLM4_CONVERSATION_ANALYSIS_PROMPTS.md
Normal file
|
|
@ -0,0 +1,477 @@
|
|||
# GLM4.7 Prompts for Category 2: Conversation Source Analysis
|
||||
|
||||
**Created**: 2025-12-30
|
||||
**Status**: SPECIFICATION
|
||||
**Related**: CLAUDE_CONVERSATION_MIGRATION_SPEC.md, PROVENANCE_TIMESTAMP_RULES.md
|
||||
|
||||
## Purpose
|
||||
|
||||
Category 2 files (~4,000) have provenance paths like `/conversations/{uuid}` which reference Claude conversation exports. The actual data sources (Wikidata, websites, registries, academic papers) are mentioned WITHIN the conversation text.
|
||||
|
||||
GLM4.7 is used to:
|
||||
1. Parse conversation JSON files
|
||||
2. Identify the REAL data sources mentioned
|
||||
3. Extract source metadata (URLs, timestamps, identifiers)
|
||||
4. Generate proper dual-timestamp provenance
|
||||
|
||||
## Workflow Overview
|
||||
|
||||
```
|
||||
┌─────────────────────────────────────────────────────────────────┐
|
||||
│ CATEGORY 2 MIGRATION WORKFLOW │
|
||||
├─────────────────────────────────────────────────────────────────┤
|
||||
│ │
|
||||
│ ┌──────────────┐ ┌──────────────┐ ┌─────────────────┐ │
|
||||
│ │ Custodian │ │ Conversation │ │ GLM4.7 │ │
|
||||
│ │ YAML File │ ──▶ │ JSON Archive │ ──▶ │ Source Analysis │ │
|
||||
│ │ │ │ │ │ │ │
|
||||
│ │ path: /conv/ │ │ Full text of │ │ Identify: │ │
|
||||
│ │ {uuid} │ │ messages │ │ - URLs │ │
|
||||
│ └──────────────┘ └──────────────┘ │ - Wikidata IDs │ │
|
||||
│ │ - Registry refs │ │
|
||||
│ │ - API calls │ │
|
||||
│ └─────────────────┘ │
|
||||
│ │ │
|
||||
│ ▼ │
|
||||
│ ┌─────────────────────────────────┐ │
|
||||
│ │ Source-Specific Handlers │ │
|
||||
│ ├─────────────────────────────────┤ │
|
||||
│ │ │ │
|
||||
│ │ ┌───────────┐ ┌───────────────┐ │ │
|
||||
│ │ │ Web URLs │ │ Wikidata IDs │ │ │
|
||||
│ │ │ │ │ │ │ │
|
||||
│ │ │ Playwright│ │ SPARQL query │ │ │
|
||||
│ │ │ archive + │ │ to verify │ │ │
|
||||
│ │ │ web-reader│ │ claims │ │ │
|
||||
│ │ └───────────┘ └───────────────┘ │ │
|
||||
│ │ │ │
|
||||
│ │ ┌───────────┐ ┌───────────────┐ │ │
|
||||
│ │ │ Registry │ │ Academic │ │ │
|
||||
│ │ │ References│ │ Citations │ │ │
|
||||
│ │ │ │ │ │ │ │
|
||||
│ │ │ Map to │ │ DOI lookup │ │ │
|
||||
│ │ │ CSV files │ │ CrossRef API │ │ │
|
||||
│ │ └───────────┘ └───────────────┘ │ │
|
||||
│ └─────────────────────────────────┘ │
|
||||
│ │
|
||||
└─────────────────────────────────────────────────────────────────┘
|
||||
```
|
||||
|
||||
## GLM4.7 Prompts
|
||||
|
||||
### Prompt 1: Source Identification
|
||||
|
||||
```markdown
|
||||
# Task: Identify Data Sources in Heritage Custodian Conversation
|
||||
|
||||
You are analyzing a Claude conversation that was used to extract heritage institution data. Your task is to identify ALL data sources mentioned or used in this conversation.
|
||||
|
||||
## Conversation Content
|
||||
{conversation_json}
|
||||
|
||||
## Institution Being Analyzed
|
||||
- Name: {institution_name}
|
||||
- GHCID: {ghcid}
|
||||
- Current provenance path: /conversations/{conversation_uuid}
|
||||
|
||||
## Instructions
|
||||
|
||||
1. Read through the entire conversation carefully
|
||||
2. Identify every data source mentioned or used, including:
|
||||
- **Web URLs**: Institution websites, registry portals, news articles
|
||||
- **Wikidata**: Entity IDs (Q-numbers) referenced or queried
|
||||
- **API Calls**: Any structured data fetches (SPARQL, REST APIs)
|
||||
- **CSV/Registry References**: ISIL registries, national databases
|
||||
- **Academic Sources**: Papers, reports, DOIs
|
||||
- **Government Sources**: Official publications, gazettes
|
||||
|
||||
3. For each source, extract:
|
||||
- Source type (web, wikidata, api, registry, academic, government)
|
||||
- Source identifier (URL, Q-number, DOI, etc.)
|
||||
- What data was extracted from it
|
||||
- Approximate timestamp of access (if mentioned)
|
||||
|
||||
## Output Format
|
||||
|
||||
Return a JSON object that lists the sources in an `identified_sources` array:
|
||||
|
||||
```json
|
||||
{
|
||||
"institution_name": "{institution_name}",
|
||||
"ghcid": "{ghcid}",
|
||||
"conversation_uuid": "{conversation_uuid}",
|
||||
"identified_sources": [
|
||||
{
|
||||
"source_type": "web",
|
||||
"source_url": "https://example.org/about",
|
||||
"source_identifier": null,
|
||||
"data_extracted": ["name", "address", "opening_hours"],
|
||||
"access_timestamp": "2025-09-22T14:40:00Z",
|
||||
"confidence": 0.95,
|
||||
"evidence_quote": "Looking at their website at example.org..."
|
||||
},
|
||||
{
|
||||
"source_type": "wikidata",
|
||||
"source_url": "https://www.wikidata.org/wiki/Q12345",
|
||||
"source_identifier": "Q12345",
|
||||
"data_extracted": ["instance_of", "country", "coordinates"],
|
||||
"access_timestamp": null,
|
||||
"confidence": 0.98,
|
||||
"evidence_quote": "According to Wikidata (Q12345)..."
|
||||
}
|
||||
],
|
||||
"analysis_notes": "Any relevant observations about source quality or gaps"
|
||||
}
|
||||
```
|
||||
|
||||
## Important
|
||||
|
||||
- Only include sources that were ACTUALLY used to extract data
|
||||
- Do not invent sources - if unsure, set confidence lower
|
||||
- Include the exact quote from conversation that references each source
|
||||
- If no sources can be identified, return an empty `identified_sources` array and explain why in `analysis_notes`
|
||||
```
|
||||
|
||||
### Prompt 2: Claim-Source Attribution
|
||||
|
||||
```markdown
|
||||
# Task: Map Claims to Their Original Sources
|
||||
|
||||
You have identified the following sources used in a heritage custodian conversation:
|
||||
|
||||
## Identified Sources
|
||||
{identified_sources_json}
|
||||
|
||||
## Entity Claims from Custodian File
|
||||
{entity_claims_json}
|
||||
|
||||
## Institution
|
||||
- Name: {institution_name}
|
||||
- GHCID: {ghcid}
|
||||
|
||||
## Instructions
|
||||
|
||||
For each entity claim, determine which source(s) it was derived from.
|
||||
|
||||
1. Analyze each claim (full_name, institution_type, located_in_city, etc.)
|
||||
2. Match it to the most likely source based on:
|
||||
- What data each source provides
|
||||
- The conversation context
|
||||
- Claim confidence scores
|
||||
|
||||
3. Generate proper provenance for each claim
|
||||
|
||||
## Output Format
|
||||
|
||||
Return updated provenance for each claim:
|
||||
|
||||
```json
|
||||
{
|
||||
"claim_provenance_updates": [
|
||||
{
|
||||
"claim_type": "full_name",
|
||||
"claim_value": "Example Museum",
|
||||
"attributed_source": {
|
||||
"source_type": "web",
|
||||
"source_url": "https://example.org/about",
|
||||
"source_archived_at": "2025-09-22T14:40:00Z",
|
||||
"statement_created_at": "2025-12-06T21:13:31Z",
|
||||
"agent": "opencode-claude-sonnet-4",
|
||||
"attribution_confidence": 0.92
|
||||
},
|
||||
"attribution_rationale": "Name found on official website header"
|
||||
},
|
||||
{
|
||||
"claim_type": "wikidata_id",
|
||||
"claim_value": "Q12345",
|
||||
"attributed_source": {
|
||||
"source_type": "wikidata",
|
||||
"source_url": "https://www.wikidata.org/wiki/Q12345",
|
||||
"source_archived_at": "2025-09-22T14:45:00Z",
|
||||
"statement_created_at": "2025-12-06T21:13:31Z",
|
||||
"agent": "opencode-claude-sonnet-4",
|
||||
"attribution_confidence": 1.0
|
||||
},
|
||||
"attribution_rationale": "Directly queried from Wikidata"
|
||||
}
|
||||
],
|
||||
"unattributed_claims": [
|
||||
{
|
||||
"claim_type": "opening_hours",
|
||||
"claim_value": "Mon-Fri 9-17",
|
||||
"reason": "Source could not be determined from conversation"
|
||||
}
|
||||
]
|
||||
}
|
||||
```
|
||||
|
||||
## Rules
|
||||
|
||||
- If a claim cannot be attributed to any identified source, add to unattributed_claims
|
||||
- For unattributed claims, the migration script will flag for manual review
|
||||
- If a source has no access timestamp, use the conversation's own timestamp as the fallback source_archived_at
|
||||
- statement_created_at should use the annotation_date from CH-Annotator
|
||||
```
|
||||
|
||||
### Prompt 3: Web Source Verification
|
||||
|
||||
```markdown
|
||||
# Task: Verify Web Sources for Archival
|
||||
|
||||
Before we archive web sources with Playwright, verify they are valid and relevant.
|
||||
|
||||
## Web Sources to Verify
|
||||
{web_sources_json}
|
||||
|
||||
## Institution
|
||||
- Name: {institution_name}
|
||||
- GHCID: {ghcid}
|
||||
|
||||
## Instructions
|
||||
|
||||
For each web source, determine:
|
||||
|
||||
1. **URL Validity**: Is the URL well-formed and likely still accessible?
|
||||
2. **Relevance**: Does this URL relate to the institution?
|
||||
3. **Archive Priority**: Should we archive this with Playwright?
|
||||
4. **Expected Content**: What data should we extract with web-reader?
|
||||
|
||||
## Output Format
|
||||
|
||||
```json
|
||||
{
|
||||
"web_source_verification": [
|
||||
{
|
||||
"source_url": "https://example.org/about",
|
||||
"url_valid": true,
|
||||
"is_institution_website": true,
|
||||
"archive_priority": "high",
|
||||
"expected_claims": ["name", "address", "description", "contact"],
|
||||
"web_reader_selectors": {
|
||||
"name": "h1.institution-name",
|
||||
"address": ".contact-info address",
|
||||
"description": "main .about-text"
|
||||
},
|
||||
"notes": "Official institution website - primary source"
|
||||
},
|
||||
{
|
||||
"source_url": "https://twitter.com/example",
|
||||
"url_valid": true,
|
||||
"is_institution_website": false,
|
||||
"archive_priority": "low",
|
||||
"expected_claims": ["social_media_handle"],
|
||||
"web_reader_selectors": null,
|
||||
"notes": "Social media - only need URL, not content"
|
||||
}
|
||||
],
|
||||
"sources_to_archive": ["https://example.org/about"],
|
||||
"sources_to_skip": ["https://twitter.com/example"]
|
||||
}
|
||||
```
|
||||
|
||||
## Priority Levels
|
||||
|
||||
- **high**: Institution's own website - archive immediately
|
||||
- **medium**: Government registries, Wikipedia - archive if accessible
|
||||
- **low**: Social media, aggregators - just store URL
|
||||
- **skip**: Dead links, paywalled content, dynamic apps
|
||||
```
|
||||
|
||||
## Implementation Script Outline
|
||||
|
||||
```python
#!/usr/bin/env python3
"""
Phase 2 Migration: Conversation Sources → Proper Provenance

Uses GLM4.7 to analyze conversation JSON files and identify
actual data sources for heritage custodian claims.
"""

import json
import os
from pathlib import Path
from typing import Optional

import httpx
import yaml

# Z.AI GLM API configuration (per Rule 11)
ZAI_API_URL = "https://api.z.ai/api/coding/paas/v4/chat/completions"
ZAI_MODEL = "glm-4.5"  # or glm-4.6 for higher quality

CONVERSATION_PREFIX = "/conversations/"


def get_zai_token() -> str:
    """Get Z.AI API token from environment."""
    token = os.environ.get("ZAI_API_TOKEN")
    if not token:
        raise ValueError("ZAI_API_TOKEN environment variable not set")
    return token


def call_glm4(prompt: str, system_prompt: Optional[str] = None) -> str:
    """Call GLM4 API with prompt and return the reply text."""
    headers = {
        "Authorization": f"Bearer {get_zai_token()}",
        "Content-Type": "application/json",
    }

    messages = []
    if system_prompt:
        messages.append({"role": "system", "content": system_prompt})
    messages.append({"role": "user", "content": prompt})

    payload = {
        "model": ZAI_MODEL,
        "messages": messages,
        "temperature": 0.1,  # Low temperature for consistent extraction
        "max_tokens": 4096,
    }

    response = httpx.post(ZAI_API_URL, json=payload, headers=headers, timeout=60)
    response.raise_for_status()

    return response.json()["choices"][0]["message"]["content"]


def load_conversation_json(uuid: str) -> Optional[dict]:
    """Load conversation JSON from archive, or return None if it is not found."""
    # Conversation archives stored in data/conversations/
    conv_path = Path(f"data/conversations/{uuid}.json")
    if not conv_path.exists():
        # Try alternative locations
        alt_path = Path(f"~/Documents/claude/glam/{uuid}.json").expanduser()
        if alt_path.exists():
            conv_path = alt_path
        else:
            return None

    with open(conv_path, "r", encoding="utf-8") as f:
        return json.load(f)


def identify_sources_for_institution(custodian_file: Path) -> dict:
    """
    Analyze conversation to identify sources for a custodian.

    Returns dict with:
    - identified_sources: response to Prompt 1 (sources found)
    - claim_attributions: response to Prompt 2 (claims mapped to sources)
    - web_verification: response to Prompt 3 (URLs to archive or skip)

    On failure, returns a dict with a single 'error' key instead.
    """
    # Load custodian YAML
    with open(custodian_file, "r", encoding="utf-8") as f:
        custodian = yaml.safe_load(f)

    # Extract conversation UUID from provenance path
    ch_annotator = custodian.get("ch_annotator", {})
    extraction_prov = ch_annotator.get("extraction_provenance", {})
    path = extraction_prov.get("path", "")

    if not path.startswith(CONVERSATION_PREFIX):
        return {"error": "Not a conversation source file"}

    # Strip only the leading prefix, not every occurrence in the path
    conv_uuid = path[len(CONVERSATION_PREFIX):]

    # Load conversation JSON
    conversation = load_conversation_json(conv_uuid)
    if not conversation:
        return {"error": f"Conversation not found: {conv_uuid}"}

    # Extract relevant info
    institution_name = custodian.get("custodian_name", {}).get("claim_value", "Unknown")
    ghcid = custodian.get("ghcid", {}).get("ghcid_current", "Unknown")
    entity_claims = ch_annotator.get("entity_claims", [])

    # Step 1: Identify sources using GLM4
    source_prompt = PROMPT_1_SOURCE_IDENTIFICATION.format(
        # Truncate if needed; the cut may fall mid-structure, which is
        # acceptable because the text is only read by the model.
        conversation_json=json.dumps(conversation, indent=2)[:50000],
        institution_name=institution_name,
        ghcid=ghcid,
        conversation_uuid=conv_uuid,
    )

    # json.loads() raises ValueError if the reply is not bare JSON.
    sources_response = call_glm4(source_prompt)
    identified_sources = json.loads(sources_response)

    # Step 2: Attribute claims to sources
    attribution_prompt = PROMPT_2_CLAIM_ATTRIBUTION.format(
        identified_sources_json=json.dumps(identified_sources["identified_sources"], indent=2),
        entity_claims_json=json.dumps(entity_claims, indent=2),
        institution_name=institution_name,
        ghcid=ghcid,
    )

    attributions_response = call_glm4(attribution_prompt)
    claim_attributions = json.loads(attributions_response)

    # Step 3: Verify web sources
    web_sources = [
        s for s in identified_sources["identified_sources"] if s["source_type"] == "web"
    ]

    if web_sources:
        verification_prompt = PROMPT_3_WEB_VERIFICATION.format(
            web_sources_json=json.dumps(web_sources, indent=2),
            institution_name=institution_name,
            ghcid=ghcid,
        )

        verification_response = call_glm4(verification_prompt)
        web_verification = json.loads(verification_response)
    else:
        web_verification = {"sources_to_archive": [], "sources_to_skip": []}

    return {
        "custodian_file": str(custodian_file),
        "conversation_uuid": conv_uuid,
        "identified_sources": identified_sources,
        "claim_attributions": claim_attributions,
        "web_verification": web_verification,
    }


# Prompt templates (loaded from this file or external).
# These are filled with str.format(), so literal braces in the embedded
# JSON examples must be doubled ({{ and }}) in the template text.
PROMPT_1_SOURCE_IDENTIFICATION = """..."""  # From Prompt 1 above
PROMPT_2_CLAIM_ATTRIBUTION = """..."""  # From Prompt 2 above
PROMPT_3_WEB_VERIFICATION = """..."""  # From Prompt 3 above
```
|
||||
|
||||
## Conversation JSON Location
|
||||
|
||||
Conversation exports need to be located. Check these paths:
|
||||
|
||||
1. `~/Documents/claude/glam/*.json` - Original Claude exports
|
||||
2. `data/conversations/*.json` - Project archive location
|
||||
3. `data/instances/conversations/` - Alternative archive
|
||||
|
||||
If conversations are not archived, they may need to be re-exported from Claude.
|
||||
|
||||
## Integration with Phase 1
|
||||
|
||||
Phase 2 runs AFTER Phase 1 completes:
|
||||
|
||||
1. **Phase 1**: Migrates ~18,000 Category 1 files (ISIL/CSV sources)
|
||||
2. **Phase 2**: Processes ~4,000 Category 2 files (conversation sources)
|
||||
3. **Phase 3**: Archives web sources with Playwright for Category 3
|
||||
|
||||
## Cost Estimation
|
||||
|
||||
GLM4 API calls (per Rule 11: FREE via Z.AI Coding Plan):
|
||||
- ~4,000 files × 3 prompts = ~12,000 API calls
|
||||
- Cost: $0 (Z.AI Coding Plan)
|
||||
- Time: ~2-4 hours (rate limited)
|
||||
|
||||
## Validation Criteria
|
||||
|
||||
After Phase 2 migration, every Category 2 file MUST pass:
|
||||
|
||||
1. ✅ `source_archived_at` is present (from identified source or conversation timestamp)
|
||||
2. ✅ `statement_created_at` is present (from annotation_date)
|
||||
3. ✅ `agent` is valid (opencode-claude-sonnet-4 or similar)
|
||||
4. ✅ At least one source identified, OR flagged for manual review
|
||||
5. ✅ Web sources queued for Playwright archival (Phase 3)
|
||||
|
||||
## References
|
||||
|
||||
- Rule 11: `.opencode/ZAI_GLM_API_RULES.md`
|
||||
- Rule 35: `.opencode/PROVENANCE_TIMESTAMP_RULES.md`
|
||||
- Migration Spec: `.opencode/CLAUDE_CONVERSATION_MIGRATION_SPEC.md`
|
||||
393
.opencode/PROVENANCE_TIMESTAMP_RULES.md
Normal file
|
|
@ -0,0 +1,393 @@
|
|||
# Provenance Timestamp Rules
|
||||
|
||||
**Created**: 2025-12-30
|
||||
**Updated**: 2025-12-30
|
||||
**Status**: Active Rule
|
||||
**Related**: WEB_CLAIM_PROVENANCE_SCHEMA.md, YAML_PROVENANCE_SCHEMA.md, WEB_OBSERVATION_PROVENANCE_RULES.md
|
||||
|
||||
## Core Principle: Every Provenance Statement MUST Have At Least Two Timestamps
|
||||
|
||||
**All provenance statements in custodian data MUST include at minimum two timestamps:**
|
||||
|
||||
1. **`statement_created_at`** - When the provenance statement/claim was created (extraction/annotation time)
|
||||
2. **`source_archived_at`** - When the source material was archived/captured
|
||||
|
||||
These two timestamps are MANDATORY. Additional temporal metadata is encouraged but optional.
|
||||
|
||||
---
|
||||
|
||||
## Mandatory Timestamps
|
||||
|
||||
### 1. Statement Created Timestamp (`statement_created_at`)
|
||||
|
||||
**Purpose**: Records when the claim/statement was extracted, annotated, or created by the agent.
|
||||
|
||||
**Format**: ISO 8601 with timezone (UTC preferred)
|
||||
|
||||
**Example**:
|
||||
```yaml
|
||||
statement_created_at: "2025-12-30T14:30:00Z"
|
||||
```
|
||||
|
||||
**Source**: Generated by the extraction/annotation agent at processing time.
|
||||
|
||||
### 2. Source Archived Timestamp (`source_archived_at`)
|
||||
|
||||
**Purpose**: Records when the source material (webpage, document, API response) was archived/captured.
|
||||
|
||||
**Format**: ISO 8601 with timezone (UTC preferred)
|
||||
|
||||
**Example**:
|
||||
```yaml
|
||||
source_archived_at: "2025-12-29T10:15:00Z"
|
||||
```
|
||||
|
||||
**Source**:
|
||||
- For web sources: Playwright archival timestamp, Wayback Machine memento datetime
|
||||
- For API sources: API response fetch timestamp
|
||||
- For documents: Document capture/download timestamp
|
||||
|
||||
---
|
||||
|
||||
## Optional Timestamps (Encouraged)
|
||||
|
||||
### 3. Source Created Timestamp (`source_created_at`)
|
||||
|
||||
**Purpose**: When the original source content was created/published.
|
||||
|
||||
**Example**:
|
||||
```yaml
|
||||
source_created_at: "2022-07-15T14:15:00Z" # Article publish date
|
||||
```
|
||||
|
||||
**Sources**:
|
||||
- `article:published_time` meta tag
|
||||
- `datePublished` in JSON-LD
|
||||
- File creation date
|
||||
- API response `created_at` field
|
||||
|
||||
### 4. Source Last Modified Timestamp (`source_last_modified_at`)
|
||||
|
||||
**Purpose**: When the source content was last updated.
|
||||
|
||||
**Example**:
|
||||
```yaml
|
||||
source_last_modified_at: "2023-01-10T09:00:00Z"
|
||||
```
|
||||
|
||||
**Sources**:
|
||||
- `article:modified_time` meta tag
|
||||
- `dateModified` in JSON-LD
|
||||
- HTTP `Last-Modified` header
|
||||
- File modification date
|
||||
|
||||
### 5. Verification Timestamp (`last_verified_at`)
|
||||
|
||||
**Purpose**: When the claim was last re-verified against the source.
|
||||
|
||||
**Example**:
|
||||
```yaml
|
||||
last_verified_at: "2025-12-30T14:30:00Z"
|
||||
```
|
||||
|
||||
### 6. Next Verification Due (`next_verification_due`)
|
||||
|
||||
**Purpose**: When the claim should be re-verified (for staleness tracking).
|
||||
|
||||
**Example**:
|
||||
```yaml
|
||||
next_verification_due: "2026-03-30T00:00:00Z" # 90 days from last verification
|
||||
```
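The due date can be derived from `last_verified_at`. This is a minimal sketch: the function name is hypothetical, and rounding down to midnight UTC is an assumption inferred from the `T00:00:00Z` value in the example, not a stated rule.

```python
from datetime import datetime, timedelta, timezone


def next_verification_due(last_verified_at: str, days: int = 90) -> str:
    """Return the re-verification date as an ISO 8601 UTC timestamp at midnight."""
    # fromisoformat() only accepts a trailing "Z" from Python 3.11 on.
    verified = datetime.fromisoformat(last_verified_at.replace("Z", "+00:00"))
    due_day = (verified.astimezone(timezone.utc) + timedelta(days=days)).date()
    return f"{due_day.isoformat()}T00:00:00Z"
```

For `last_verified_at: "2025-12-30T14:30:00Z"` this yields `2026-03-30T00:00:00Z`, matching the example above.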
|
||||
|
||||
---
|
||||
|
||||
## Complete Provenance Timestamp Structure
|
||||
|
||||
### For Web Claims
|
||||
|
||||
```yaml
provenance:
  # MANDATORY (both required)
  statement_created_at: "2025-12-30T14:30:00Z"  # When we extracted this
  source_archived_at: "2025-12-29T10:15:00Z"  # When we archived the webpage

  # OPTIONAL (encouraged)
  source_created_at: "2022-07-15T14:15:00Z"  # When article was published
  source_last_modified_at: "2023-01-10T09:00:00Z"  # When article was updated
  last_verified_at: "2025-12-30T14:30:00Z"  # Last verification
  next_verification_due: "2026-03-30T00:00:00Z"  # Re-verify in 90 days
```
|
||||
|
||||
### For API-Sourced Data (Wikidata, Google Maps, etc.)
|
||||
|
||||
```yaml
_provenance:
  # MANDATORY
  statement_created_at: "2025-12-30T14:30:00Z"  # When we processed API response
  source_archived_at: "2025-12-30T14:29:55Z"  # When API was queried (fetch_timestamp)

  # OPTIONAL
  source_last_modified_at: "2025-12-15T00:00:00Z"  # Wikidata entity last modified
  last_verified_at: "2025-12-30T14:30:00Z"
```
|
||||
|
||||
### For CH-Annotator Extracted Claims
|
||||
|
||||
```yaml
provenance:
  namespace: glam
  path: /conversations/edc75d66-ee42-4199-8e22-65b0d2347922

  # MANDATORY
  statement_created_at: "2025-12-06T21:13:56Z"  # When CH-Annotator processed this
  source_archived_at: "2025-11-06T08:02:44Z"    # When conversation was exported

  # Agent identification
  agent: opencode-claude-sonnet-4
  context_convention: ch_annotator-v1_7_0
```
|
||||
|
||||
---
|
||||
|
||||
## Invalid Provenance: `agent: claude-conversation`
|
||||
|
||||
**PROBLEM**: 24,328 custodian files currently contain provenance statements like:
|
||||
|
||||
```yaml
# INVALID - Missing timestamps and proper source identification
extraction_provenance:
  namespace: glam
  path: /conversations/edc75d66-ee42-4199-8e22-65b0d2347922
  timestamp: '2025-11-06T08:02:44.240037+00:00'  # Only ONE timestamp!
  agent: claude-conversation                     # Vague agent identifier
  context_convention: ch_annotator-v1_7_0
```
|
||||
|
||||
**ISSUES**:
|
||||
1. `claude-conversation` is not a valid agent identifier (which Claude model? which session?)
2. Only one timestamp - doesn't distinguish statement creation from source archival
3. No UUID reference to the specific conversation
4. No archived source path
|
||||
|
||||
---
|
||||
|
||||
## Valid Provenance Structure (Migration Target)
|
||||
|
||||
```yaml
extraction_provenance:
  namespace: glam

  # Source identification
  source_type: claude_conversation_export
  source_path: /conversations/edc75d66-ee42-4199-8e22-65b0d2347922
  conversation_uuid: edc75d66-ee42-4199-8e22-65b0d2347922

  # MANDATORY timestamps
  statement_created_at: "2025-12-06T21:13:56.173868+00:00"  # Annotation time
  source_archived_at: "2025-11-06T08:02:44.240037+00:00"    # Conversation export time

  # Agent identification (proper format)
  agent:
    name: opencode-claude-sonnet-4
    model: claude-sonnet-4-20250514
    session_type: opencode_conversation

  # Context
  context_convention: ch_annotator-v1_7_0

  # Archive reference
  archive:
    format: claude_conversation_json
    local_path: data/conversations/edc75d66-ee42-4199-8e22-65b0d2347922.json
```
|
||||
|
||||
---
|
||||
|
||||
## Timestamp Hierarchy and Derivation
|
||||
|
||||
When only one timestamp is available, derive the other:
|
||||
|
||||
| Available | Derive `statement_created_at` | Derive `source_archived_at` |
|-----------|------------------------------|----------------------------|
| Only `timestamp` | Use as `statement_created_at` | Set to same value (assume simultaneous) |
| Only `extraction_date` | Use as `statement_created_at` | Set to same value |
| Only `fetch_timestamp` | Set to same value | Use as `source_archived_at` |
| Only `annotation_date` | Use as `statement_created_at` | Look for `timestamp` in source |
|
||||
|
||||
**Migration rule**: If we cannot determine `source_archived_at`, use the earliest available timestamp from the source chain.
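
The derivation table above can be sketched as a small helper. This is an illustration under stated assumptions, not the project's actual migration code: the function name `derive_mandatory_timestamps` is hypothetical, and the precedence used when several legacy fields are present at once (`annotation_date`, then `timestamp`, then `extraction_date`) is an assumption, since the table only covers the single-field cases.

```python
def derive_mandatory_timestamps(legacy: dict) -> dict:
    """Derive both mandatory timestamps from legacy provenance fields."""
    # Fields that describe when the statement itself was produced.
    # Assumption: this precedence order is not specified by the rules.
    created = (
        legacy.get("annotation_date")
        or legacy.get("timestamp")
        or legacy.get("extraction_date")
    )

    # fetch_timestamp describes when the source was retrieved.
    archived = legacy.get("fetch_timestamp")
    if archived is None and legacy.get("annotation_date"):
        # annotation_date row: look for a separate `timestamp` in the source.
        archived = legacy.get("timestamp")

    # When only one side is known, assume both events were simultaneous.
    if created is None:
        created = archived
    if archived is None:
        archived = created

    return {"statement_created_at": created, "source_archived_at": archived}


print(derive_mandatory_timestamps({
    "timestamp": "2025-11-19T10:14:29.991619+00:00",
    "annotation_date": "2025-12-06T21:12:32.389491+00:00",
}))
```

If no legacy field is present at all, both values come back as `None`, which leaves the record to be caught by validation rather than silently filled in.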
|
||||
|
||||
---
|
||||
|
||||
## Agent Identification Standards
|
||||
|
||||
### Invalid Agent Identifiers
|
||||
|
||||
```yaml
|
||||
# INVALID - Too vague
|
||||
agent: claude-conversation
|
||||
agent: claude
|
||||
agent: ai
|
||||
agent: llm
|
||||
agent: opencode
|
||||
```
|
||||
|
||||
### Valid Agent Identifiers
|
||||
|
||||
```yaml
# Format: {tool}-{model}-{version}
agent: opencode-claude-sonnet-4
agent: opencode-claude-opus-4
agent: batch-script-python-3.11
agent: manual-human-curator

# Or structured format
agent:
  name: opencode-claude-sonnet-4
  model: claude-sonnet-4-20250514
  tool: opencode
  version: "1.0.0"
```
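
A check along these lines could reject the vague identifiers listed above. It is only a sketch: the name `is_valid_agent`, the requirement of at least three hyphen-separated parts, and the rule that a structured agent needs a valid `name` are assumptions drawn from the examples, not rules stated in this document.

```python
# Identifiers listed above as too vague.
VAGUE_AGENTS = {"claude-conversation", "claude", "ai", "llm", "opencode"}


def is_valid_agent(agent) -> bool:
    """Return True if the agent identifier looks specific enough."""
    if isinstance(agent, dict):
        # Structured format: assume the `name` key carries the identifier.
        return is_valid_agent(agent.get("name"))
    if not isinstance(agent, str):
        return False
    if agent.lower() in VAGUE_AGENTS:
        return False
    # Assumption: a specific identifier has at least three non-empty parts,
    # e.g. tool-model-version.
    parts = [part for part in agent.split("-") if part]
    return len(parts) >= 3


for candidate in ("claude-conversation", "opencode-claude-sonnet-4"):
    print(candidate, is_valid_agent(candidate))
```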
|
||||
|
||||
---
|
||||
|
||||
## PROV-O Alignment
|
||||
|
||||
These timestamps align with W3C PROV-O:
|
||||
|
||||
| Our Field | PROV-O Property | Description |
|-----------|-----------------|-------------|
| `statement_created_at` | `prov:generatedAtTime` | When entity was generated |
| `source_archived_at` | `prov:atTime` (on Activity) | When archival activity occurred |
| `source_created_at` | `dcterms:created` | Original creation date |
| `source_last_modified_at` | `dcterms:modified` | Last modification date |
|
||||
|
||||
```yaml
prov:
  generatedAtTime: "2025-12-30T14:30:00Z"  # = statement_created_at
  wasGeneratedBy:
    "@type": "prov:Activity"
    name: "web_extraction"
    atTime: "2025-12-29T10:15:00Z"  # = source_archived_at
```
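
As an illustration of the mapping, the `prov` block can be generated from the two mandatory timestamps. The helper name `to_prov` and its `activity_name` parameter are hypothetical; the output keys mirror the YAML example above.

```python
def to_prov(provenance: dict, activity_name: str) -> dict:
    """Build the `prov` block from the two mandatory timestamps."""
    return {
        "prov": {
            # statement_created_at -> prov:generatedAtTime
            "generatedAtTime": provenance["statement_created_at"],
            "wasGeneratedBy": {
                "@type": "prov:Activity",
                "name": activity_name,
                # source_archived_at -> atTime on the activity
                "atTime": provenance["source_archived_at"],
            },
        }
    }


print(to_prov(
    {
        "statement_created_at": "2025-12-30T14:30:00Z",
        "source_archived_at": "2025-12-29T10:15:00Z",
    },
    activity_name="web_extraction",
))
```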
|
||||
|
||||
---
|
||||
|
||||
## Validation Rules
|
||||
|
||||
### Rule 1: Both Mandatory Timestamps Required
|
||||
|
||||
```python
def validate_provenance_timestamps(provenance: dict) -> list[str]:
    errors = []

    # Check for mandatory timestamps
    if 'statement_created_at' not in provenance:
        errors.append("Missing mandatory 'statement_created_at' timestamp")
    if 'source_archived_at' not in provenance:
        errors.append("Missing mandatory 'source_archived_at' timestamp")

    return errors
```
|
||||
|
||||
### Rule 2: Timestamps Must Be Valid ISO 8601
|
||||
|
||||
```python
from datetime import datetime

def validate_timestamp_format(timestamp: str) -> bool:
    try:
        datetime.fromisoformat(timestamp.replace('Z', '+00:00'))
        return True
    except ValueError:
        return False
```
|
||||
|
||||
### Rule 3: source_archived_at <= statement_created_at
|
||||
|
||||
The source must be archived BEFORE or AT the same time as the statement is created.
|
||||
|
||||
```python
from datetime import datetime

def validate_timestamp_order(provenance: dict) -> bool:
    # Normalize a trailing 'Z' (as in Rule 2) so parsing also works on Python < 3.11
    archived = datetime.fromisoformat(provenance['source_archived_at'].replace('Z', '+00:00'))
    created = datetime.fromisoformat(provenance['statement_created_at'].replace('Z', '+00:00'))
    return archived <= created
```
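
The three rules can also be applied in one pass so that a single call reports every problem with a provenance block. This is a sketch under assumptions: the function name `validate_provenance` and the exact error messages are illustrative, and a mix of timezone-aware and naive timestamps is reported as an error rather than guessed at.

```python
from datetime import datetime

MANDATORY_FIELDS = ("statement_created_at", "source_archived_at")


def validate_provenance(provenance: dict) -> list[str]:
    """Apply Rules 1-3 and return a list of human-readable errors."""
    errors = []
    parsed = {}

    for field in MANDATORY_FIELDS:
        value = provenance.get(field)
        if value is None:
            errors.append(f"Missing mandatory '{field}' timestamp")  # Rule 1
            continue
        if not isinstance(value, str):
            errors.append(f"'{field}' must be an ISO 8601 string")
            continue
        try:
            parsed[field] = datetime.fromisoformat(value.replace("Z", "+00:00"))
        except ValueError:
            errors.append(f"'{field}' is not valid ISO 8601: {value!r}")  # Rule 2

    if len(parsed) == 2:
        try:
            if parsed["source_archived_at"] > parsed["statement_created_at"]:
                errors.append("source_archived_at is later than statement_created_at")  # Rule 3
        except TypeError:
            # Raised when one timestamp has a UTC offset and the other does not.
            errors.append("Cannot compare timestamps with and without a UTC offset")

    return errors


print(validate_provenance({"statement_created_at": "2025-12-30T14:30:00Z"}))
```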
|
||||
|
||||
---
|
||||
|
||||
## Migration Strategy for Existing Files
|
||||
|
||||
### Phase 1: Identify Files Needing Migration
|
||||
|
||||
```bash
# Count affected files
find data/custodian -name "*.yaml" -exec grep -l "agent: claude-conversation" {} \; | wc -l
# Result: 24,328 files
```
|
||||
|
||||
### Phase 2: Parse and Transform
|
||||
|
||||
For each file with `agent: claude-conversation`:
|
||||
|
||||
1. Extract existing `timestamp` field
2. Set `source_archived_at` = existing `timestamp`
3. Set `statement_created_at` = `annotation_date` if present, else use current time
4. Replace `agent: claude-conversation` with proper agent identifier
5. Add conversation UUID from path
|
||||
|
||||
### Phase 3: Validate and Write
|
||||
|
||||
```python
def migrate_provenance(data: dict) -> dict:
    """Migrate old claude-conversation provenance to new format."""

    if 'ch_annotator' in data:
        ch = data['ch_annotator']

        if ch.get('extraction_provenance', {}).get('agent') == 'claude-conversation':
            old_prov = ch['extraction_provenance']

            # Extract conversation UUID from path
            path = old_prov.get('path', '')
            conv_uuid = path.split('/')[-1] if '/conversations/' in path else None

            # Get timestamps
            source_archived_at = old_prov.get('timestamp')
            statement_created_at = ch.get('annotation_provenance', {}).get('annotation_date', source_archived_at)

            # Build new provenance
            ch['extraction_provenance'] = {
                'namespace': old_prov.get('namespace', 'glam'),
                'source_type': 'claude_conversation_export',
                'source_path': old_prov.get('path'),
                'conversation_uuid': conv_uuid,
                'statement_created_at': statement_created_at,
                'source_archived_at': source_archived_at,
                'agent': 'opencode-claude-sonnet-4',  # Default migration value
                'context_convention': old_prov.get('context_convention'),
                'migration_note': 'Migrated from agent:claude-conversation on 2025-12-30'
            }

    return data
```
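
The function above rewrites only the top-level `extraction_provenance`. The custodian files also carry the same `agent: claude-conversation` value in each `entity_claims[].provenance` entry, so a companion step is likely needed. The following is a hedged sketch: `migrate_claim_provenance` is a hypothetical helper, and reusing `annotation_date` as `statement_created_at` for every claim is an assumption carried over from the top-level migration.

```python
def migrate_claim_provenance(ch: dict, agent: str = "opencode-claude-sonnet-4") -> int:
    """Rewrite entity_claims[].provenance in place; return the number migrated."""
    annotation = ch.get("annotation_provenance") or {}
    annotation_date = annotation.get("annotation_date")
    migrated = 0

    for claim in ch.get("entity_claims") or []:
        prov = claim.get("provenance") or {}
        if prov.get("agent") != "claude-conversation":
            continue

        # The single legacy timestamp becomes the archival time.
        archived = prov.pop("timestamp", None)
        prov["source_archived_at"] = archived
        # Assumption: fall back to the archival time when no annotation date exists.
        prov["statement_created_at"] = annotation_date or archived
        prov["agent"] = agent  # Default migration value, as in the function above
        migrated += 1

    return migrated


sample = {
    "annotation_provenance": {"annotation_date": "2025-12-06T21:12:32.389491+00:00"},
    "entity_claims": [
        {
            "claim_type": "full_name",
            "provenance": {
                "namespace": "glam",
                "path": "/files/czech_unified.yaml",
                "timestamp": "2025-11-19T10:14:29.991619+00:00",
                "agent": "claude-conversation",
            },
        }
    ],
}
print(migrate_claim_provenance(sample))
```

Returning a count makes it easy to log how many claims changed per file during a batch run.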
|
||||
|
||||
---
|
||||
|
||||
## Implementation Checklist
|
||||
|
||||
- [ ] Add `statement_created_at` to all new provenance statements
- [ ] Add `source_archived_at` to all new provenance statements
- [ ] Replace `agent: claude-conversation` with proper agent identifiers
- [ ] Add conversation UUIDs where applicable
- [ ] Migrate existing 24,328 files with invalid provenance
- [ ] Update LinkML schema to require dual timestamps
- [ ] Add validation to data pipeline
|
||||
|
||||
---
|
||||
|
||||
## Related Documentation
|
||||
|
||||
- `.opencode/WEB_CLAIM_PROVENANCE_SCHEMA.md` - Web claim provenance structure
- `.opencode/YAML_PROVENANCE_SCHEMA.md` - YAML enrichment provenance
- `.opencode/WEB_OBSERVATION_PROVENANCE_RULES.md` - XPath provenance requirements
- `AGENTS.md` - Rule 35: Provenance Timestamps
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-H-BPK.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: BODHI PRAHA - Knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE360
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '49627414'
|
||||
locations:
|
||||
- street_address: Slezská 140
|
||||
postal_code: 130 00
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.076911
|
||||
longitude: 14.464614
|
||||
processing_timestamp: '2025-12-06T23:37:20.849119+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-BPK
|
||||
ghcid_original: CZ-HL-PRA-H-BPK
|
||||
ghcid_uuid: 7c1fd1a4-ee17-527e-9349-55fd16c01486
|
||||
ghcid_uuid_sha256: 6367a388-cb73-8eee-53b7-8ea64c233ab4
|
||||
ghcid_numeric: 7162873540280291054
|
||||
record_id: 2a67ea10-8a77-44d3-bcaf-ae62cde0aa16
|
||||
generation_timestamp: '2025-12-06T23:37:20.849119+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-BPK
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-H-BPK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-H-BPK
|
||||
ghcid_numeric: 7162873540280291054
|
||||
valid_from: '2025-12-06T23:37:20.849119+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: BODHI PRAHA - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-BPK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 7c1fd1a4-ee17-527e-9349-55fd16c01486
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 6367a388-cb73-8eee-53b7-8ea64c233ab4
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7162873540280291054'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 2a67ea10-8a77-44d3-bcaf-ae62cde0aa16
|
||||
- *id001
|
||||
- *id002
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.991619+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.991619+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: BODHI PRAHA - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.991619+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.991619+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 3
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.991619+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE360
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.991619+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:20.849119+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.076911
|
||||
longitude: 14.464614
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 130 00
|
||||
street_address: Slezská 140
|
||||
normalization_timestamp: '2025-12-09T06:50:46.626432+00:00'
|
||||
292
data/custodian.backup.20251230/CZ-10-PRA-H-CCHUURUAM.yaml
Normal file
|
|
@ -0,0 +1,292 @@
|
|||
original_entry:
|
||||
name: Církev československá husitská - Úřad Ústřední rady - Ústřední archiv a muzeum
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE460
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00445185
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.ccsh.cz
|
||||
identifier_url: https://www.ccsh.cz
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114619244
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619244
|
||||
locations:
|
||||
- street_address: Wuchterlova 5
|
||||
postal_code: 166 26
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.097831
|
||||
longitude: 14.397203
|
||||
processing_timestamp: '2025-12-06T23:37:24.561843+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-CCHUURUAM
|
||||
ghcid_original: CZ-HL-PRA-H-CČHÚÚRÚAM
|
||||
ghcid_uuid: 7d10ac5c-5235-5ba1-ba30-2edfef4c0af9
|
||||
ghcid_uuid_sha256: 6ff0d000-3401-8df7-a4c7-6eefa0376a81
|
||||
ghcid_numeric: 8066175631911677431
|
||||
record_id: c29783fc-ee79-410b-8d60-1acb510e3c72
|
||||
generation_timestamp: '2025-12-06T23:37:24.561843+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-CCHUURUAM
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-CCHUURUAM
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-CCHUURUAM
|
||||
ghcid_numeric: 8066175631911677431
|
||||
valid_from: '2025-12-08T11:21:41.193514+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-CČHÚÚRÚAM)'
|
||||
- ghcid: CZ-HL-PRA-H-CČHÚÚRÚAM
|
||||
ghcid_numeric: 4847761274494306046
|
||||
valid_from: '2025-12-06T23:37:24.561843+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:41.193514+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Církev československá husitská - Úřad Ústřední rady - Ústřední archiv a muzeum
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-CCHUURUAM
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 7d10ac5c-5235-5ba1-ba30-2edfef4c0af9
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 6ff0d000-3401-8df7-a4c7-6eefa0376a81
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '8066175631911677431'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: c29783fc-ee79-410b-8d60-1acb510e3c72
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.039417+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-meF0a8cLF7Bs9iN06RJgME6ags8hQktQ+hi14hroIfU=
|
||||
verified_at: '2025-12-28T19:58:22.443284+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619244
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.443305+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.039417+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Církev československá husitská - Úřad Ústřední rady - Ústřední archiv a muzeum
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.039417+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.039417+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.039417+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114619244
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114619244
|
||||
timestamp: '2025-11-19T10:14:30.039417+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE460
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.039417+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:24.561843+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114619244
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:25:32.394804+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P749
|
||||
- P361
|
||||
wikidata_labels:
|
||||
cs: Církev československá husitská, Úřad Ústřední rady, Ústřední archiv a muzeum (knihovna)
|
||||
en: Czechoslovak Hussite Church, Central Council, Central Archiv and Museum
|
||||
wikidata_label_en: Czechoslovak Hussite Church, Central Council, Central Archiv and Museum
|
||||
wikidata_label_cs: Církev československá husitská, Úřad Ústřední rady, Ústřední archiv a muzeum (knihovna)
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- ÚAM ÚÚR CČSH
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q13866185
|
||||
label: museum library
|
||||
description: library that is part of a museum
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2667290
|
||||
label: Prague 6
|
||||
description: municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_organization:
|
||||
parent_organization: &id007
|
||||
id: Q115654444
|
||||
label: Ústřední archiv a muzeum Církve československé husitské
|
||||
description: museum in Czechia
|
||||
part_of: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:archiv@ccsh.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-meF0a8cLF7Bs9iN06RJgME6ags8hQktQ+hi14hroIfU=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.443284+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619244
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.443294+00:00'
|
||||
location:
|
||||
latitude: 50.097831
|
||||
longitude: 14.397203
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 166 26
|
||||
street_address: Wuchterlova 5
|
||||
normalization_timestamp: '2025-12-09T06:50:46.651423+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:55:27.491797+00:00'
|
||||
source_url: https://www.ccsh.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: logo_url
|
||||
claim_value: https://www.ccsh.cz/images/logoG.png
|
||||
source_url: https://www.ccsh.cz
|
||||
css_selector: '#t3-header > div.row > div.col-xs-12.col-sm-6 > div.logo-image > a > img.logo-img'
|
||||
retrieved_on: '2025-12-23T21:55:27.491797+00:00'
|
||||
extraction_method: crawl4ai_header_logo
|
||||
detection_confidence: high
|
||||
alt_text: CČSH
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.ccsh.cz/safari-pinned-tab.svg
|
||||
source_url: https://www.ccsh.cz
|
||||
css_selector: '[document] > html.com_content.view-featured > head > link:nth-of-type(39)'
|
||||
retrieved_on: '2025-12-23T21:55:27.491797+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
- claim_type: og_image_url
|
||||
claim_value: https://nwork.vbox.cz/images/joomlart/cirkev/kdo_jsme.jpg
|
||||
source_url: https://www.ccsh.cz
|
||||
css_selector: '[document] > html.com_content.view-featured > head > meta:nth-of-type(4)'
|
||||
retrieved_on: '2025-12-23T21:55:27.491797+00:00'
|
||||
extraction_method: crawl4ai_meta_og
|
||||
summary:
|
||||
total_claims: 3
|
||||
has_primary_logo: true
|
||||
has_favicon: true
|
||||
has_og_image: true
|
||||
favicon_count: 5
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-H-CKACKK.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Česká křesťanská akademie - Centrální katolická knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE454
|
||||
locations:
|
||||
- street_address: Thákurova 3
|
||||
postal_code: 160 00
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.754984+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-CKACKK
|
||||
ghcid_original: CZ-HL-PRA-H-ČKACKK
|
||||
ghcid_uuid: 4ef4bd45-9642-5128-b3e9-d4d9edc50960
|
||||
ghcid_uuid_sha256: 834f87f4-a28d-8c8e-8612-25a472753f8d
|
||||
ghcid_numeric: 9461930826897628302
|
||||
record_id: 70f6726f-eb75-4ded-baf4-cf10eca63649
|
||||
generation_timestamp: '2025-12-06T23:37:25.754984+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-CKACKK
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-H-CKACKK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-H-CKACKK
|
||||
ghcid_numeric: 9461930826897628302
|
||||
valid_from: '2025-12-08T11:21:38.069976+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ČKACKK)'
|
||||
- ghcid: CZ-HL-PRA-H-ČKACKK
|
||||
ghcid_numeric: 2876805312996636643
|
||||
valid_from: '2025-12-06T23:37:25.754984+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:38.069976+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Česká křesťanská akademie - Centrální katolická knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-CKACKK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 4ef4bd45-9642-5128-b3e9-d4d9edc50960
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 834f87f4-a28d-8c8e-8612-25a472753f8d
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '9461930826897628302'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 70f6726f-eb75-4ded-baf4-cf10eca63649
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.047751+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047751+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Česká křesťanská akademie - Centrální katolická knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047751+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047751+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047751+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE454
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047751+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.754984+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 160 00
|
||||
street_address: Thákurova 3
|
||||
normalization_timestamp: '2025-12-09T20:41:03.900039+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:03.900021+00:00'
|
||||
|
|
@ -0,0 +1,263 @@
|
|||
original_entry:
|
||||
name: Česká Provincie řádu sv. Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Augustiniánská knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE462
|
||||
- &id002
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://augustiniani.cz/
|
||||
identifier_url: https://augustiniani.cz/
|
||||
- &id003
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114619631
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619631
|
||||
locations:
|
||||
- street_address: Josefská 28/8
|
||||
postal_code: 118 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.088826
|
||||
longitude: 14.405173
|
||||
processing_timestamp: '2025-12-06T23:37:27.387413+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
ghcid_original: CZ-HL-PRA-H-ČPŘSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
ghcid_uuid: 6450dbfb-6387-5dff-93ab-f3e0ff979ee7
|
||||
ghcid_uuid_sha256: d52be405-b36d-8bb1-b10d-b15f1cee5323
|
||||
ghcid_numeric: 15360621667213487025
|
||||
record_id: 2955d359-aa74-4f10-b9cb-a9971074c851
|
||||
generation_timestamp: '2025-12-06T23:37:27.387413+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
ghcid_numeric: 15360621667213487025
|
||||
valid_from: '2025-12-08T11:21:29.229081+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ČPŘSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas)'
|
||||
- ghcid: CZ-HL-PRA-H-ČPŘSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
ghcid_numeric: 10650113264983091993
|
||||
valid_from: '2025-12-06T23:37:27.387413+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:29.229081+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Česká Provincie řádu sv. Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Augustiniánská knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 6450dbfb-6387-5dff-93ab-f3e0ff979ee7
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: d52be405-b36d-8bb1-b10d-b15f1cee5323
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15360621667213487025'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 2955d359-aa74-4f10-b9cb-a9971074c851
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.062682+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-wFZjrBWmASC3ygUuMmJ9WvXmSxWeu0b3P2suN3Jaa48=
|
||||
verified_at: '2025-12-28T19:58:22.476372+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619631
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.476396+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062682+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Česká Provincie řádu sv. Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Augustiniánská knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062682+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062682+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062682+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114619631
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114619631
|
||||
timestamp: '2025-11-19T10:14:30.062682+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE462
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062682+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:27.387413+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114619631
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:25:40.452127+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Česká Provincie řádu sv. Augustina, Farnost sv. Tomáše v Praze na Malé Straně, Augustiniánská knihovna
|
||||
en: The Order of St. Augustine, St. Thomas Church in Prague, Library
|
||||
wikidata_label_en: The Order of St. Augustine, St. Thomas Church in Prague, Library
|
||||
wikidata_label_cs: Česká Provincie řádu sv. Augustina, Farnost sv. Tomáše v Praze na Malé Straně, Augustiniánská knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id004
|
||||
- id: Q50379845
|
||||
label: religious library
|
||||
description: type of library with collections that focus on religion and related subjects
|
||||
wikidata_instance_of: *id004
|
||||
wikidata_location:
|
||||
country: &id005
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id005
|
||||
wikidata_contact:
|
||||
email: mailto:osaprag@augustiniani.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-wFZjrBWmASC3ygUuMmJ9WvXmSxWeu0b3P2suN3Jaa48=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.476372+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619631
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.476385+00:00'
|
||||
location:
|
||||
latitude: 50.088826
|
||||
longitude: 14.405173
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 01
|
||||
street_address: Josefská 28/8
|
||||
normalization_timestamp: '2025-12-09T06:50:46.697835+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:55:34.949111+00:00'
|
||||
source_url: https://augustiniani.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://augustiniani.cz/wp-content/uploads/2021/03/cropped-znak-osa-180x180.png
|
||||
source_url: https://augustiniani.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(62)'
|
||||
retrieved_on: '2025-12-23T21:55:34.949111+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 3
|
||||
270
data/custodian.backup.20251230/CZ-10-PRA-H-CPRSAFSTVP.yaml
Normal file
|
|
@ -0,0 +1,270 @@
|
|||
original_entry:
|
||||
name: Česká provincie řádu svatého Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE458
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00569631
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://augustiniani.cz/
|
||||
identifier_url: https://augustiniani.cz/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114618792
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114618792
|
||||
locations:
|
||||
- street_address: Josefská 28/8
|
||||
postal_code: 118 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.088825
|
||||
longitude: 14.405172
|
||||
processing_timestamp: '2025-12-06T23:37:21.578012+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-CPRSAFSTVP
|
||||
ghcid_original: CZ-HL-PRA-H-ČPŘSAFSTVP
|
||||
ghcid_uuid: 5f25d43b-863b-54b1-9d9b-cbfdfc103b98
|
||||
ghcid_uuid_sha256: 0aecf24a-999e-8c9b-a6c0-02886452c5c0
|
||||
ghcid_numeric: 787270437101800603
|
||||
record_id: c6886339-40c0-4790-967c-f6eaab3f4b72
|
||||
generation_timestamp: '2025-12-06T23:37:21.578012+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-CPRSAFSTVP
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP
|
||||
ghcid_numeric: 787270437101800603
|
||||
valid_from: '2025-12-08T11:21:35.155241+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ČPŘSAFSTVP)'
|
||||
- ghcid: CZ-HL-PRA-H-ČPŘSAFSTVP
|
||||
ghcid_numeric: 1916463988346664738
|
||||
valid_from: '2025-12-06T23:37:21.578012+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:35.155241+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Česká provincie řádu svatého Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-CPRSAFSTVP
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 5f25d43b-863b-54b1-9d9b-cbfdfc103b98
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 0aecf24a-999e-8c9b-a6c0-02886452c5c0
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '787270437101800603'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: c6886339-40c0-4790-967c-f6eaab3f4b72
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.009282+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-SljGXkAVhGLTN6EVn1SWAncLhXjJmVaAHZxw289RWxI=
|
||||
verified_at: '2025-12-28T19:58:22.498279+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114618792
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.498298+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.009282+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Česká provincie řádu svatého Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.009282+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.009282+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.009282+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114618792
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114618792
|
||||
timestamp: '2025-11-19T10:14:30.009282+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE458
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.009282+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:21.578012+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114618792
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:25:48.379615+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Česká provincie řádu svatého Augustina, Farnost sv. Tomáše v Praze na Malé Straně, knihovna
|
||||
en: Bohemica Augustinianorum Pragae
|
||||
wikidata_label_en: Bohemica Augustinianorum Pragae
|
||||
wikidata_label_cs: Česká provincie řádu svatého Augustina, Farnost sv. Tomáše v Praze na Malé Straně, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- BAUGP
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q50379845
|
||||
label: religious library
|
||||
description: type of library with collections that focus on religion and related subjects
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:osaprag@augustiniani.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-SljGXkAVhGLTN6EVn1SWAncLhXjJmVaAHZxw289RWxI=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.498279+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114618792
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.498289+00:00'
|
||||
location:
|
||||
latitude: 50.088825
|
||||
longitude: 14.405172
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 01
|
||||
street_address: Josefská 28/8
|
||||
normalization_timestamp: '2025-12-09T06:50:46.723119+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:55:40.149799+00:00'
|
||||
source_url: https://augustiniani.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://augustiniani.cz/wp-content/uploads/2021/03/cropped-znak-osa-180x180.png
|
||||
source_url: https://augustiniani.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(62)'
|
||||
retrieved_on: '2025-12-23T21:55:40.149799+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 3
|
||||
251
data/custodian.backup.20251230/CZ-10-PRA-H-ETSVOSTSK.yaml
Normal file
|
|
@ -0,0 +1,251 @@
|
|||
original_entry:
|
||||
name: Evangelikální teologický seminář - Vyšší odborná škola teologická a sociální - knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABG450
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '41690184'
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://ets.jabok.cuni.cz/eg/opac/home
|
||||
identifier_url: https://ets.jabok.cuni.cz/eg/opac/home
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114619199
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619199
|
||||
locations:
|
||||
- street_address: Stoliňská 2417/41a
|
||||
postal_code: 193 00
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.107464
|
||||
longitude: 14.592644
|
||||
processing_timestamp: '2025-12-06T23:37:24.239954+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-ETSVOSTSK
|
||||
ghcid_original: CZ-HL-PRA-H-ETSVOŠTSK
|
||||
ghcid_uuid: 3725cb10-ac06-5e92-8bae-37d6b7f6808d
|
||||
ghcid_uuid_sha256: 43204928-ab70-888c-a318-a62b466affca
|
||||
ghcid_numeric: 4836946438819739788
|
||||
record_id: 941be124-274b-4f6c-bdc2-26136c7ab711
|
||||
generation_timestamp: '2025-12-06T23:37:24.239954+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-ETSVOSTSK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-ETSVOSTSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-ETSVOSTSK
|
||||
ghcid_numeric: 4836946438819739788
|
||||
valid_from: '2025-12-08T11:21:31.045391+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ETSVOŠTSK)'
|
||||
- ghcid: CZ-HL-PRA-H-ETSVOŠTSK
|
||||
ghcid_numeric: 12474982366854812758
|
||||
valid_from: '2025-12-06T23:37:24.239954+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.045391+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Evangelikální teologický seminář - Vyšší odborná škola teologická a sociální - knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-ETSVOSTSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 3725cb10-ac06-5e92-8bae-37d6b7f6808d
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 43204928-ab70-888c-a318-a62b466affca
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '4836946438819739788'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 941be124-274b-4f6c-bdc2-26136c7ab711
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.035614+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-9q2YCHCyTeS8SeizRm2nZXth340jdonIZ71izeua7RU=
|
||||
verified_at: '2025-12-28T19:58:22.518111+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619199
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.518129+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035614+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Evangelikální teologický seminář - Vyšší odborná škola teologická a sociální - knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035614+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035614+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035614+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114619199
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114619199
|
||||
timestamp: '2025-11-19T10:14:30.035614+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABG450
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035614+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:24.239954+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114619199
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:25:56.334339+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Evangelikální teologický seminář, Vyšší odborná škola teologická a sociální, knihovna
|
||||
en: Evangelical Teological Seminary, Higher Theological Vocational School
|
||||
wikidata_label_en: Evangelical Teological Seminary, Higher Theological Vocational School
|
||||
wikidata_label_cs: Evangelikální teologický seminář, Vyšší odborná škola teologická a sociální, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- ETS-VOŠTS
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q50379845
|
||||
label: religious library
|
||||
description: type of library with collections that focus on religion and related subjects
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2750534
|
||||
label: Prague 9
|
||||
description: administrative district and municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:info@etspraha.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-9q2YCHCyTeS8SeizRm2nZXth340jdonIZ71izeua7RU=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.518111+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619199
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.518120+00:00'
|
||||
location:
|
||||
latitude: 50.107464
|
||||
longitude: 14.592644
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 193 00
|
||||
street_address: Stoliňská 2417/41a
|
||||
normalization_timestamp: '2025-12-09T06:50:46.750937+00:00'
|
||||
185
data/custodian.backup.20251230/CZ-10-PRA-H-HITS.yaml
Normal file
|
|
@ -0,0 +1,185 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Husův institut teologických studií
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: Husův institut teologických studií
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE467
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '73632571'
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.hitspraha.cz/index.php/hits-top.html
|
||||
identifier_url: https://www.hitspraha.cz/index.php/hits-top.html
|
||||
locations:
|
||||
- street_address: Roháčova 1305/66
|
||||
postal_code: 130 00
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.088143
|
||||
longitude: 14.460887
|
||||
processing_timestamp: '2025-12-06T23:37:43.603692+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-HITS
|
||||
ghcid_original: CZ-HL-PRA-H-HITS
|
||||
ghcid_uuid: 46f9b78d-b0a3-51d4-8a9f-13f51703a2c8
|
||||
ghcid_uuid_sha256: 65f6b716-9bbc-852c-071f-e596068235c3
|
||||
ghcid_numeric: 7347261149831529772
|
||||
record_id: fa3e7a61-8914-4960-9e1a-736421221995
|
||||
generation_timestamp: '2025-12-06T23:37:43.603692+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-HITS
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-HITS
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-HITS
|
||||
ghcid_numeric: 7347261149831529772
|
||||
valid_from: '2025-12-06T23:37:43.603692+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-HITS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 46f9b78d-b0a3-51d4-8a9f-13f51703a2c8
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 65f6b716-9bbc-852c-071f-e596068235c3
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7347261149831529772'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: fa3e7a61-8914-4960-9e1a-736421221995
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.309554+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.309554+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Husův institut teologických studií
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.309554+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.309554+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 3
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.309554+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE467
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.309554+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:43.603692+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.088143
|
||||
longitude: 14.460887
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 130 00
|
||||
street_address: Roháčova 1305/66
|
||||
normalization_timestamp: '2025-12-09T06:50:46.768907+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:55:48.452790+00:00'
|
||||
source_url: https://www.hitspraha.cz/index.php/hits-top.html
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.hitspraha.cz/templates/hits/favicon.ico
|
||||
source_url: https://www.hitspraha.cz/index.php/hits-top.html
|
||||
css_selector: '[document] > html > head > link:nth-of-type(3)'
|
||||
retrieved_on: '2025-12-23T21:55:48.452790+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/vnd.microsoft.icon
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
213
data/custodian.backup.20251230/CZ-10-PRA-H-KDSDMSJK.yaml
Normal file
|
|
@ -0,0 +1,213 @@
|
|||
original_entry:
|
||||
name: Katolický domov studujících - domov mládeže a školní jídelna - Knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE463
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '43001106'
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://kds.jabok.cuni.cz/eg/opac/home
|
||||
identifier_url: https://kds.jabok.cuni.cz/eg/opac/home
|
||||
locations:
|
||||
- street_address: Černá 1610/14
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.079251
|
||||
longitude: 14.418292
|
||||
processing_timestamp: '2025-12-06T23:37:27.465222+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-KDSDMSJK
|
||||
ghcid_original: CZ-HL-PRA-H-KDSDMŠJK
|
||||
ghcid_uuid: 1b95dc6b-e729-52d7-89a6-169a765493fa
|
||||
ghcid_uuid_sha256: b1506a18-e303-8d8f-bef1-d468e3f5cff9
|
||||
ghcid_numeric: 12776828797970501007
|
||||
record_id: 93f49c33-a042-4e90-bc3e-f621595371cb
|
||||
generation_timestamp: '2025-12-06T23:37:27.465222+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-KDSDMSJK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-KDSDMSJK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-KDSDMSJK
|
||||
ghcid_numeric: 12776828797970501007
|
||||
valid_from: '2025-12-08T11:21:31.569552+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-KDSDMŠJK)'
|
||||
- ghcid: CZ-HL-PRA-H-KDSDMŠJK
|
||||
ghcid_numeric: 10088411494359326508
|
||||
valid_from: '2025-12-06T23:37:27.465222+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.569552+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Katolický domov studujících - domov mládeže a školní jídelna - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-KDSDMSJK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 1b95dc6b-e729-52d7-89a6-169a765493fa
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b1506a18-e303-8d8f-bef1-d468e3f5cff9
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12776828797970501007'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 93f49c33-a042-4e90-bc3e-f621595371cb
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- identifier_scheme: Wikidata
|
||||
identifier_value: Q114619639
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619639
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.063047+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-qvancJjwidfjm8sgTqwf/kcf9K/ABK4FBmwC5ASuUOw=
|
||||
verified_at: '2025-12-28T19:58:22.545896+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619639
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.545911+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.063047+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Katolický domov studujících - domov mládeže a školní jídelna - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.063047+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.063047+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.063047+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE463
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.063047+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:27.465222+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.079251
|
||||
longitude: 14.418292
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Černá 1610/14
|
||||
normalization_timestamp: '2025-12-09T06:50:46.792111+00:00'
|
||||
wikidata_enrichment:
|
||||
wikidata_id: Q114619639
|
||||
wikidata_url: https://www.wikidata.org/wiki/Q114619639
|
||||
matched_by: sigla_identifier
|
||||
matched_sigla: ABE463
|
||||
wikidata_label: knihovna Katolického domova studujících v Praze
|
||||
wikidata_description: knihovna v Praze
|
||||
enrichment_date: '2025-12-19T10:12:58.406651+00:00'
|
||||
enrichment_version: 2.1.0
|
||||
instance_of:
|
||||
- Q50379845
|
||||
- Q7075
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-qvancJjwidfjm8sgTqwf/kcf9K/ABK4FBmwC5ASuUOw=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.545896+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619639
|
||||
generatedAtTime: '2025-12-19T10:12:58.406651+00:00'
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.545903+00:00'
|
||||
wikidata_entity_id: Q114619639
|
||||
231
data/custodian.backup.20251230/CZ-10-PRA-H-KKPSSK.yaml
Normal file
|
|
@ -0,0 +1,231 @@
|
|||
original_entry:
|
||||
name: Královská kanonie premonstrátů na Strahově - Strahovská knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE459
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00415090
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://strahovskyklaster.tritius.cz/
|
||||
identifier_url: https://strahovskyklaster.tritius.cz/
|
||||
locations:
|
||||
- street_address: Strahovské nádvoří 132
|
||||
postal_code: 118 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.086114
|
||||
longitude: 14.388775
|
||||
processing_timestamp: '2025-12-06T23:37:20.874363+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-KKPSSK
|
||||
ghcid_original: CZ-HL-PRA-H-KKPSSK
|
||||
ghcid_uuid: b2c77662-30d1-5657-b6ec-508b07c76189
|
||||
ghcid_uuid_sha256: d4c0208c-8472-8a7a-19ec-cf9964c4be3f
|
||||
ghcid_numeric: 15330288919458781818
|
||||
record_id: 364d0cfc-9693-4138-9891-541407733834
|
||||
generation_timestamp: '2025-12-06T23:37:20.874363+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-KKPSSK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-KKPSSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-KKPSSK
|
||||
ghcid_numeric: 15330288919458781818
|
||||
valid_from: '2025-12-06T23:37:20.874363+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Královská kanonie premonstrátů na Strahově - Strahovská knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-KKPSSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: b2c77662-30d1-5657-b6ec-508b07c76189
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: d4c0208c-8472-8a7a-19ec-cf9964c4be3f
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15330288919458781818'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 364d0cfc-9693-4138-9891-541407733834
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- identifier_scheme: Wikidata
|
||||
identifier_value: Q12056498
|
||||
identifier_url: https://www.wikidata.org/wiki/Q12056498
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.992094+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-RhhGG/XuQP8MzMGh7TLIasW7BJlzmG09o/v3YuYczE4=
|
||||
verified_at: '2025-12-28T19:58:22.563127+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q12056498
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.563143+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.992094+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Královská kanonie premonstrátů na Strahově - Strahovská knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.992094+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.992094+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.992094+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE459
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.992094+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:20.874363+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.086114
|
||||
longitude: 14.388775
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 00
|
||||
street_address: Strahovské nádvoří 132
|
||||
normalization_timestamp: '2025-12-09T06:50:46.810708+00:00'
|
||||
wikidata_enrichment:
|
||||
wikidata_id: Q12056498
|
||||
wikidata_url: https://www.wikidata.org/wiki/Q12056498
|
||||
matched_by: sigla_identifier
|
||||
matched_sigla: ABE459
|
||||
wikidata_label: Strahovská knihovna
|
||||
wikidata_description: klášterní knihovna
|
||||
enrichment_date: '2025-12-19T10:12:59.125749+00:00'
|
||||
enrichment_version: 2.1.0
|
||||
wikidata_coordinates:
|
||||
longitude: 14.3892417
|
||||
latitude: 50.0858778
|
||||
image: https://commons.wikimedia.org/wiki/Special:FilePath/Strahov_knihovna_2.jpg
|
||||
instance_of:
|
||||
- Q1776381
|
||||
located_in: Q1085
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-RhhGG/XuQP8MzMGh7TLIasW7BJlzmG09o/v3YuYczE4=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.563127+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q12056498
|
||||
generatedAtTime: '2025-12-19T10:12:59.125749+00:00'
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.563134+00:00'
|
||||
wikidata_entity_id: Q12056498
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:00.222655+00:00'
|
||||
source_url: https://strahovskyklaster.tritius.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://strahovskyklaster.tritius.cz/apple-touch-icon-180x180.png
|
||||
source_url: https://strahovskyklaster.tritius.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(14)'
|
||||
retrieved_on: '2025-12-23T21:56:00.222655+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: 180x180
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 14
|
||||
164
data/custodian.backup.20251230/CZ-10-PRA-H-MBTSK.yaml
Normal file
|
|
@ -0,0 +1,164 @@
|
|||
original_entry:
|
||||
name: Mezinárodní baptistický teologický seminář - Knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE453
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '25741683'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ25741683
|
||||
locations:
|
||||
- street_address: Nad Habrovkou 3 - Jenerálka
|
||||
postal_code: 164 00
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.105856
|
||||
longitude: 14.351694
|
||||
processing_timestamp: '2025-12-06T23:37:15.760570+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-MBTSK
|
||||
ghcid_original: CZ-HL-PRA-H-MBTSK
|
||||
ghcid_uuid: 04b9e8ce-ced9-53f0-99b8-9c2e092da029
|
||||
ghcid_uuid_sha256: 55dce9c3-069f-84b7-0b6a-0dd7326feead
|
||||
ghcid_numeric: 6187077011939239095
|
||||
record_id: cbc98ac0-743f-4787-8ce0-6e14e643b10b
|
||||
generation_timestamp: '2025-12-06T23:37:15.760570+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-MBTSK
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-H-MBTSK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-H-MBTSK
|
||||
ghcid_numeric: 6187077011939239095
|
||||
valid_from: '2025-12-06T23:37:15.760570+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Mezinárodní baptistický teologický seminář - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-MBTSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 04b9e8ce-ced9-53f0-99b8-9c2e092da029
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 55dce9c3-069f-84b7-0b6a-0dd7326feead
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '6187077011939239095'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: cbc98ac0-743f-4787-8ce0-6e14e643b10b
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.913964+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913964+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Mezinárodní baptistický teologický seminář - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913964+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913964+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913964+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE453
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913964+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.760570+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.105856
|
||||
longitude: 14.351694
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 164 00
|
||||
street_address: Nad Habrovkou 3 - Jenerálka
|
||||
normalization_timestamp: '2025-12-09T06:50:46.833765+00:00'
|
||||
280
data/custodian.backup.20251230/CZ-10-PRA-H-PBFK.yaml
Normal file
|
|
@ -0,0 +1,280 @@
|
|||
original_entry:
|
||||
name: Provincie bratří františkánů - Knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE450
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00169757
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.ofm.cz/
|
||||
identifier_url: https://www.ofm.cz/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617806
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617806
|
||||
locations:
|
||||
- street_address: Františkánský klášter, Jungmannovo nám. 18
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.082922
|
||||
longitude: 14.422764
|
||||
processing_timestamp: '2025-12-06T23:37:15.748590+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-PBFK
|
||||
ghcid_original: CZ-HL-PRA-H-PBFK
|
||||
ghcid_uuid: 9f43a1a6-41ba-51f2-b7a4-ac6b39b4bbf8
|
||||
ghcid_uuid_sha256: 03bdfd08-7db8-8530-d93c-26b054e94fcc
|
||||
ghcid_numeric: 269649765622883632
|
||||
record_id: d87ab3a6-1853-4ee2-932b-db037763e890
|
||||
generation_timestamp: '2025-12-06T23:37:15.748590+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-PBFK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-PBFK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-PBFK
|
||||
ghcid_numeric: 269649765622883632
|
||||
valid_from: '2025-12-06T23:37:15.748590+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Provincie bratří františkánů - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-PBFK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 9f43a1a6-41ba-51f2-b7a4-ac6b39b4bbf8
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 03bdfd08-7db8-8530-d93c-26b054e94fcc
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '269649765622883632'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: d87ab3a6-1853-4ee2-932b-db037763e890
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.913803+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-U1Y4vNGXYIhTAtrF4kYrXTpmMiLky7takLaS+zkOj1k=
|
||||
verified_at: '2025-12-28T19:58:22.593319+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617806
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.593344+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913803+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Provincie bratří františkánů - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913803+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913803+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913803+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617806
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617806
|
||||
timestamp: '2025-11-19T10:14:29.913803+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE450
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913803+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.748590+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617806
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:04.369044+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P18
|
||||
wikidata_labels:
|
||||
cs: Provincie bratří františkánů, knihovna
|
||||
en: Ordo Fratrum Minorum
|
||||
wikidata_label_en: Ordo Fratrum Minorum
|
||||
wikidata_label_cs: Provincie bratří františkánů, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- OFM
|
||||
wikidata_sitelinks:
|
||||
commonswiki: Category:Franciscan library (Monastery of Our Lady of the Snows)
|
||||
cswiki: Františkánská knihovna kláštera Panny Marie Sněžné v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q50379845
|
||||
label: religious library
|
||||
description: type of library with collections that focus on religion and related subjects
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_media:
|
||||
image: Frantiskanska Knihovna PMS B.jpg
|
||||
wikidata_image: Frantiskanska Knihovna PMS B.jpg
|
||||
wikidata_contact:
|
||||
email: mailto:praha@ofm.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-U1Y4vNGXYIhTAtrF4kYrXTpmMiLky7takLaS+zkOj1k=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.593319+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617806
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.593334+00:00'
|
||||
location:
|
||||
latitude: 50.082922
|
||||
longitude: 14.422764
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Františkánský klášter, Jungmannovo nám. 18
|
||||
normalization_timestamp: '2025-12-09T06:50:46.856803+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:07.748377+00:00'
|
||||
source_url: https://www.ofm.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: logo_url
|
||||
claim_value: https://www.ofm.cz/wp-content/uploads/2020/03/Logo-mobil-s-tau.png
|
||||
source_url: https://www.ofm.cz
|
||||
css_selector: '#mobhead > div.bg-area.stickyable > div.logo-main-wrap.logo-mob-wrap > div.logo.logo-mobile > a > span.logo-img > img'
|
||||
retrieved_on: '2025-12-23T21:56:07.748377+00:00'
|
||||
extraction_method: crawl4ai_header_logo
|
||||
detection_confidence: high
|
||||
alt_text: ''
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.ofm.cz/wp-content/uploads/2020/03/cropped-tau-180x180.png
|
||||
source_url: https://www.ofm.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(21)'
|
||||
retrieved_on: '2025-12-23T21:56:07.748377+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 2
|
||||
has_primary_logo: true
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 3
|
||||
238
data/custodian.backup.20251230/CZ-10-PRA-H-ZOVPK.yaml
Normal file
|
|
@ -0,0 +1,238 @@
|
|||
original_entry:
|
||||
name: Židovská obec v Praze - Knihovna
|
||||
institution_type: HOLY_SITES
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE461
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00445258
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.kehilaprag.cz/cs
|
||||
identifier_url: https://www.kehilaprag.cz/cs
|
||||
locations:
|
||||
- street_address: Jáchymova 3
|
||||
postal_code: 110 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.088591
|
||||
longitude: 14.419118
|
||||
- street_address: Maiselova 18
|
||||
postal_code: 110 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.089729
|
||||
longitude: 14.418438
|
||||
processing_timestamp: '2025-12-06T23:37:27.311926+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-H-ZOVPK
|
||||
ghcid_original: CZ-HL-PRA-H-ŽOVPK
|
||||
ghcid_uuid: df93d711-09ac-5a4e-8261-47033c74bd57
|
||||
ghcid_uuid_sha256: ddbab9d1-455e-8454-8d31-c524072da3f0
|
||||
ghcid_numeric: 15977286936513377364
|
||||
record_id: f61482d0-de24-410c-a3d4-b29046ced869
|
||||
generation_timestamp: '2025-12-06T23:37:27.311926+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-H-ZOVPK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-H-ZOVPK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-H-ZOVPK
|
||||
ghcid_numeric: 15977286936513377364
|
||||
valid_from: '2025-12-08T11:21:32.038215+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ŽOVPK)'
|
||||
- ghcid: CZ-HL-PRA-H-ŽOVPK
|
||||
ghcid_numeric: 13994694256066163100
|
||||
valid_from: '2025-12-06T23:37:27.311926+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:32.038215+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Židovská obec v Praze - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-H-ZOVPK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: df93d711-09ac-5a4e-8261-47033c74bd57
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: ddbab9d1-455e-8454-8d31-c524072da3f0
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15977286936513377364'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: f61482d0-de24-410c-a3d4-b29046ced869
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- identifier_scheme: Wikidata
|
||||
identifier_value: Q114619623
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619623
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.062272+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-BwfWLwIgLXgtI0ZrvBCNnlwPntx8jJnJqHZl3vOwLKs=
|
||||
verified_at: '2025-12-28T19:58:22.612588+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619623
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.612603+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.HOL
|
||||
subtype_label: HOLY_SITES
|
||||
ontology_class: schema:PlaceOfWorship
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Židovská obec v Praze - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: HOLY_SITES
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE461
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.062272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:27.311926+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.088591
|
||||
longitude: 14.419118
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 01
|
||||
street_address: Jáchymova 3
|
||||
normalization_timestamp: '2025-12-09T06:50:46.879233+00:00'
|
||||
wikidata_enrichment:
|
||||
wikidata_id: Q114619623
|
||||
wikidata_url: https://www.wikidata.org/wiki/Q114619623
|
||||
matched_by: sigla_identifier
|
||||
matched_sigla: ABE461
|
||||
wikidata_label: Knihovna Židovské obce v Praze
|
||||
wikidata_description: knihovna v Praze
|
||||
enrichment_date: '2025-12-19T10:12:59.880497+00:00'
|
||||
enrichment_version: 2.1.0
|
||||
instance_of:
|
||||
- Q50379845
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-BwfWLwIgLXgtI0ZrvBCNnlwPntx8jJnJqHZl3vOwLKs=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.612588+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619623
|
||||
generatedAtTime: '2025-12-19T10:12:59.880497+00:00'
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.612594+00:00'
|
||||
wikidata_entity_id: Q114619623
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:15.191049+00:00'
|
||||
source_url: https://www.kehilaprag.cz/cs
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.kehilaprag.cz/img/favicon.ico
|
||||
source_url: https://www.kehilaprag.cz/cs
|
||||
css_selector: '[document] > html.js.flexbox > head > link:nth-of-type(5)'
|
||||
retrieved_on: '2025-12-23T21:56:15.191049+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/vnd.microsoft.icon
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
167
data/custodian.backup.20251230/CZ-10-PRA-L-A-aritma.yaml
Normal file
|
|
@ -0,0 +1,167 @@
|
|||
original_entry:
|
||||
name: ARITMA
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF002
|
||||
locations:
|
||||
- street_address: Lužná 591
|
||||
postal_code: 160 05
|
||||
city: Praha 6
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:23.806189+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-A-aritma
|
||||
ghcid_original: CZ-XX-PRA-L-A-aritma
|
||||
ghcid_uuid: 76eb7195-fef9-517a-b3c3-a9c661376eba
|
||||
ghcid_uuid_sha256: bcf70c56-46dc-8470-481a-41cfc2a8a9d6
|
||||
ghcid_numeric: 13616365563073590384
|
||||
record_id: 118fa2dd-4ba6-484e-ad04-d3b1429e6973
|
||||
generation_timestamp: '2025-12-06T23:37:23.806189+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:56.091145+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-XX-PRA-L-A-aritma
|
||||
ghcid_numeric: 13616365563073590384
|
||||
valid_from: '2025-12-06T23:37:23.806189+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
- ghcid: CZ-10-PRA-L-A-aritma
|
||||
valid_from: '2025-12-07T11:42:56.091156+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ARITMA
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-XX-PRA-L-A-aritma
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 76eb7195-fef9-517a-b3c3-a9c661376eba
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: bcf70c56-46dc-8470-481a-41cfc2a8a9d6
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13616365563073590384'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 118fa2dd-4ba6-484e-ad04-d3b1429e6973
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.031409+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:48:54Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031409+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ARITMA
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031409+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031409+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031409+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF002
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031409+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:23.806189+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Aritma AS
|
||||
rejection_reason: The candidate is rejected due to a location mismatch. The source institution is in the Czech Republic
|
||||
(CZ), while the Google Maps candidate is in Norway.
|
||||
timestamp: '2025-12-08T19:48:54.377041+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: ARITMA official
|
||||
youtube_search_timestamp: '2025-12-08T19:48:54.716445+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 160 05
|
||||
street_address: Lužná 591
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.384106+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.384087+00:00'
|
||||
380
data/custodian.backup.20251230/CZ-10-PRA-L-A.yaml
Normal file
|
|
@ -0,0 +1,380 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ARTIA
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ARTIA
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE391
|
||||
locations:
|
||||
- street_address: Ve Smečkách 30
|
||||
postal_code: 111 27
|
||||
city: Praha 1
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:23.793671+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-A
|
||||
ghcid_original: CZ-XX-PRA-L-A
|
||||
ghcid_uuid: 29a89c74-39be-5a59-a59b-478c406d7f2e
|
||||
ghcid_uuid_sha256: 50203024-eb0a-8b06-f19b-349f090f646f
|
||||
ghcid_numeric: 5773667657409272582
|
||||
record_id: e7c66a51-321a-4d0c-a201-37c2ad3e52a3
|
||||
generation_timestamp: '2025-12-06T23:37:23.793671+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:54.550249+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-XX-PRA-L-A
|
||||
ghcid_numeric: 5773667657409272582
|
||||
valid_from: '2025-12-06T23:37:23.793671+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
- ghcid: CZ-10-PRA-L-A
|
||||
valid_from: '2025-12-07T11:42:54.550261+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-XX-PRA-L-A
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 29a89c74-39be-5a59-a59b-478c406d7f2e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 50203024-eb0a-8b06-f19b-349f090f646f
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '5773667657409272582'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: e7c66a51-321a-4d0c-a201-37c2ad3e52a3
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.031357+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:48:58Z: Maps: Aukční galerie Artia (conf: 0.80); YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
google_maps_enrichment:
|
||||
content_hash: sha256-XhqEoqSXvzYXCoUgohSnvcuCdFAeXpKseKvmWLEEJhI=
|
||||
verified_at: '2025-12-28T19:58:22.648107+00:00'
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.648163+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031357+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ARTIA
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031357+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031357+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031357+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE391
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031357+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:23.793671+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_enrichment:
|
||||
place_id: ChIJPakg9oSUC0cRo5rduYqOuGw
|
||||
name: Aukční galerie Artia
|
||||
fetch_timestamp: '2025-12-08T19:48:55.686290+00:00'
|
||||
api_status: OK
|
||||
coordinates:
|
||||
latitude: 50.0832839
|
||||
longitude: 14.4320287
|
||||
formatted_address: Opletalova 1417/25, Nové Město, 110 00 Praha-Praha 1, Czechia
|
||||
short_address: Opletalova 1417/25, Nové Město
|
||||
phone_local: 222 220 298
|
||||
phone_international: +420 222 220 298
|
||||
website: http://www.aukcni-galerie.cz/
|
||||
google_place_types:
|
||||
- art_gallery
|
||||
- finance
|
||||
- home_goods_store
|
||||
- point_of_interest
|
||||
- store
|
||||
- establishment
|
||||
business_status: OPERATIONAL
|
||||
rating: 2.9
|
||||
total_ratings: 34
|
||||
reviews:
|
||||
- author_name: Paul Thomas
|
||||
author_uri: https://www.google.com/maps/contrib/100778712720214769548/reviews
|
||||
rating: 5
|
||||
relative_time_description: 8 years ago
|
||||
text: I was very satisfied with their service. I can recommend them.
|
||||
publish_time: '2017-11-21T16:47:48.188Z'
|
||||
- author_name: Marek Marek
|
||||
author_uri: https://www.google.com/maps/contrib/109431573954514849120/reviews
|
||||
rating: 1
|
||||
relative_time_description: a year ago
|
||||
text: 'I add my experience with the Olomouc branch:
|
||||
|
||||
First of all, I would like to say that I approached the gallery with confidence because I inherited a few paintings.
|
||||
As an art connoisseur and a busy person, I asked the gallery for an evaluation, pricing and subsequent sale of the
|
||||
paintings. And I would like to share the negative experience I had, which may help someone when choosing or not choosing
|
||||
an auction gallery.
|
||||
|
||||
On September 7, 2023, I contacted the auction house via WA with a request to see if they would take the paintings
|
||||
to the auction (8 pieces, photos of all paintings sent to WA), they replied that they would accept all the paintings.
|
||||
|
||||
On September 14, 2023, after a telephone appointment, I brought the paintings to the Olomouc branch (my fundamental
|
||||
mistake was that I did not wait for the list of paintings), I was assured by the branch employee that the list (contract)
|
||||
would be sent to my email, I agreed and left the branch.
|
||||
|
||||
On September 18, 2023, I received an email with a contract where two items were missing from the list and a whirlwind
|
||||
of inconveniences began.
|
||||
|
||||
One painting (religious direction - Madonna) and also "view of the castle".
|
||||
|
||||
After a phone call, I was assured by an employee that everything would be fixed and that he would find the painting
|
||||
and amend the contract...
|
||||
|
||||
During October 2023, there were a few phone calls when I was told that the painting was "lost", which the branch employee
|
||||
admitted that he did not understand how this could have happened and how I would imagine a replacement (I, as a complete
|
||||
layman in art, and with two photos of the paintings, am supposed to determine the amount of compensation).
|
||||
|
||||
And that for the Madonna painting I have to get a document approving the sale - that it was not stolen from the church.
|
||||
|
||||
At the beginning of December, I visited the branch in person and the employee still did not know how to explain /
|
||||
resolve the situation.
|
||||
|
||||
That he would discuss the situation with his superior who was sick that week and that he would call back. I would
|
||||
assume that a gallery that deals with art (I assume even higher values) will be equipped with at least a security
|
||||
camera. When I asked an employee to check the camera footage, I was told that they don''t have cameras.... In some
|
||||
discussions, the tone was that I didn''t bring the painting at all and that I was trying to damage the gallery/employee...
|
||||
I must also mention that I regularly had to call the Olomouc branch on my own initiative and call for a resolution
|
||||
to the situation.
|
||||
|
||||
After a few days, an employee of the Olomouc branch called me for the first time and said that the auction gallery
|
||||
was offering compensation in the amount of 1000 CZK and if I didn''t agree, I should resolve the situation with the
|
||||
management in Prague - politely said that they no longer care and let me resolve it with Prague myself.
|
||||
|
||||
On 10.1.2024, I picked up the paintings that fortunately were not lost and the compensation.
|
||||
|
||||
|
||||
And let the lost painting itself, or the amount from the sale of the lost painting, make the person in question happy...'
|
||||
publish_time: '2024-01-10T19:24:14.553436Z'
|
||||
- author_name: Marcela R
|
||||
author_uri: https://www.google.com/maps/contrib/107348664986788813882/reviews
|
||||
rating: 1
|
||||
relative_time_description: a month ago
|
||||
text: This gallery clearly does not communicate with clients, I do not recommend anyone to leave their paintings with
|
||||
them, according to reviews they either lose them or withhold payments.
|
||||
publish_time: '2025-10-25T17:35:35.975313577Z'
|
||||
- author_name: Jana Adámková
|
||||
author_uri: https://www.google.com/maps/contrib/101648565896947406982/reviews
|
||||
rating: 1
|
||||
relative_time_description: a year ago
|
||||
text: From the seller's perspective, my experience with the gallery is absolutely tragic. Mr. Stehlík has been completely
|
||||
ignoring me for 3/4 years, he doesn't respond, he doesn't pay out money, he has a catastrophic mess in his sold and
|
||||
unsold paintings. He has had my paintings with him for almost two years and he is unable to pay me for the auctioned
|
||||
items and return the correct number of unsold ones. His actions are leading to a criminal complaint, which is the
|
||||
only thing that will help. I DO NOT RECOMMEND this gallery at all!
|
||||
publish_time: '2024-12-04T11:01:12.318077Z'
|
||||
- author_name: Lubomír Frejka
|
||||
author_uri: https://www.google.com/maps/contrib/106970323732632081169/reviews
|
||||
rating: 1
|
||||
relative_time_description: a year ago
|
||||
text: 'It has now been about 2 months since I received the painting that I auctioned and paid for from the ARTIA GALLERY
|
||||
IN PRAGUE. I was told by one of the employees that he was sorry that they could not find the painting. I was asked
|
||||
if I wanted a refund or if I wanted to wait until they found it..(???). I said that I would wait until they found
|
||||
it.
|
||||
|
||||
A similar situation happened to me here at the Artia Gallery in Prague for the second time, once before this incident,
|
||||
they could not find the painting that I auctioned and paid for.
|
||||
|
||||
FROM THIS IT CAN BE LOGICALLY ASSUMED THAT THE AUCTIONED ITEMS ARE NOT PAID FOR WHEN THE AUCTIONEER GOES TO PICK UP
|
||||
THE AUCTIONED ITEM, BUT THE ARTIA GALLERY EMPLOYEES CANNOT FIND THE ITEM(S) AND PROPERLY HAND IT OVER.
|
||||
|
||||
!!! AS A CONDITION FOR PAYMENT OF THE AUCTIONED ITEM, THE HEAD OF THE ARTIA GALLERY STATES THAT THE AUCTIONED ITEM
|
||||
MUST BE PAID BY THE AUCTIONED ITEM AND TAKES THE ITEM BACK!!! ONLY AFTER THE GALLERY CAN SEND THE MONEY TO THE CLIENT.
|
||||
|
||||
So far, I have not been paid by the Artia gallery for a large amount of auctioned items that I put up for auction
|
||||
there. Otherwise, the payment usually took a loooooong time.
|
||||
|
||||
My items appeared in auctions only occasionally - very rarely, I often warned them about it and most of the time they
|
||||
did not appear in auctions again and again.
|
||||
|
||||
Yesterday I went to pick up the rest of the un-auctioned items, because I am ending the "relationship" with them,
|
||||
I did not get everything back. And moreover, I wrote them a question or request in advance for "payment" of my auctioned
|
||||
items. The boss didn''t comment on this, I asked him personally and when I was leaving, the boss told me that he didn''t
|
||||
know, that he would look into it and find out. AND I HAD HIM IN MY BACK AS HE SHOWED ME OUT SO THAT I WOULDN''T BE
|
||||
DELAYED ANYMORE. HE EVEN SENT HIS SUBORDINATE TO HELP ME - HE REALLY HELPED ME LOAD MY THINGS INTO THE CAR AND I THANK
|
||||
THAT GUY FOR THAT.
|
||||
|
||||
!!! USING THE INTERMEDIATION OF THE SALE OF ITEMS THROUGH THE ARTIA GALLERY IN PRAGUE - NEVER AGAINYYYYYY !!!
|
||||
|
||||
Now I will continue to wait to see whether the boss of the ARTIA GALLERY IN PRAGUE will send the money or not, so
|
||||
hopefully I will live to see it.'
|
||||
publish_time: '2024-08-15T00:20:34.897538Z'
|
||||
opening_hours:
|
||||
open_now: false
|
||||
weekday_text:
|
||||
- 'Monday: Closed'
|
||||
- 'Tuesday: 10:00 AM – 5:00 PM'
|
||||
- 'Wednesday: 10:00 AM – 5:00 PM'
|
||||
- 'Thursday: 10:00 AM – 5:00 PM'
|
||||
- 'Friday: Closed'
|
||||
- 'Saturday: Closed'
|
||||
- 'Sunday: Closed'
|
||||
photo_count: 10
|
||||
photos_metadata:
|
||||
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHrSgcHRYMycbAUb1ldutEmvNEi3aIlJBTSB4JnR4Z2-SCxIRpHAUoef8r8iarhFedqL_gT6xyVszDlY4A2c6UdVrapiQ3ROwYmuygjuLwuslcgnrV58WrxTgCTIkRtRg6jsbJcwbjromcqQJAv6UJF-E0vQLqRTE_bOhikQPYKJH4EDyKjpr4uy4n2a89c-tSPt8yCLGE3IQ2I1-pVyDP9DYuZm-Wlq14ccrGeF2ib8u2EPieiyldf0bRd52_jF_0Ok49XEaYjfuache0B1copgeWTyIroWs2-IaCJ4mCBJ2w
|
||||
height: 1800
|
||||
width: 1200
|
||||
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHowbxeoJBVwxkiyEfS0S8M95rlkyrNCNl2lzxk7I3FVAhPmK_1YPTu8yBKV_OdZdPPX414GCFSud2BuNZOKLMTepZvolbEoK8uoV6rWj657u0q1tl4xPpc64Lfg8JOg0qD1akDRiRn6stdePKQ8YFIaMrpcJjQnAnPShnw0EyuxoccttiQW0FYNe3LdlbuH2x9RUKt946vC8sSRl1QGCf2un0wCR1XZr7RwXCLbhnP7GTNU6nbkl-52mSHrxKyqfLuOdCAsG0SuDFf9G3P41A49_30eXNvGrgO9EhzNbeUDJw
|
||||
height: 1800
|
||||
width: 1200
|
||||
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHpnLoHJoQfI4tJHD6obZh8oXfGIYA9etUTQZKKfirSs8LyDYVT7UTnSEFhjAldTO5SrSMBWLaPNlKpbdyrqWi9yJ9qIKdOxVRKgrdTfSMqZzjkDy7x2JYbLX7DmHb8ZQXqME4S7IAqsIpqSyP2NRdOo4USnp05aWQn75XnODy_O_G1Bz9I6gPcBnDS985xyDoe04JpvphRiIjgl9VzXjTF3L0SH0-LPQLU8kz_cTCgb-hHCu0Wg7BrWyeXQQxmP8YXGqt4Wb_wyOxuxO9hwSRpr0_xCQxF3otLoFN_9tAvkjg
|
||||
height: 2081
|
||||
width: 2551
|
||||
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHrA6dRqiCr6RAvngdJAuHwduYql3XuiRnEy8D1hvhdwRD61dDQIWpC1ocMsR2W9p9Io403G7gj88QGefMjAyc2qMPHl_h2mi208T1F7Q871nMcXVQZJlntEmH1pHdfSj2Rk7PFp0TKVJfP7SDjTZ4zQKDKKa0hzbpjD_PA6r3w6aeiLd_m55aTjPIK-WtPYaJlx5sl8sScDQYs4N8s4yUULmDJDyu7Xd-4S_t8eVSLJS_9ybMvfqasKP9GZPez3SNxFSOmab5ODdo3D02ry9IrIxkGtlE3ovlquOJBOh4mPyg
|
||||
height: 800
|
||||
width: 1200
|
||||
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHqq9korXxypDF6stf9Jm5YdSsOs3WGvQKHGN3GiNK8zaWoy5qi3bVDRFjwdLMtHvM3ft5aCTaGfREeGY0-et5siga7NHjnPUFfmvh-lseYEzIy683ejQt8VgYX22GTzLauRp-i_X3TmvMLfHX9lMeYdMjU_k2yDWL_Q3vv64vyO9fdPSP4gHUof0rM2WNMffyW87NpYWJ0cCnAe5lVEaaIWsfzbnRBDwAO2BXRyCLE1B-3vX03rMfspp5-Tj5oFcqHZFWTti7gW_3svOtzBqDeBM-GMsveoIiyLe2nm3zKLsg
|
||||
height: 2233
|
||||
width: 1680
|
||||
google_maps_url: https://maps.google.com/?cid=7834168278285916835&g_mp=Cidnb29nbGUubWFwcy5wbGFjZXMudjEuUGxhY2VzLlNlYXJjaFRleHQQAhgEIAA
|
||||
llm_verification:
|
||||
is_match: true
|
||||
confidence: 0.8
|
||||
entity_type: GRP.HER
|
||||
reasoning: 'NAME MATCH: The candidate name ''Aukční galerie Artia'' is a direct and specific match for the source institution
|
||||
''ARTIA''. The added words ''Aukční galerie'' (Auction Gallery) specify the institution''s function. LOCATION MATCH:
|
||||
Both the source (CZ) and candidate (Czechia) are in the same country. The source has no city specified, but a match
|
||||
is possible. TYPE MATCH: The candidate has the Google Place type ''art_gallery'', which is an expected type for a GRP.HER
|
||||
institution. ENTITY TYPE: The candidate is an auction gallery, which falls under the definition of an art gallery, a
|
||||
type of heritage custodian.'
|
||||
agent: glm-4.6
|
||||
verified: true
|
||||
ch_annotator_version: ch_annotator-v1_7_0
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-XhqEoqSXvzYXCoUgohSnvcuCdFAeXpKseKvmWLEEJhI=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.648107+00:00'
|
||||
prov:
|
||||
wasDerivedFrom: https://maps.googleapis.com/maps/api/place/details/json?place_id=ChIJPakg9oSUC0cRo5rduYqOuGw
|
||||
generatedAtTime: '2025-12-08T19:48:55.686290+00:00'
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: google_maps_api_fetch
|
||||
used: https://maps.googleapis.com/maps/api/place
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.648119+00:00'
|
||||
google_maps_status: SUCCESS
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: ARTIA official
|
||||
youtube_search_timestamp: '2025-12-08T19:48:58.971457+00:00'
|
||||
location:
|
||||
latitude: 50.0832839
|
||||
longitude: 14.4320287
|
||||
coordinate_provenance:
|
||||
source_type: GOOGLE_MAPS
|
||||
source_path: google_maps_enrichment.coordinates
|
||||
original_timestamp: '2025-12-08T19:48:55.686290+00:00'
|
||||
entity_id: ChIJPakg9oSUC0cRo5rduYqOuGw
|
||||
city: Praha 1
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 111 27
|
||||
street_address: Opletalova 1417/25, Nové Město
|
||||
formatted_address: Opletalova 1417/25, Nové Město, 110 00 Praha-Praha 1, Czechia
|
||||
geonames_id: 8629195
|
||||
normalization_timestamp: '2025-12-09T06:49:29.190208+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:24.466143+00:00'
|
||||
source_url: http://www.aukcni-galerie.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: http://www.aukcni-galerie.cz/www/images/favicons/safari-pinned-tab.svg
|
||||
source_url: http://www.aukcni-galerie.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(7)'
|
||||
retrieved_on: '2025-12-23T21:56:24.466143+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 5
|
||||
279
data/custodian.backup.20251230/CZ-10-PRA-L-AAVSK.yaml
Normal file
|
|
@@ -0,0 +1,279 @@
|
|||
original_entry:
|
||||
name: Anglo-americká vysoká škola - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABD185
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '25940082'
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.aauni.edu/student-life/student-services-support/library/
|
||||
identifier_url: https://www.aauni.edu/student-life/student-services-support/library/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114625129
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114625129
|
||||
locations:
|
||||
- street_address: Letenská 1
|
||||
postal_code: 118 00
|
||||
city: Praha 1 - Malá Strana
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.089958
|
||||
longitude: 14.409755
|
||||
processing_timestamp: '2025-12-06T23:37:43.296691+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AAVSK
|
||||
ghcid_original: CZ-HL-PRA-L-AAVŠK
|
||||
ghcid_uuid: fa01e47a-1395-5d3b-b3f5-fd7a8a7e0203
|
||||
ghcid_uuid_sha256: 0e5fbd97-5d0e-8894-a497-4881404388a0
|
||||
ghcid_numeric: 1035754897117431956
|
||||
record_id: fa23751d-4037-462c-8093-fe2a739fdcb5
|
||||
generation_timestamp: '2025-12-06T23:37:43.296691+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AAVSK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AAVSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AAVSK
|
||||
ghcid_numeric: 1035754897117431956
|
||||
valid_from: '2025-12-08T11:21:31.224194+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AAVŠK)'
|
||||
- ghcid: CZ-HL-PRA-L-AAVŠK
|
||||
ghcid_numeric: 3715561979721111793
|
||||
valid_from: '2025-12-06T23:37:43.296691+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.224194+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Anglo-americká vysoká škola - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AAVSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: fa01e47a-1395-5d3b-b3f5-fd7a8a7e0203
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 0e5fbd97-5d0e-8894-a497-4881404388a0
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '1035754897117431956'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: fa23751d-4037-462c-8093-fe2a739fdcb5
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.303976+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-u9mzMLp1Si5DnHEknWbZcHYBr6yUDdVUyLMZKUJcdi4=
|
||||
verified_at: '2025-12-28T19:58:22.674183+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625129
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.674202+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303976+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Anglo-americká vysoká škola - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303976+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303976+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1 - Malá Strana
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303976+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114625129
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114625129
|
||||
timestamp: '2025-11-19T10:14:30.303976+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABD185
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303976+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:43.296691+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114625129
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:12.304688+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P463
|
||||
wikidata_labels:
|
||||
cs: Anglo-americká vysoká škola, knihovna
|
||||
wikidata_label_cs: Anglo-americká vysoká škola, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q1622062
|
||||
label: university library
|
||||
description: general library at an institution of higher education
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q753289
|
||||
label: Malá Strana
|
||||
description: neighborhood of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_organization:
|
||||
member_of:
|
||||
- id: Q4809890
|
||||
label: Association of Library and Information Professionals of the Czech Republic
|
||||
description: organization
|
||||
wikidata_contact:
|
||||
email: mailto:library@aauni.edu
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-u9mzMLp1Si5DnHEknWbZcHYBr6yUDdVUyLMZKUJcdi4=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.674183+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625129
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.674193+00:00'
|
||||
location:
|
||||
latitude: 50.089958
|
||||
longitude: 14.409755
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1 - Malá Strana
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 00
|
||||
street_address: Letenská 1
|
||||
normalization_timestamp: '2025-12-09T06:50:46.899191+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:29.425209+00:00'
|
||||
source_url: https://www.aauni.edu/student-life/student-services-support/library
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: logo_url
|
||||
claim_value: https://res.cloudinary.com/aauni/w_352,h_72,c_fill,q_auto:best,f_auto/web/2020/04/aau-logo.png
|
||||
source_url: https://www.aauni.edu/student-life/student-services-support/library
|
||||
css_selector: '#document > body.wp-singular.page-template > div.container.container_website > header > div.container.container_border > div.row.row_justify-space > div.logo > a > img'
|
||||
retrieved_on: '2025-12-23T21:56:29.425209+00:00'
|
||||
extraction_method: crawl4ai_header_logo
|
||||
detection_confidence: high
|
||||
alt_text: aau-logo
|
||||
- claim_type: favicon_url
|
||||
claim_value: http://www.aauni.edu/wp-content/themes/ds-prime/safari-pinned-tab.svg
|
||||
source_url: https://www.aauni.edu/student-life/student-services-support/library
|
||||
css_selector: '#document > head > link:nth-of-type(4)'
|
||||
retrieved_on: '2025-12-23T21:56:29.425209+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 2
|
||||
has_primary_logo: true
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 4
|
||||
161
data/custodian.backup.20251230/CZ-10-PRA-L-AC.yaml
Normal file
|
|
@@ -0,0 +1,161 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ART CENTRUM
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ART CENTRUM
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE169
|
||||
locations:
|
||||
- street_address: Nerudova 16
|
||||
postal_code: 118 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.342432+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AC
|
||||
ghcid_original: CZ-HL-PRA-L-AC
|
||||
ghcid_uuid: db7cbf05-3f46-5268-9150-644896014727
|
||||
ghcid_uuid_sha256: aa99e5c4-da60-85de-c377-ca43c3d6deff
|
||||
ghcid_numeric: 12293109291524617694
|
||||
record_id: bcafb00d-8c2d-459b-8d01-c5725dbe58e9
|
||||
generation_timestamp: '2025-12-06T23:37:15.342432+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AC
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AC
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AC
|
||||
ghcid_numeric: 12293109291524617694
|
||||
valid_from: '2025-12-06T23:37:15.342432+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AC
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: db7cbf05-3f46-5268-9150-644896014727
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: aa99e5c4-da60-85de-c377-ca43c3d6deff
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12293109291524617694'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: bcafb00d-8c2d-459b-8d01-c5725dbe58e9
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.908583+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908583+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ART CENTRUM
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908583+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908583+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908583+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE169
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908583+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.342432+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 00
|
||||
street_address: Nerudova 16
|
||||
normalization_timestamp: '2025-12-09T20:41:03.958695+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:03.958672+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-ACRSRK.yaml
Normal file
|
|
@@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: AstraZeneca Czech Republic, s.r.o. - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF315
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '63984482'
|
||||
locations:
|
||||
- street_address: Jinonice 921
|
||||
postal_code: 158 00
|
||||
city: Praha 5
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.055986
|
||||
longitude: 14.375324
|
||||
processing_timestamp: '2025-12-06T23:37:16.225751+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ACRSRK
|
||||
ghcid_original: CZ-HL-PRA-L-ACRSRK
|
||||
ghcid_uuid: cc4caad2-e43e-5bbc-8cce-57d96fc83458
|
||||
ghcid_uuid_sha256: 2d85a754-0282-8671-b9cd-b9ef6a47f693
|
||||
ghcid_numeric: 3280211882870462065
|
||||
record_id: 1a9051b6-2094-4c18-b1c7-d6b4f561c74e
|
||||
generation_timestamp: '2025-12-06T23:37:16.225751+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ACRSRK
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-ACRSRK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-ACRSRK
|
||||
ghcid_numeric: 3280211882870462065
|
||||
valid_from: '2025-12-06T23:37:16.225751+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AstraZeneca Czech Republic, s.r.o. - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ACRSRK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: cc4caad2-e43e-5bbc-8cce-57d96fc83458
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 2d85a754-0282-8671-b9cd-b9ef6a47f693
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '3280211882870462065'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 1a9051b6-2094-4c18-b1c7-d6b4f561c74e
|
||||
- *id001
|
||||
- *id002
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.917728+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917728+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AstraZeneca Czech Republic, s.r.o. - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917728+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917728+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 5
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917728+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF315
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917728+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.225751+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.055986
|
||||
longitude: 14.375324
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 5
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 158 00
|
||||
street_address: Jinonice 921
|
||||
normalization_timestamp: '2025-12-09T06:50:46.935738+00:00'
|
||||
169
data/custodian.backup.20251230/CZ-10-PRA-L-ACSRK.yaml
Normal file
|
|
@@ -0,0 +1,169 @@
|
|||
original_entry:
|
||||
name: AKCENT College s.r.o. - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABG320
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '27259439'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ27259439
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: http://akcentcollege.cz/
|
||||
identifier_url: http://akcentcollege.cz/
|
||||
locations:
|
||||
- street_address: Bítovská 5
|
||||
postal_code: 140 00
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.046178
|
||||
longitude: 14.454242
|
||||
processing_timestamp: '2025-12-06T23:37:37.561076+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ACSRK
|
||||
ghcid_original: CZ-HL-PRA-L-ACSRK
|
||||
ghcid_uuid: 025e0773-85f7-581d-a32c-9d41a749ba6c
|
||||
ghcid_uuid_sha256: ecc50b69-0996-88e3-5be9-a546c5b76de8
|
||||
ghcid_numeric: 17061055309123332323
|
||||
record_id: 704ef8ec-3d1c-4561-a641-671398d77657
|
||||
generation_timestamp: '2025-12-06T23:37:37.561076+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ACSRK
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-ACSRK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-ACSRK
|
||||
ghcid_numeric: 17061055309123332323
|
||||
valid_from: '2025-12-06T23:37:37.561076+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AKCENT College s.r.o. - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ACSRK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 025e0773-85f7-581d-a32c-9d41a749ba6c
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: ecc50b69-0996-88e3-5be9-a546c5b76de8
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '17061055309123332323'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 704ef8ec-3d1c-4561-a641-671398d77657
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.209358+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.209358+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AKCENT College s.r.o. - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.209358+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.209358+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.209358+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABG320
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.209358+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:37.561076+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.046178
|
||||
longitude: 14.454242
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 140 00
|
||||
street_address: Bítovská 5
|
||||
normalization_timestamp: '2025-12-09T06:50:46.958551+00:00'
|
||||
228
data/custodian.backup.20251230/CZ-10-PRA-L-ACVUK.yaml
Normal file
|
|
@@ -0,0 +1,228 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Americké centrum - Velvyslanectví USA - knihovna
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: Americké centrum - Velvyslanectví USA - knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE193
|
||||
- &id002
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://merkur.tritius.cz/library/americkecentrum/
|
||||
identifier_url: https://merkur.tritius.cz/library/americkecentrum/
|
||||
locations:
|
||||
- street_address: Tržiště 13
|
||||
postal_code: 118 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.087198
|
||||
longitude: 14.402043
|
||||
processing_timestamp: '2025-12-06T23:37:27.015821+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ACVUK
|
||||
ghcid_original: CZ-HL-PRA-L-ACVUK
|
||||
ghcid_uuid: 4e7ba70c-00a8-5e05-93c3-b52fa276cc35
|
||||
ghcid_uuid_sha256: 5768f4cb-1461-8e1e-273d-20cdae6537db
|
||||
ghcid_numeric: 6298553231935086110
|
||||
record_id: 44876a62-12bc-4625-853f-5637562ba19f
|
||||
generation_timestamp: '2025-12-06T23:37:27.015821+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ACVUK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-ACVUK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-ACVUK
|
||||
ghcid_numeric: 6298553231935086110
|
||||
valid_from: '2025-12-06T23:37:27.015821+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ACVUK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 4e7ba70c-00a8-5e05-93c3-b52fa276cc35
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 5768f4cb-1461-8e1e-273d-20cdae6537db
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '6298553231935086110'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 44876a62-12bc-4625-853f-5637562ba19f
|
||||
- *id001
|
||||
- *id002
|
||||
- identifier_scheme: Wikidata
|
||||
identifier_value: Q10817014
|
||||
identifier_url: https://www.wikidata.org/wiki/Q10817014
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.059345+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-gIZuWVWgdD1I4IEkEbjObCa+m84Z/CPDXPrUsfGYlc8=
|
||||
verified_at: '2025-12-28T19:58:22.721704+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q10817014
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.721722+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.059345+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Americké centrum - Velvyslanectví USA - knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.059345+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.059345+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.059345+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE193
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.059345+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:27.015821+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.087198
|
||||
longitude: 14.402043
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 01
|
||||
street_address: Tržiště 13
|
||||
normalization_timestamp: '2025-12-09T06:50:46.978265+00:00'
|
||||
wikidata_enrichment:
|
||||
wikidata_id: Q10817014
|
||||
wikidata_url: https://www.wikidata.org/wiki/Q10817014
|
||||
matched_by: sigla_identifier
|
||||
matched_sigla: ABE193
|
||||
wikidata_label: Americké centrum
|
||||
wikidata_description: kulturní a informační centrum v Praze
|
||||
enrichment_date: '2025-12-19T10:13:00.700773+00:00'
|
||||
enrichment_version: 2.1.0
|
||||
wikidata_coordinates:
|
||||
longitude: 14.4020222
|
||||
latitude: 50.0871972
|
||||
official_website: http://www.americkecentrum.cz/
|
||||
image: https://commons.wikimedia.org/wiki/Special:FilePath/Knihovna_Amerického_centra_v_Praze,_Vratislavský_palác,_Tržiště_13,_Malá_Strana.jpg
|
||||
instance_of:
|
||||
- Q7075
|
||||
located_in: Q1085
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-gIZuWVWgdD1I4IEkEbjObCa+m84Z/CPDXPrUsfGYlc8=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.721704+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q10817014
|
||||
generatedAtTime: '2025-12-19T10:13:00.700773+00:00'
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.721712+00:00'
|
||||
wikidata_entity_id: Q10817014
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:46.760476+00:00'
|
||||
source_url: https://merkur.tritius.cz/library/americkecentrum
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://merkur.tritius.cz/apple-touch-icon-180x180.png
|
||||
source_url: https://merkur.tritius.cz/library/americkecentrum
|
||||
css_selector: '[document] > html > head > link:nth-of-type(14)'
|
||||
retrieved_on: '2025-12-23T21:56:46.760476+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: 180x180
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 14
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-AHS.yaml
Normal file
|
|
@@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: AERO HOLDING, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF184
|
||||
locations:
|
||||
- street_address: Beranových 130
|
||||
postal_code: 199 04
|
||||
city: Praha 9 - Letňany
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.121078+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AHS
|
||||
ghcid_original: CZ-HL-PRA-L-AHS
|
||||
ghcid_uuid: df88c0a2-a575-52d1-b7ff-863c7db10bd0
|
||||
ghcid_uuid_sha256: f7dc7b4f-cb80-8f38-1590-178d46991ada
|
||||
ghcid_numeric: 17860285804891377464
|
||||
record_id: 2f7af94f-56d1-4625-b35f-7b827ef121b0
|
||||
generation_timestamp: '2025-12-06T23:37:16.121078+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AHS
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AHS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AHS
|
||||
ghcid_numeric: 17860285804891377464
|
||||
valid_from: '2025-12-06T23:37:16.121078+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AERO HOLDING, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AHS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: df88c0a2-a575-52d1-b7ff-863c7db10bd0
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: f7dc7b4f-cb80-8f38-1590-178d46991ada
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '17860285804891377464'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 2f7af94f-56d1-4625-b35f-7b827ef121b0
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.916759+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916759+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AERO HOLDING, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916759+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916759+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9 - Letňany
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916759+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF184
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916759+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.121078+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 199 04
|
||||
street_address: Beranových 130
|
||||
normalization_timestamp: '2025-12-09T20:41:04.012779+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.012755+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-AHUV.yaml
Normal file
|
|
@@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: Asociace hudebních umělců a vědců
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE414
|
||||
locations:
|
||||
- street_address: Radlická 99
|
||||
postal_code: 150 00
|
||||
city: Praha 5
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.711374+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AHUV
|
||||
ghcid_original: CZ-HL-PRA-L-AHUV
|
||||
ghcid_uuid: 17d2ba54-4381-58e4-b630-c602beb6e1c7
|
||||
ghcid_uuid_sha256: 12e7c450-6a56-84cb-c4ea-873bd4746685
|
||||
ghcid_numeric: 1362273261963334859
|
||||
record_id: e1401c07-fa12-4691-9fd3-f54dad479bf1
|
||||
generation_timestamp: '2025-12-06T23:37:15.711374+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AHUV
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AHUV
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AHUV
|
||||
ghcid_numeric: 1362273261963334859
|
||||
valid_from: '2025-12-06T23:37:15.711374+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Asociace hudebních umělců a vědců
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AHUV
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 17d2ba54-4381-58e4-b630-c602beb6e1c7
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 12e7c450-6a56-84cb-c4ea-873bd4746685
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '1362273261963334859'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: e1401c07-fa12-4691-9fd3-f54dad479bf1
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.913416+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913416+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Asociace hudebních umělců a vědců
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913416+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913416+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 5
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913416+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE414
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913416+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.711374+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 150 00
|
||||
street_address: Radlická 99
|
||||
normalization_timestamp: '2025-12-09T20:41:04.060480+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.060454+00:00'
|
||||
272
data/custodian.backup.20251230/CZ-10-PRA-L-AMSK.yaml
Normal file
|
|
@@ -0,0 +1,272 @@
|
|||
original_entry:
|
||||
name: Albatros Media, a.s. - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE395
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '28377141'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ28377141
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.albatros.cz/
|
||||
identifier_url: https://www.albatros.cz/
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617801
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617801
|
||||
locations:
|
||||
- street_address: 5. května 1746/22
|
||||
postal_code: 140 00
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.055332
|
||||
longitude: 14.438384
|
||||
processing_timestamp: '2025-12-06T23:37:15.679722+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AMSK
|
||||
ghcid_original: CZ-HL-PRA-L-AMSK
|
||||
ghcid_uuid: c1e86a97-1cbf-512b-a55d-f2bef63e9ec9
|
||||
ghcid_uuid_sha256: 385251a3-c9cc-824f-4807-3e6f793ecbe5
|
||||
ghcid_numeric: 4058395978121359951
|
||||
record_id: 0c98496e-9330-4f00-8187-75bd0a8db3d6
|
||||
generation_timestamp: '2025-12-06T23:37:15.679722+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AMSK
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AMSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AMSK
|
||||
ghcid_numeric: 4058395978121359951
|
||||
valid_from: '2025-12-06T23:37:15.679722+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Albatros Media, a.s. - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AMSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: c1e86a97-1cbf-512b-a55d-f2bef63e9ec9
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 385251a3-c9cc-824f-4807-3e6f793ecbe5
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '4058395978121359951'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 0c98496e-9330-4f00-8187-75bd0a8db3d6
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.913103+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-9QX7o3HSaCfPf0zp6mHwoxtSUV64HvydJehpPRTPjTE=
|
||||
verified_at: '2025-12-28T19:58:22.760818+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617801
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.760835+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913103+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Albatros Media, a.s. - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913103+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913103+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913103+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617801
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617801
|
||||
timestamp: '2025-11-19T10:14:29.913103+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE395
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913103+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.679722+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617801
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:20.334630+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Albatros Media, a.s., knihovna
|
||||
en: Albatros Media, Ltd., Library
|
||||
wikidata_label_en: Albatros Media, Ltd., Library
|
||||
wikidata_label_cs: Albatros Media, a.s., knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q385994
|
||||
label: special library
|
||||
description: specialized library providing resources on a particular topic or discipline
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2686587
|
||||
label: Prague 4
|
||||
description: municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:albatros@albatrosmedia.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-9QX7o3HSaCfPf0zp6mHwoxtSUV64HvydJehpPRTPjTE=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.760818+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617801
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.760826+00:00'
|
||||
location:
|
||||
latitude: 50.055332
|
||||
longitude: 14.438384
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 140 00
|
||||
street_address: 5. května 1746/22
|
||||
normalization_timestamp: '2025-12-09T06:50:47.041381+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:56:57.568457+00:00'
|
||||
source_url: https://www.albatros.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.albatros.cz/Content/images/favicons/albatros/safari-pinned-tab.svg
|
||||
source_url: https://www.albatros.cz
|
||||
css_selector: '[document] > html.js > head > link:nth-of-type(7)'
|
||||
retrieved_on: '2025-12-23T21:56:57.568457+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
- claim_type: og_image_url
|
||||
claim_value: https://www.albatros.cz/Content/images/albatrosmedia.png
|
||||
source_url: https://www.albatros.cz
|
||||
css_selector: '[document] > html.js > head > meta:nth-of-type(12)'
|
||||
retrieved_on: '2025-12-23T21:56:57.568457+00:00'
|
||||
extraction_method: crawl4ai_meta_og
|
||||
summary:
|
||||
total_claims: 2
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: true
|
||||
favicon_count: 4
|
||||
275
data/custodian.backup.20251230/CZ-10-PRA-L-AMUVPKA.yaml
Normal file
|
|
@@ -0,0 +1,275 @@
|
|||
original_entry:
|
||||
name: Akademie múzických umění v Praze - Knihovny AMU
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABD020
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '61384984'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ61384984
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://tritius.amu.cz/library/damu
|
||||
identifier_url: https://tritius.amu.cz/library/damu
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617654
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617654
|
||||
locations:
|
||||
- street_address: Malostranské nám. 13
|
||||
postal_code: 118 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.088175
|
||||
longitude: 14.402125
|
||||
processing_timestamp: '2025-12-06T23:37:14.752021+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AMUVPKA
|
||||
ghcid_original: CZ-HL-PRA-L-AMUVPKA
|
||||
ghcid_uuid: b3555de4-5475-59d4-86d8-b7fa1f70ef1f
|
||||
ghcid_uuid_sha256: d89fa37d-eda3-8c48-3afa-91cf9efb0c6d
|
||||
ghcid_numeric: 15609374594742574152
|
||||
record_id: 838af26a-375a-44d3-906b-bf4e512bbd58
|
||||
generation_timestamp: '2025-12-06T23:37:14.752021+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AMUVPKA
|
||||
valid_from: '2025-12-10T09:46:57Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AMUVPKA
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:57Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AMUVPKA
|
||||
ghcid_numeric: 15609374594742574152
|
||||
valid_from: '2025-12-06T23:37:14.752021+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie múzických umění v Praze - Knihovny AMU
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AMUVPKA
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: b3555de4-5475-59d4-86d8-b7fa1f70ef1f
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: d89fa37d-eda3-8c48-3afa-91cf9efb0c6d
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15609374594742574152'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 838af26a-375a-44d3-906b-bf4e512bbd58
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.805576+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-+35CKkOOVUe+x+M9wiRhyAWq3YDjjK3Lq69YsuZ8PvE=
|
||||
verified_at: '2025-12-28T19:58:22.781623+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617654
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.781639+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805576+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie múzických umění v Praze - Knihovny AMU
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805576+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805576+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805576+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617654
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617654
|
||||
timestamp: '2025-11-19T10:14:29.805576+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABD020
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805576+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.752021+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617654
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:28.375300+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P463
|
||||
wikidata_labels:
|
||||
cs: Akademie múzických umění v Praze, Knihovny AMU
|
||||
en: Academy of Performing Arts in Prague, Libraries
|
||||
wikidata_label_en: Academy of Performing Arts in Prague, Libraries
|
||||
wikidata_label_cs: Akademie múzických umění v Praze, Knihovny AMU
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- AMU
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q1622062
|
||||
label: university library
|
||||
description: general library at an institution of higher education
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_organization:
|
||||
member_of:
|
||||
- id: Q4809890
|
||||
label: Association of Library and Information Professionals of the Czech Republic
|
||||
description: organization
|
||||
wikidata_contact:
|
||||
email: mailto:knihovna.damu@damu.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-+35CKkOOVUe+x+M9wiRhyAWq3YDjjK3Lq69YsuZ8PvE=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.781623+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617654
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.781631+00:00'
|
||||
location:
|
||||
latitude: 50.088175
|
||||
longitude: 14.402125
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 00
|
||||
street_address: Malostranské nám. 13
|
||||
normalization_timestamp: '2025-12-09T06:50:47.066498+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:57:03.111057+00:00'
|
||||
source_url: https://tritius.amu.cz/library/damu
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://tritius.amu.cz/apple-touch-icon-180x180.png
|
||||
source_url: https://tritius.amu.cz/library/damu
|
||||
css_selector: '[document] > html > head > link:nth-of-type(14)'
|
||||
retrieved_on: '2025-12-23T21:57:03.111057+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: 180x180
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 14
|
||||
|
|
@@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: ACADEMIA, nakladatelství AV ČR
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB067
|
||||
locations:
|
||||
- street_address: Legerova 61
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.357057+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ANAC-academia_nakladatelstvi_av_cr
|
||||
ghcid_original: CZ-HL-PRA-L-ANAČ-academia_nakladatelstvi_av_cr
|
||||
ghcid_uuid: 6eefcdea-8b2b-53b0-8440-999db38fbe76
|
||||
ghcid_uuid_sha256: 7059f920-6ced-8800-b2b6-2dac6ecae7eb
|
||||
ghcid_numeric: 8095775722836920320
|
||||
record_id: 5855f68a-ce13-47d7-a9a4-4267d226f758
|
||||
generation_timestamp: '2025-12-06T23:37:14.357057+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ANAC-academia_nakladatelstvi_av_cr
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-ANAC-academia_nakladatelstvi_av_cr
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-ANAC-academia_nakladatelstvi_av_cr
|
||||
ghcid_numeric: 8095775722836920320
|
||||
valid_from: '2025-12-08T11:21:22.227365+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ANAČ-academia_nakladatelstvi_av_cr)'
|
||||
- ghcid: CZ-HL-PRA-L-ANAČ-academia_nakladatelstvi_av_cr
|
||||
ghcid_numeric: 583388196963575266
|
||||
valid_from: '2025-12-06T23:37:14.357057+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:22.227365+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ACADEMIA, nakladatelství AV ČR
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ANAC-academia_nakladatelstvi_av_cr
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 6eefcdea-8b2b-53b0-8440-999db38fbe76
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 7059f920-6ced-8800-b2b6-2dac6ecae7eb
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '8095775722836920320'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 5855f68a-ce13-47d7-a9a4-4267d226f758
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.799649+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ACADEMIA, nakladatelství AV ČR
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB067
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.357057+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Legerova 61
|
||||
normalization_timestamp: '2025-12-09T20:41:04.123334+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.123312+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-ANAC.yaml
Normal file
|
|
@@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: ACADEMIA, nakladatelství AV ČR
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB067
|
||||
locations:
|
||||
- street_address: Legerova 61
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.403081+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ANAC
|
||||
ghcid_original: CZ-HL-PRA-L-ANAČ
|
||||
ghcid_uuid: a8e8fa95-0774-5fc3-afff-70a2447670b9
|
||||
ghcid_uuid_sha256: f12b095c-1ccc-8763-9626-4e69a3314c32
|
||||
ghcid_numeric: 17377993878363981667
|
||||
record_id: 5f2390fe-8032-4741-b321-f806a60356df
|
||||
generation_timestamp: '2025-12-06T23:35:06.403081+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ANAC
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-ANAC
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-ANAC
|
||||
ghcid_numeric: 17377993878363981667
|
||||
valid_from: '2025-12-08T11:21:22.663796+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ANAČ)'
|
||||
- ghcid: CZ-HL-PRA-L-ANAČ
|
||||
ghcid_numeric: 15633231074025097142
|
||||
valid_from: '2025-12-06T23:35:06.403081+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:22.663796+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ACADEMIA, nakladatelství AV ČR
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ANAC
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: a8e8fa95-0774-5fc3-afff-70a2447670b9
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: f12b095c-1ccc-8763-9626-4e69a3314c32
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '17377993878363981667'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 5f2390fe-8032-4741-b321-f806a60356df
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.799649+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ACADEMIA, nakladatelství AV ČR
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB067
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799649+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.403081+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Legerova 61
|
||||
normalization_timestamp: '2025-12-09T20:41:04.179472+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.179459+00:00'
|
||||
167
data/custodian.backup.20251230/CZ-10-PRA-L-AP.yaml
Normal file
|
|
@ -0,0 +1,167 @@
|
|||
original_entry:
|
||||
name: Automobilový průmysl
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF015
|
||||
locations:
|
||||
- street_address: Rašínovo nábř. 42
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:23.820271+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AP
|
||||
ghcid_original: CZ-XX-PRA-L-AP
|
||||
ghcid_uuid: 490a6b0c-0f6f-5df9-bd2c-854da65d18f3
|
||||
ghcid_uuid_sha256: fb11807f-dfaf-898d-1f4a-ee52df3fd0b3
|
||||
ghcid_numeric: 18091382464826001805
|
||||
record_id: 98145ed4-5147-40d4-afe7-0a5b0ffb9e30
|
||||
generation_timestamp: '2025-12-06T23:37:23.820271+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:55.003894+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-XX-PRA-L-AP
|
||||
ghcid_numeric: 18091382464826001805
|
||||
valid_from: '2025-12-06T23:37:23.820271+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
- ghcid: CZ-10-PRA-L-AP
|
||||
valid_from: '2025-12-07T11:42:55.003929+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Automobilový průmysl
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-XX-PRA-L-AP
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 490a6b0c-0f6f-5df9-bd2c-854da65d18f3
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: fb11807f-dfaf-898d-1f4a-ee52df3fd0b3
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '18091382464826001805'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 98145ed4-5147-40d4-afe7-0a5b0ffb9e30
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.031447+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:16Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031447+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Automobilový průmysl
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031447+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031447+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031447+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF015
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.031447+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:23.820271+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Automotive Industry Association
|
||||
rejection_reason: The candidate is an industry association, not a heritage institution (museum, library, archive, or gallery).
|
||||
The Google Place types ('point_of_interest', 'establishment') do not match the expected heritage institution types.
|
||||
timestamp: '2025-12-08T19:49:15.880112+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Automobilový průmysl official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:16.250587+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Rašínovo nábř. 42
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.427319+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.427306+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-APS.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: AROMA Praha, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF317
|
||||
locations:
|
||||
- street_address: Novovysočanská 46
|
||||
postal_code: 190 00
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:24.083522+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-APS
|
||||
ghcid_original: CZ-HL-PRA-L-APS
|
||||
ghcid_uuid: 6fcce8bd-0a1d-525d-a914-b5357e0cbfa2
|
||||
ghcid_uuid_sha256: 659f4781-5cc9-8faa-3a35-2a0d9bcd8b7c
|
||||
ghcid_numeric: 7322650140060753834
|
||||
record_id: a84b633c-0086-4a3a-904f-463117833f08
|
||||
generation_timestamp: '2025-12-06T23:37:24.083522+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-APS
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-APS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-APS
|
||||
ghcid_numeric: 7322650140060753834
|
||||
valid_from: '2025-12-06T23:37:24.083522+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AROMA Praha, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-APS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 6fcce8bd-0a1d-525d-a914-b5357e0cbfa2
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 659f4781-5cc9-8faa-3a35-2a0d9bcd8b7c
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7322650140060753834'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: a84b633c-0086-4a3a-904f-463117833f08
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.033411+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.033411+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AROMA Praha, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.033411+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.033411+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.033411+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF317
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.033411+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:24.083522+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 190 00
|
||||
street_address: Novovysočanská 46
|
||||
normalization_timestamp: '2025-12-09T20:41:04.227218+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.227198+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-APSR.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: AGROPROJEKT Praha, s.r.o.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF415
|
||||
locations:
|
||||
- street_address: Ve Smečkách 33
|
||||
postal_code: 111 21
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.285031+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-APSR
|
||||
ghcid_original: CZ-HL-PRA-L-APSR
|
||||
ghcid_uuid: 8f29ce36-e21b-5f51-b040-04e873534248
|
||||
ghcid_uuid_sha256: 392e3f84-e307-84d6-3512-a6e0bf51a65b
|
||||
ghcid_numeric: 4120300549067703510
|
||||
record_id: d0de9f8b-915c-4405-927c-13cf8f6d2d15
|
||||
generation_timestamp: '2025-12-06T23:37:16.285031+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-APSR
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-APSR
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-APSR
|
||||
ghcid_numeric: 4120300549067703510
|
||||
valid_from: '2025-12-06T23:37:16.285031+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AGROPROJEKT Praha, s.r.o.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-APSR
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 8f29ce36-e21b-5f51-b040-04e873534248
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 392e3f84-e307-84d6-3512-a6e0bf51a65b
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '4120300549067703510'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: d0de9f8b-915c-4405-927c-13cf8f6d2d15
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.918112+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918112+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AGROPROJEKT Praha, s.r.o.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918112+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918112+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918112+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF415
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918112+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.285031+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 111 21
|
||||
street_address: Ve Smečkách 33
|
||||
normalization_timestamp: '2025-12-09T20:41:04.277933+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.277918+00:00'
|
||||
164
data/custodian.backup.20251230/CZ-10-PRA-L-APSRSPKDTK.yaml
Normal file
|
|
@ -0,0 +1,164 @@
|
|||
original_entry:
|
||||
name: AŽD Praha, s.r.o. - Systémy pro kolejovou dopravu - Technická knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF005
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '48029483'
|
||||
locations:
|
||||
- street_address: Žirovnická 2/3146
|
||||
postal_code: 106 17
|
||||
city: Praha 10
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.777262+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-APSRSPKDTK
|
||||
ghcid_original: CZ-HL-PRA-L-APSRSPKDTK
|
||||
ghcid_uuid: 73b6f6a0-6aa1-525e-9695-ad0aaf495b5e
|
||||
ghcid_uuid_sha256: 25d36358-6701-8854-8694-d3ca1d69880b
|
||||
ghcid_numeric: 2725631430825711700
|
||||
record_id: cf6b4e23-e9fa-4dc4-801e-1edbf526f27c
|
||||
generation_timestamp: '2025-12-06T23:37:15.777262+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-APSRSPKDTK
|
||||
valid_from: "2025-12-10T09:46:57Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-APSRSPKDTK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:57Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-APSRSPKDTK
|
||||
ghcid_numeric: 2725631430825711700
|
||||
valid_from: '2025-12-06T23:37:15.777262+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AŽD Praha, s.r.o. - Systémy pro kolejovou dopravu - Technická knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-APSRSPKDTK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 73b6f6a0-6aa1-525e-9695-ad0aaf495b5e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 25d36358-6701-8854-8694-d3ca1d69880b
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2725631430825711700'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: cf6b4e23-e9fa-4dc4-801e-1edbf526f27c
|
||||
- *id001
|
||||
- *id002
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914081+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914081+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AŽD Praha, s.r.o. - Systémy pro kolejovou dopravu - Technická knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914081+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914081+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 10
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914081+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF005
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914081+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.777262+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 106 17
|
||||
street_address: Žirovnická 2/3146
|
||||
normalization_timestamp: '2025-12-09T20:41:04.323764+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.323750+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-AS.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: ARMABETON, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF162
|
||||
locations:
|
||||
- street_address: Antala Staška 30
|
||||
postal_code: 147 00
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.098082+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AS
|
||||
ghcid_original: CZ-HL-PRA-L-AS
|
||||
ghcid_uuid: dd6aeb75-9246-5dec-ab3d-02b949e8a665
|
||||
ghcid_uuid_sha256: d68cdb6a-1317-8375-e0f0-11abb10bf0a4
|
||||
ghcid_numeric: 15459972869489394549
|
||||
record_id: 96bd21dd-759b-4b88-9a85-50f0ad328846
|
||||
generation_timestamp: '2025-12-06T23:37:16.098082+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AS
|
||||
ghcid_numeric: 15459972869489394549
|
||||
valid_from: '2025-12-06T23:37:16.098082+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ARMABETON, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: dd6aeb75-9246-5dec-ab3d-02b949e8a665
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: d68cdb6a-1317-8375-e0f0-11abb10bf0a4
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15459972869489394549'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 96bd21dd-759b-4b88-9a85-50f0ad328846
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.916496+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916496+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ARMABETON, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916496+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916496+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916496+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF162
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.916496+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.098082+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 147 00
|
||||
street_address: Antala Staška 30
|
||||
normalization_timestamp: '2025-12-09T20:41:04.374645+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.374619+00:00'
|
||||
260
data/custodian.backup.20251230/CZ-10-PRA-L-ASPK.yaml
Normal file
|
|
@@ -0,0 +1,260 @@
|
|||
original_entry:
|
||||
name: Arcibiskupský seminář Praha - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABD092
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 06522971
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://knihovna.seminar-praha.cz/
|
||||
identifier_url: https://knihovna.seminar-praha.cz/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114625124
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114625124
|
||||
locations:
|
||||
- street_address: Thákurova 676/3
|
||||
postal_code: 160 00
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.102269
|
||||
longitude: 14.387193
|
||||
processing_timestamp: '2025-12-06T23:37:43.284354+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ASPK
|
||||
ghcid_original: CZ-HL-PRA-L-ASPK
|
||||
ghcid_uuid: eda9c255-cddd-538b-a2f5-2a412cfafba0
|
||||
ghcid_uuid_sha256: b918706f-3041-82c9-f75d-478ef8db9ad6
|
||||
ghcid_numeric: 13337533919310975689
|
||||
record_id: 47bd51e5-a309-453b-ad45-227a6ca0a9a0
|
||||
generation_timestamp: '2025-12-06T23:37:43.284354+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ASPK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-ASPK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-ASPK
|
||||
ghcid_numeric: 13337533919310975689
|
||||
valid_from: '2025-12-06T23:37:43.284354+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Arcibiskupský seminář Praha - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ASPK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: eda9c255-cddd-538b-a2f5-2a412cfafba0
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b918706f-3041-82c9-f75d-478ef8db9ad6
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13337533919310975689'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 47bd51e5-a309-453b-ad45-227a6ca0a9a0
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.303750+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-ntHR4RBP4Mj51klRdUy7DKK0ApDvoeGLimWqigTPhaA=
|
||||
verified_at: '2025-12-28T19:58:22.869748+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625124
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.869781+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303750+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Arcibiskupský seminář Praha - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303750+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303750+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303750+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114625124
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114625124
|
||||
timestamp: '2025-11-19T10:14:30.303750+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABD092
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.303750+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:43.284354+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114625124
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:36.445708+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Arcibiskupský seminář Praha, knihovna
|
||||
wikidata_label_cs: Arcibiskupský seminář Praha, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q50379845
|
||||
label: religious library
|
||||
description: type of library with collections that focus on religion and related subjects
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2667290
|
||||
label: Prague 6
|
||||
description: municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:knihovna@seminar-praha.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-ntHR4RBP4Mj51klRdUy7DKK0ApDvoeGLimWqigTPhaA=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.869748+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625124
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.869770+00:00'
|
||||
location:
|
||||
latitude: 50.102269
|
||||
longitude: 14.387193
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 160 00
|
||||
street_address: Thákurova 676/3
|
||||
normalization_timestamp: '2025-12-09T06:50:47.206103+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:57:23.368831+00:00'
|
||||
source_url: https://knihovna.seminar-praha.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://knihovna.seminar-praha.cz/opac-tmpl/bootstrap/images/favicon.ico
|
||||
source_url: https://knihovna.seminar-praha.cz
|
||||
css_selector: '[document] > html.js.fonts-loaded > head > link:nth-of-type(3)'
|
||||
retrieved_on: '2025-12-23T21:57:23.368831+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/x-icon
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
|
|
@@ -0,0 +1,161 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ACIDOTECHNA, s.r.o.
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ACIDOTECHNA, s.r.o.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF096
|
||||
locations:
|
||||
- street_address: Michelská 12a
|
||||
postal_code: 145 01
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.953529+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ASR-acidotechna_sro
|
||||
ghcid_original: CZ-HL-PRA-L-ASR-acidotechna_sro
|
||||
ghcid_uuid: 9c2c2209-9d56-5375-bb53-09c75ba423ef
|
||||
ghcid_uuid_sha256: bce8eb61-ded2-8b28-874e-645c068f85be
|
||||
ghcid_numeric: 13612388679310035752
|
||||
record_id: 63f54d2a-db70-4c4c-b0ba-6fb873826750
|
||||
generation_timestamp: '2025-12-06T23:37:15.953529+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ASR-acidotechna_sro
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-ASR-acidotechna_sro
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-ASR-acidotechna_sro
|
||||
ghcid_numeric: 13612388679310035752
|
||||
valid_from: '2025-12-06T23:37:15.953529+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ASR-acidotechna_sro
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 9c2c2209-9d56-5375-bb53-09c75ba423ef
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: bce8eb61-ded2-8b28-874e-645c068f85be
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13612388679310035752'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 63f54d2a-db70-4c4c-b0ba-6fb873826750
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.915592+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.915592+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ACIDOTECHNA, s.r.o.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.915592+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.915592+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.915592+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF096
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.915592+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.953529+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 145 01
|
||||
street_address: Michelská 12a
|
||||
normalization_timestamp: '2025-12-09T20:41:04.420625+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.420612+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-ASR.yaml
Normal file
|
|
@@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: AGMECO, s.r.o.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC092
|
||||
locations:
|
||||
- street_address: Turkova 828
|
||||
postal_code: 149 00
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.565312+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ASR
|
||||
ghcid_original: CZ-HL-PRA-L-ASR
|
||||
ghcid_uuid: cd691ef7-1bfd-52ef-a488-72f631ad3a7e
|
||||
ghcid_uuid_sha256: 26b6bda7-4fa1-86bc-9c4c-4a233085c9ea
|
||||
ghcid_numeric: 2789625545495766716
|
||||
record_id: 6559bed9-6e1f-4aeb-b6b1-ac1f44c6317e
|
||||
generation_timestamp: '2025-12-06T23:37:14.565312+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ASR
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-ASR
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-ASR
|
||||
ghcid_numeric: 2789625545495766716
|
||||
valid_from: '2025-12-06T23:37:14.565312+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AGMECO, s.r.o.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ASR
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: cd691ef7-1bfd-52ef-a488-72f631ad3a7e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 26b6bda7-4fa1-86bc-9c4c-4a233085c9ea
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2789625545495766716'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 6559bed9-6e1f-4aeb-b6b1-ac1f44c6317e
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.803393+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803393+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AGMECO, s.r.o.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803393+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803393+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803393+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC092
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803393+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.565312+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 149 00
|
||||
street_address: Turkova 828
|
||||
normalization_timestamp: '2025-12-09T20:41:04.467290+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.467210+00:00'
|
||||
273
data/custodian.backup.20251230/CZ-10-PRA-L-ATPSK.yaml
Normal file
|
|
@@ -0,0 +1,273 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ATS-TELCOM PRAHA, a.s. - Knihovna
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ATS-TELCOM PRAHA, a.s. - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF153
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '61860409'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ61860409
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.atstelcom.cz/
|
||||
identifier_url: https://www.atstelcom.cz/
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617620
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617620
|
||||
locations:
|
||||
- street_address: Milíčova 553/14
|
||||
postal_code: 130 00
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.085483
|
||||
longitude: 14.451664
|
||||
- street_address: Nad elektrárnou 1526/45
|
||||
postal_code: 106 00
|
||||
city: Praha 10
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.059649
|
||||
longitude: 14.482379
|
||||
processing_timestamp: '2025-12-06T23:37:14.579079+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-ATPSK
|
||||
ghcid_original: CZ-HL-PRA-L-ATPSK
|
||||
ghcid_uuid: 5e551f13-d489-53fa-af0d-b6e82fa47d51
|
||||
ghcid_uuid_sha256: 7d900938-62fa-869d-3b96-9cb930692af6
|
||||
ghcid_numeric: 9047741789170710173
|
||||
record_id: 4645f24c-0e18-4d6f-91de-8f8c6d333731
|
||||
generation_timestamp: '2025-12-06T23:37:14.579079+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-ATPSK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-ATPSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-ATPSK
|
||||
ghcid_numeric: 9047741789170710173
|
||||
valid_from: '2025-12-06T23:37:14.579079+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-ATPSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 5e551f13-d489-53fa-af0d-b6e82fa47d51
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 7d900938-62fa-869d-3b96-9cb930692af6
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '9047741789170710173'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 4645f24c-0e18-4d6f-91de-8f8c6d333731
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.803514+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-Ai9x7D73cX3tVNjU3LVqIKD/eoQYNuqfN5tBF2LNVn0=
|
||||
verified_at: '2025-12-28T19:58:22.910555+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617620
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.910577+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803514+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ATS-TELCOM PRAHA, a.s. - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803514+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803514+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 3
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803514+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617620
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617620
|
||||
timestamp: '2025-11-19T10:14:29.803514+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF153
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803514+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.579079+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617620
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:44.488623+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: ATS-TELCOM PRAHA, a.s., knihovna
|
||||
en: ATS-TELCOM PRAHA, JSC, Library
|
||||
wikidata_label_en: ATS-TELCOM PRAHA, JSC, Library
|
||||
wikidata_label_cs: ATS-TELCOM PRAHA, a.s., knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q385994
|
||||
label: special library
|
||||
description: specialized library providing resources on a particular topic or discipline
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2444921
|
||||
label: Prague 10
|
||||
description: municipal part and administrative district of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:info@atstelcom.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-Ai9x7D73cX3tVNjU3LVqIKD/eoQYNuqfN5tBF2LNVn0=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.910555+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617620
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.910567+00:00'
|
||||
location:
|
||||
latitude: 50.085483
|
||||
longitude: 14.451664
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 130 00
|
||||
street_address: Milíčova 553/14
|
||||
normalization_timestamp: '2025-12-09T06:50:47.255819+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:57:36.669627+00:00'
|
||||
source_url: https://www.atstelcom.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.atstelcom.cz/wp-content/themes/atstelcom/images/favicon.png
|
||||
source_url: https://www.atstelcom.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(24)'
|
||||
retrieved_on: '2025-12-23T21:57:36.669627+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/png
|
||||
favicon_sizes: 64x64
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
268
data/custodian.backup.20251230/CZ-10-PRA-L-AUACPVVK.yaml
Normal file
|
|
@ -0,0 +1,268 @@
|
|||
original_entry:
|
||||
name: Archeologický ústav AV ČR, Praha, v. v. i. - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB001
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '67985912'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ67985912
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://aleph.lib.cas.cz/F?func=file&file_name=find-b&local_base=ARU
|
||||
identifier_url: https://aleph.lib.cas.cz/F?func=file&file_name=find-b&local_base=ARU
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617535
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617535
|
||||
locations:
|
||||
- street_address: Letenská 4
|
||||
postal_code: 118 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.090292
|
||||
longitude: 14.409381
|
||||
processing_timestamp: '2025-12-06T23:35:06.261010+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AUACPVVK
|
||||
ghcid_original: CZ-HL-PRA-L-AÚAČPVVK
|
||||
ghcid_uuid: 924b81b6-c531-53fb-bbaa-afd62e44a6cc
|
||||
ghcid_uuid_sha256: bd78b1c5-240f-874c-8893-89b8d58c855f
|
||||
ghcid_numeric: 13652857730645309260
|
||||
record_id: 9b0a2dd2-fb5b-4cba-87e5-bcb1cdbd0c78
|
||||
generation_timestamp: '2025-12-06T23:35:06.261010+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AUACPVVK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AUACPVVK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AUACPVVK
|
||||
ghcid_numeric: 13652857730645309260
|
||||
valid_from: '2025-12-08T11:21:41.171139+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AÚAČPVVK)'
|
||||
- ghcid: CZ-HL-PRA-L-AÚAČPVVK
|
||||
ghcid_numeric: 17938697631577694371
|
||||
valid_from: '2025-12-06T23:35:06.261010+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:41.171139+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Archeologický ústav AV ČR, Praha, v. v. i. - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AUACPVVK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 924b81b6-c531-53fb-bbaa-afd62e44a6cc
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: bd78b1c5-240f-874c-8893-89b8d58c855f
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13652857730645309260'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 9b0a2dd2-fb5b-4cba-87e5-bcb1cdbd0c78
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.796766+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-E7SnXARgb3OG9F55KshgrRA+rk285ZpEO2RGIY1V9ys=
|
||||
verified_at: '2025-12-28T19:58:22.931365+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617535
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:22.931385+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796766+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Archeologický ústav AV ČR, Praha, v. v. i. - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796766+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796766+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796766+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617535
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617535
|
||||
timestamp: '2025-11-19T10:14:29.796766+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB001
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796766+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.261010+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617535
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:26:52.524632+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P18
|
||||
- P463
|
||||
wikidata_labels:
|
||||
cs: Archeologický ústav AV ČR, Praha, v. v. i., knihovna
|
||||
en: Institute of Archaeology CAS, Prague, p. r. i., Library
|
||||
wikidata_label_en: Institute of Archaeology CAS, Prague, p. r. i., Library
|
||||
wikidata_label_cs: Archeologický ústav AV ČR, Praha, v. v. i., knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- ARÚ AV ČR
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q1438040
|
||||
label: research library
|
||||
description: library that collects materials on one or more subjects to support scholarly or scientific research
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_organization:
|
||||
member_of:
|
||||
- id: Q106192307
|
||||
label: Czech Republic Libraries Association
|
||||
description: organization in Czechia
|
||||
- id: Q4809890
|
||||
label: Association of Library and Information Professionals of the Czech Republic
|
||||
description: organization
|
||||
wikidata_media:
|
||||
image: Praha Mala Strana Letenska 4.JPG
|
||||
wikidata_image: Praha Mala Strana Letenska 4.JPG
|
||||
wikidata_contact:
|
||||
email: mailto:mvs-edd@arup.cas.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-E7SnXARgb3OG9F55KshgrRA+rk285ZpEO2RGIY1V9ys=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:22.931365+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617535
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:22.931375+00:00'
|
||||
location:
|
||||
latitude: 50.090292
|
||||
longitude: 14.409381
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 01
|
||||
street_address: Letenská 4
|
||||
normalization_timestamp: '2025-12-09T06:50:47.272728+00:00'
|
||||
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Astronomický ústav - pobočka Praha
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB003
|
||||
locations:
|
||||
- street_address: Boční II/1401a
|
||||
postal_code: 141 31
|
||||
city: Praha 4 - Spořilov
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.336723+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
ghcid_original: CZ-HL-PRA-L-AVČAÚPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
ghcid_uuid: 58f34098-2260-5a52-a55a-bd35658b9c4c
|
||||
ghcid_uuid_sha256: b3b20835-7309-8201-99b1-e47c28429836
|
||||
ghcid_numeric: 12948420904299880961
|
||||
record_id: bcb4ac09-49d9-4ea3-93dc-a45ac8db7b22
|
||||
generation_timestamp: '2025-12-06T23:37:14.336723+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
ghcid_numeric: 12948420904299880961
|
||||
valid_from: '2025-12-08T11:21:37.421254+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČAÚPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČAÚPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
ghcid_numeric: 7223755232174600416
|
||||
valid_from: '2025-12-06T23:37:14.336723+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:37.421254+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 58f34098-2260-5a52-a55a-bd35658b9c4c
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b3b20835-7309-8201-99b1-e47c28429836
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12948420904299880961'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: bcb4ac09-49d9-4ea3-93dc-a45ac8db7b22
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.796909+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4 - Spořilov
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB003
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.336723+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 141 31
|
||||
street_address: Boční II/1401a
|
||||
normalization_timestamp: '2025-12-09T20:41:04.527872+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.527796+00:00'
|
||||
166
data/custodian.backup.20251230/CZ-10-PRA-L-AVCAUPP.yaml
Normal file
|
|
@ -0,0 +1,166 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: Akademie věd ČR - Astronomický ústav - pobočka Praha
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB003
|
||||
locations:
|
||||
- street_address: Boční II/1401a
|
||||
postal_code: 141 31
|
||||
city: Praha 4 - Spořilov
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.267449+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCAUPP
|
||||
ghcid_original: CZ-HL-PRA-L-AVČAÚPP
|
||||
ghcid_uuid: 18967c5e-b5e8-590f-a534-aee5991f94d2
|
||||
ghcid_uuid_sha256: 9b3c35db-cf09-8a4e-bf7a-4869fbae9378
|
||||
ghcid_numeric: 11185874792669071950
|
||||
record_id: 25dfb5ba-2772-4315-97f5-9e6bff301b3e
|
||||
generation_timestamp: '2025-12-06T23:35:06.267449+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCAUPP
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AVCAUPP
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AVCAUPP
|
||||
ghcid_numeric: 11185874792669071950
|
||||
valid_from: '2025-12-08T11:21:31.097745+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČAÚPP)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČAÚPP
|
||||
ghcid_numeric: 8955671449216497236
|
||||
valid_from: '2025-12-06T23:35:06.267449+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.097745+00:00'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCAUPP
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 18967c5e-b5e8-590f-a534-aee5991f94d2
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 9b3c35db-cf09-8a4e-bf7a-4869fbae9378
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '11185874792669071950'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 25dfb5ba-2772-4315-97f5-9e6bff301b3e
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.796909+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4 - Spořilov
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB003
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796909+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.267449+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 141 31
|
||||
street_address: Boční II/1401a
|
||||
normalization_timestamp: '2025-12-09T20:41:04.585077+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.585057+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCEI.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Encyklopedický institut
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB007
|
||||
locations:
|
||||
- street_address: Nerudova 21
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.509175+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCEI
|
||||
ghcid_original: CZ-HL-PRA-L-AVČEI
|
||||
ghcid_uuid: 0db42277-531e-574c-b4d8-06d65682aec8
|
||||
ghcid_uuid_sha256: abe02825-c33c-8ab4-8761-1ed37589c252
|
||||
ghcid_numeric: 12384943117923306164
|
||||
record_id: 29def572-f83a-4df6-a71d-d7a953ebbaf3
|
||||
generation_timestamp: '2025-12-06T23:37:25.509175+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCEI
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCEI
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCEI
|
||||
ghcid_numeric: 12384943117923306164
|
||||
valid_from: '2025-12-08T11:21:22.756802+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČEI)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČEI
|
||||
ghcid_numeric: 1744362685845442236
|
||||
valid_from: '2025-12-06T23:37:25.509175+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:22.756802+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Encyklopedický institut
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCEI
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 0db42277-531e-574c-b4d8-06d65682aec8
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: abe02825-c33c-8ab4-8761-1ed37589c252
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12384943117923306164'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 29def572-f83a-4df6-a71d-d7a953ebbaf3
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046021+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046021+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Encyklopedický institut
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046021+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046021+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046021+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB007
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046021+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.509175+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Nerudova 21
|
||||
normalization_timestamp: '2025-12-09T20:41:04.641407+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.641362+00:00'
|
||||
|
|
@ -0,0 +1,174 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Farmakologický ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB009
|
||||
locations:
|
||||
- street_address: Vídeňská 1083
|
||||
postal_code: 142 20
|
||||
city: Praha 4 - Krč
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.339235+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCFU-akademie_ved_cr_farmakologicky_ustav
|
||||
ghcid_original: CZ-XX-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav
|
||||
ghcid_uuid: f96e7f32-5991-57f8-ac2b-03a29e61f6b4
|
||||
ghcid_uuid_sha256: 672371a3-e0d0-8931-b13f-7df5a5e66712
|
||||
ghcid_numeric: 7431908758756849969
|
||||
record_id: 0e02392d-7875-4d5d-8654-519be6319019
|
||||
generation_timestamp: '2025-12-06T23:37:14.339235+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:53.985901+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCFU-akademie_ved_cr_farmakologicky_ustav
|
||||
ghcid_numeric: 7431908758756849969
|
||||
valid_from: '2025-12-08T11:21:21.962848+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav
|
||||
ghcid_numeric: 1014122794529823696
|
||||
valid_from: '2025-12-06T23:37:14.339235+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.962848+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav
|
||||
valid_from: '2025-12-07T11:42:53.985912+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Farmakologický ústav
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCFU-akademie_ved_cr_farmakologicky_ustav
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: f96e7f32-5991-57f8-ac2b-03a29e61f6b4
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 672371a3-e0d0-8931-b13f-7df5a5e66712
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7431908758756849969'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 0e02392d-7875-4d5d-8654-519be6319019
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.797040+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:19Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Farmakologický ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4 - Krč
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB009
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.339235+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: The candidate 'Czech Academy of Sciences' is the overarching parent institution for the source 'Akademie
|
||||
věd ČR - Farmakologický ústav' (Institute of Pharmacology). They are not the same entity. Furthermore, the candidate's
|
||||
generic Google Place types ('point_of_interest', 'establishment') do not match the expected heritage custodian types (museum,
|
||||
library, archive, gallery). The entity type is not GRP.HER.
|
||||
timestamp: '2025-12-08T19:49:19.452977+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Farmakologický ústav official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:19.794617+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 142 20
|
||||
street_address: Vídeňská 1083
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.470017+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.469902+00:00'
|
||||
175
data/custodian.backup.20251230/CZ-10-PRA-L-AVCFU.yaml
Normal file
|
|
@ -0,0 +1,175 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Farmakologický ústav
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: Akademie věd ČR - Farmakologický ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB009
|
||||
locations:
|
||||
- street_address: Vídeňská 1083
|
||||
postal_code: 142 20
|
||||
city: Praha 4 - Krč
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.272726+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCFU
|
||||
ghcid_original: CZ-XX-PRA-L-AVČFÚ
|
||||
ghcid_uuid: 19446a92-0047-50e3-a29e-2b6727963ddd
|
||||
ghcid_uuid_sha256: 509c980d-7747-8252-ac32-02074845989a
|
||||
ghcid_numeric: 5808684803004277330
|
||||
record_id: 7f650e4c-32cf-48c2-96c7-c2e8a97b93a0
|
||||
generation_timestamp: '2025-12-06T23:35:06.272726+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:56.327508+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCFU
|
||||
ghcid_numeric: 5808684803004277330
|
||||
valid_from: '2025-12-08T11:21:22.541518+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČFÚ)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČFÚ
|
||||
ghcid_numeric: 10681739419161361008
|
||||
valid_from: '2025-12-06T23:35:06.272726+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:22.541518+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČFÚ
|
||||
valid_from: '2025-12-07T11:42:56.327520+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCFU
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 19446a92-0047-50e3-a29e-2b6727963ddd
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 509c980d-7747-8252-ac32-02074845989a
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '5808684803004277330'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 7f650e4c-32cf-48c2-96c7-c2e8a97b93a0
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.797040+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:23Z: Maps: rejected by LLM; YouTube:
|
||||
not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Farmakologický ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4 - Krč
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB009
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797040+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.272726+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: The Google Place types 'point_of_interest' and 'establishment'
|
||||
are too generic and do not match the expected types for a Heritage Institution
|
||||
(GRP.HER). The source institution is a research institute, which does not align
|
||||
with the definition of a heritage custodian (museums, archives, libraries, galleries).
|
||||
timestamp: '2025-12-08T19:49:22.801497+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Farmakologický ústav official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:23.146454+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 142 20
|
||||
street_address: Vídeňská 1083
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.518802+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.518791+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCLEB.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Laboratoř evoluční biologie
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB113
|
||||
locations:
|
||||
- street_address: Na Folimance 11
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.551825+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCLEB
|
||||
ghcid_original: CZ-HL-PRA-L-AVČLEB
|
||||
ghcid_uuid: e0d81fc2-7fba-594a-9c91-9346aecc6151
|
||||
ghcid_uuid_sha256: 07e2e7ff-3bd7-8da4-9c02-bdf92fed8815
|
||||
ghcid_numeric: 568271586408783268
|
||||
record_id: 0ebd10c7-c2bf-47f2-9f6c-a6ab26dacb10
|
||||
generation_timestamp: '2025-12-06T23:37:25.551825+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCLEB
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCLEB
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCLEB
|
||||
ghcid_numeric: 568271586408783268
|
||||
valid_from: '2025-12-08T11:21:21.322426+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČLEB)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČLEB
|
||||
ghcid_numeric: 6089402491441890915
|
||||
valid_from: '2025-12-06T23:37:25.551825+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.322426+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Laboratoř evoluční biologie
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCLEB
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: e0d81fc2-7fba-594a-9c91-9346aecc6151
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 07e2e7ff-3bd7-8da4-9c02-bdf92fed8815
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '568271586408783268'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 0ebd10c7-c2bf-47f2-9f6c-a6ab26dacb10
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046359+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Laboratoř evoluční biologie
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB113
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.551825+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Na Folimance 11
|
||||
normalization_timestamp: '2025-12-09T20:41:04.685021+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.684993+00:00'
|
||||
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Mikrobiologický ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB070
|
||||
locations:
|
||||
- street_address: Vídeňská 1083
|
||||
postal_code: 142 20
|
||||
city: Praha 4 - Krč
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.360633+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
|
||||
ghcid_original: CZ-HL-PRA-L-AVČMÚ-akademie_ved_cr_mikrobiologicky_ustav
|
||||
ghcid_uuid: a981b3fa-1bce-51ef-8dcb-102dd837a91d
|
||||
ghcid_uuid_sha256: 1df13b28-a632-8490-bb35-783b25a0b097
|
||||
ghcid_numeric: 2157570742260221072
|
||||
record_id: 7dd73bdf-0d97-4f80-8391-979cb3bf4e74
|
||||
generation_timestamp: '2025-12-06T23:37:14.360633+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
|
||||
ghcid_numeric: 2157570742260221072
|
||||
valid_from: '2025-12-08T11:21:40.263531+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČMÚ-akademie_ved_cr_mikrobiologicky_ustav)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČMÚ-akademie_ved_cr_mikrobiologicky_ustav
|
||||
ghcid_numeric: 11561309061492259634
|
||||
valid_from: '2025-12-06T23:37:14.360633+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:40.263531+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Mikrobiologický ústav
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: a981b3fa-1bce-51ef-8dcb-102dd837a91d
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 1df13b28-a632-8490-bb35-783b25a0b097
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2157570742260221072'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 7dd73bdf-0d97-4f80-8391-979cb3bf4e74
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.799712+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Mikrobiologický ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4 - Krč
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB070
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.360633+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 142 20
|
||||
street_address: Vídeňská 1083
|
||||
normalization_timestamp: '2025-12-09T20:41:04.735969+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.735953+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCMU.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Mikrobiologický ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB070
|
||||
locations:
|
||||
- street_address: Vídeňská 1083
|
||||
postal_code: 142 20
|
||||
city: Praha 4 - Krč
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.408370+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCMU
|
||||
ghcid_original: CZ-HL-PRA-L-AVČMÚ
|
||||
ghcid_uuid: 67eb014a-1a9a-5c33-b7b0-ba6334bbd2f4
|
||||
ghcid_uuid_sha256: dee515cd-cd8e-8bb7-bb0f-5779401058ef
|
||||
ghcid_numeric: 16061267619747896247
|
||||
record_id: e556633f-9974-4eff-8494-d78123ceb59e
|
||||
generation_timestamp: '2025-12-06T23:35:06.408370+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCMU
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCMU
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCMU
|
||||
ghcid_numeric: 16061267619747896247
|
||||
valid_from: '2025-12-08T11:21:21.267751+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČMÚ)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČMÚ
|
||||
ghcid_numeric: 17106076247160859197
|
||||
valid_from: '2025-12-06T23:35:06.408370+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.267751+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Mikrobiologický ústav
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCMU
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 67eb014a-1a9a-5c33-b7b0-ba6334bbd2f4
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: dee515cd-cd8e-8bb7-bb0f-5779401058ef
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '16061267619747896247'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: e556633f-9974-4eff-8494-d78123ceb59e
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.799712+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Mikrobiologický ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4 - Krč
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB070
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799712+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.408370+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 142 20
|
||||
street_address: Vídeňská 1083
|
||||
normalization_timestamp: '2025-12-09T20:41:04.783450+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.783435+00:00'
|
||||
|
|
@ -0,0 +1,175 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Patentové a licenční služby
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB034
|
||||
locations:
|
||||
- street_address: Českomalínská 25
|
||||
postal_code: 160 70
|
||||
city: Praha 6
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.342675+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCPLS-akademie_ved_cr_patentove_a_licencni_sluzby
|
||||
ghcid_original: CZ-XX-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby
|
||||
ghcid_uuid: 77fc1dea-cb44-5f17-b700-0fa091c6d43e
|
||||
ghcid_uuid_sha256: cf64099a-0441-8f94-9b74-a037a0a9bce2
|
||||
ghcid_numeric: 14944080020623171476
|
||||
record_id: 20afb6b3-2f0b-40bc-aa0f-ff7a2c1a78a7
|
||||
generation_timestamp: '2025-12-06T23:37:14.342675+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:54.008704+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCPLS-akademie_ved_cr_patentove_a_licencni_sluzby
|
||||
ghcid_numeric: 14944080020623171476
|
||||
valid_from: '2025-12-08T11:21:38.962454+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby
|
||||
ghcid_numeric: 2477309062137844976
|
||||
valid_from: '2025-12-06T23:37:14.342675+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:38.962454+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby
|
||||
valid_from: '2025-12-07T11:42:54.008741+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Patentové a licenční služby
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCPLS-akademie_ved_cr_patentove_a_licencni_sluzby
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 77fc1dea-cb44-5f17-b700-0fa091c6d43e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: cf64099a-0441-8f94-9b74-a037a0a9bce2
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '14944080020623171476'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 20afb6b3-2f0b-40bc-aa0f-ff7a2c1a78a7
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.797854+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:26Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Patentové a licenční služby
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB034
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.342675+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: 'REJECT: The source name indicates a specific department (Patent and Licensing Services) of the Czech
|
||||
Academy of Sciences (Akademie věd ČR). The Google Maps result is for the parent institution, the Czech Academy of Sciences
|
||||
itself, which is a broad scientific research organization, not a specific heritage custodian unit. Critically, the Google
|
||||
Place types (''point_of_interest'', ''establishment'') do not match any of the expected heritage custodian types (e.g.,
|
||||
library, museum, archive).'
|
||||
timestamp: '2025-12-08T19:49:26.082220+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Patentové a licenční služby official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:26.425308+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 160 70
|
||||
street_address: Českomalínská 25
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.576931+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.576911+00:00'
|
||||
173
data/custodian.backup.20251230/CZ-10-PRA-L-AVCPLS.yaml
Normal file
|
|
@ -0,0 +1,173 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Patentové a licenční služby
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB034
|
||||
locations:
|
||||
- street_address: Českomalínská 25
|
||||
postal_code: 160 70
|
||||
city: Praha 6
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.317887+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCPLS
|
||||
ghcid_original: CZ-XX-PRA-L-AVČPLS
|
||||
ghcid_uuid: 736468df-5da2-5944-9de8-403b76877f1c
|
||||
ghcid_uuid_sha256: eb24cd69-68ef-8a64-b846-ef07973b1e82
|
||||
ghcid_numeric: 16943893550690450020
|
||||
record_id: 3b727510-8b1e-4cab-899d-ab014a488899
|
||||
generation_timestamp: '2025-12-06T23:35:06.317887+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:54.307666+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCPLS
|
||||
ghcid_numeric: 16943893550690450020
|
||||
valid_from: '2025-12-08T11:21:31.031504+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPLS)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČPLS
|
||||
ghcid_numeric: 14931995940768973646
|
||||
valid_from: '2025-12-06T23:35:06.317887+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.031504+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČPLS
|
||||
valid_from: '2025-12-07T11:42:54.307676+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Patentové a licenční služby
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCPLS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 736468df-5da2-5944-9de8-403b76877f1c
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: eb24cd69-68ef-8a64-b846-ef07973b1e82
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '16943893550690450020'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 3b727510-8b1e-4cab-899d-ab014a488899
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.797854+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:33Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Patentové a licenční služby
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB034
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.797854+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.317887+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: 'REJECT: The candidate matches by name and location, but the Google Place Types (''point_of_interest'',
|
||||
''establishment'') do not match the expected heritage types (museum, library, archive, gallery). The institution is a
|
||||
national academy of sciences, not a heritage custodian like a museum or library.'
|
||||
timestamp: '2025-12-08T19:49:33.088455+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Patentové a licenční služby official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:33.450514+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 160 70
|
||||
street_address: Českomalínská 25
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.629000+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.628980+00:00'
|
||||
175
data/custodian.backup.20251230/CZ-10-PRA-L-AVCPU.yaml
Normal file
|
|
@ -0,0 +1,175 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Prognostický ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB112
|
||||
locations:
|
||||
- street_address: Opletalova 19
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:23.658421+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCPU
|
||||
ghcid_original: CZ-XX-PRA-L-AVČPÚ
|
||||
ghcid_uuid: 8bb4e75c-a21b-5838-acb6-5ff445aed08a
|
||||
ghcid_uuid_sha256: 5de04df0-b47f-8a74-a7f8-a9bb971668b9
|
||||
ghcid_numeric: 6764492336526219892
|
||||
record_id: 018d704c-40a7-4314-802e-d1657a6ef48d
|
||||
generation_timestamp: '2025-12-06T23:37:23.658421+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:55.506888+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCPU
|
||||
ghcid_numeric: 6764492336526219892
|
||||
valid_from: '2025-12-08T11:21:21.976876+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPÚ)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČPÚ
|
||||
ghcid_numeric: 4769545273226280928
|
||||
valid_from: '2025-12-06T23:37:23.658421+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.976876+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČPÚ
|
||||
valid_from: '2025-12-07T11:42:55.506915+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Prognostický ústav
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCPU
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 8bb4e75c-a21b-5838-acb6-5ff445aed08a
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 5de04df0-b47f-8a74-a7f8-a9bb971668b9
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '6764492336526219892'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 018d704c-40a7-4314-802e-d1657a6ef48d
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.030652+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:37Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030652+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Prognostický ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030652+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030652+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030652+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB112
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030652+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:23.658421+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: 'NAME MATCH: The source ''Akademie věd ČR - Prognostický ústav'' (Czech Academy of Sciences - Prognostic
|
||||
Institute) is a specific research institute. The candidate ''Czech Academy of Sciences'' is the parent umbrella organization.
|
||||
They are not the same institution. TYPE MATCH: The Google Place types ''point_of_interest'' and ''establishment'' do not
|
||||
match any of the expected types for a heritage institution (e.g., museum, library, gallery). ENTITY TYPE: The candidate
|
||||
is an umbrella organization for a national academy of sciences and does not function as a heritage institution.'
|
||||
timestamp: '2025-12-08T19:49:36.877469+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Prognostický ústav official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:37.207233+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Opletalova 19
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.679222+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.679209+00:00'
|
||||
175
data/custodian.backup.20251230/CZ-10-PRA-L-AVCPUJK.yaml
Normal file
|
|
@ -0,0 +1,175 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Pedagogický ústav J.A.Komenského
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB011
|
||||
locations:
|
||||
- street_address: Máchova 7
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:23.639904+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCPUJK
|
||||
ghcid_original: CZ-XX-PRA-L-AVČPÚJK
|
||||
ghcid_uuid: 6c0c0023-d809-5af7-9792-3a88280b05cc
|
||||
ghcid_uuid_sha256: c7d810dd-72ec-871c-87f0-147e22c4f1b9
|
||||
ghcid_numeric: 14400278351819056924
|
||||
record_id: b51bbf12-d5c6-492c-9a1a-f19d1d6b61e7
|
||||
generation_timestamp: '2025-12-06T23:37:23.639904+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:54.224540+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCPUJK
|
||||
ghcid_numeric: 14400278351819056924
|
||||
valid_from: '2025-12-08T11:21:29.200637+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPÚJK)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČPÚJK
|
||||
ghcid_numeric: 11911444671767460800
|
||||
valid_from: '2025-12-06T23:37:23.639904+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:29.200637+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČPÚJK
|
||||
valid_from: '2025-12-07T11:42:54.224551+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Pedagogický ústav J.A.Komenského
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCPUJK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 6c0c0023-d809-5af7-9792-3a88280b05cc
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: c7d810dd-72ec-871c-87f0-147e22c4f1b9
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '14400278351819056924'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: b51bbf12-d5c6-492c-9a1a-f19d1d6b61e7
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.030561+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:41Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030561+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Pedagogický ústav J.A.Komenského
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030561+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030561+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030561+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB011
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030561+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:23.639904+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: This is a mismatch. The source is a specific research institute within the Czech Academy of Sciences.
|
||||
The Google Maps candidate represents the overarching parent organization, the Czech Academy of Sciences, located at its
|
||||
headquarters. The candidate's Google Place types ('point_of_interest', 'establishment') are generic and do not match the
|
||||
expected types for a heritage institution. Furthermore, the website links to the main academy, not the specific Pedagogical
|
||||
Institute.
|
||||
timestamp: '2025-12-08T19:49:41.002827+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Pedagogický ústav J.A.Komenského official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:41.371035+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Máchova 7
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.718568+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.718556+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCUAC.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav anorganické chemie
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB035
|
||||
locations:
|
||||
- street_address: Pelléova 24
|
||||
postal_code: 160 00
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.544411+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUAC
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚAC
|
||||
ghcid_uuid: c224a937-aaa3-5581-ae12-af86dda848f4
|
||||
ghcid_uuid_sha256: 4644be0d-f1e9-85fe-a554-49213c174ac9
|
||||
ghcid_numeric: 5063380848173729278
|
||||
record_id: 76769554-a057-44b4-b2bf-00c3126e879d
|
||||
generation_timestamp: '2025-12-06T23:37:25.544411+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUAC
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUAC
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUAC
|
||||
ghcid_numeric: 5063380848173729278
|
||||
valid_from: '2025-12-08T11:21:27.871544+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚAC)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚAC
|
||||
ghcid_numeric: 9877480588037009223
|
||||
valid_from: '2025-12-06T23:37:25.544411+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:27.871544+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav anorganické chemie
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUAC
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: c224a937-aaa3-5581-ae12-af86dda848f4
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 4644be0d-f1e9-85fe-a554-49213c174ac9
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '5063380848173729278'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 76769554-a057-44b4-b2bf-00c3126e879d
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046296+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046296+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav anorganické chemie
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046296+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046296+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046296+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB035
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046296+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.544411+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 160 00
|
||||
street_address: Pelléova 24
|
||||
normalization_timestamp: '2025-12-09T20:41:04.836786+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.836764+00:00'
|
||||
172
data/custodian.backup.20251230/CZ-10-PRA-L-AVCUFM.yaml
Normal file
|
|
@ -0,0 +1,172 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav fyzikální metalurgie
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB057
|
||||
locations:
|
||||
- street_address: Boční II/1401
|
||||
postal_code: 141 31
|
||||
city: Praha 4
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:23.652645+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUFM
|
||||
ghcid_original: CZ-XX-PRA-L-AVČÚFM
|
||||
ghcid_uuid: e3553f52-4b71-579e-8949-7ed9a14d7989
|
||||
ghcid_uuid_sha256: 56d2fc1d-5840-8332-b8c5-11f406f5728b
|
||||
ghcid_numeric: 6256340035335910194
|
||||
record_id: fa035dcd-5e79-4d93-8774-4d980c99cdf7
|
||||
generation_timestamp: '2025-12-06T23:37:23.652645+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:54.761825+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUFM
|
||||
ghcid_numeric: 6256340035335910194
|
||||
valid_from: '2025-12-08T11:21:29.451072+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČÚFM)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČÚFM
|
||||
ghcid_numeric: 14246368355740162447
|
||||
valid_from: '2025-12-06T23:37:23.652645+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:29.451072+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČÚFM
|
||||
valid_from: '2025-12-07T11:42:54.761835+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav fyzikální metalurgie
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUFM
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: e3553f52-4b71-579e-8949-7ed9a14d7989
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 56d2fc1d-5840-8332-b8c5-11f406f5728b
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '6256340035335910194'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: fa035dcd-5e79-4d93-8774-4d980c99cdf7
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.030634+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:44Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030634+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav fyzikální metalurgie
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030634+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030634+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030634+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB057
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.030634+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:23.652645+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Institute of Physics of Materials ASCR
|
||||
rejection_reason: 'REJECT: The candidate is a research institute. Its Google Place types (''point_of_interest'', ''establishment'')
|
||||
do not match the expected heritage institution types (museum, library, gallery).'
|
||||
timestamp: '2025-12-08T19:49:44.300650+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Ústav fyzikální metalurgie official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:44.639812+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 141 31
|
||||
street_address: Boční II/1401
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.765484+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.765473+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCUFR.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav fyziologických regulací
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB025
|
||||
locations:
|
||||
- street_address: Na Truhlářce 2
|
||||
postal_code: 180 00
|
||||
city: Praha 8 - Libeň
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.531267+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUFR
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚFR
|
||||
ghcid_uuid: 795dc63d-7b17-5de0-80be-3a319ae1298d
|
||||
ghcid_uuid_sha256: 86714f47-d21f-8eea-a4be-06373e19a4df
|
||||
ghcid_numeric: 9687611443337182954
|
||||
record_id: 8f9b21ff-fd8d-4247-8b70-05a3b5ee62b8
|
||||
generation_timestamp: '2025-12-06T23:37:25.531267+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUFR
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUFR
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUFR
|
||||
ghcid_numeric: 9687611443337182954
|
||||
valid_from: '2025-12-08T11:21:31.235957+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚFR)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚFR
|
||||
ghcid_numeric: 18029943413411395802
|
||||
valid_from: '2025-12-06T23:37:25.531267+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.235957+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav fyziologických regulací
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUFR
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 795dc63d-7b17-5de0-80be-3a319ae1298d
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 86714f47-d21f-8eea-a4be-06373e19a4df
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '9687611443337182954'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 8f9b21ff-fd8d-4247-8b70-05a3b5ee62b8
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046197+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav fyziologických regulací
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 8 - Libeň
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB025
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.531267+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 180 00
|
||||
street_address: Na Truhlářce 2
|
||||
normalization_timestamp: '2025-12-09T20:41:04.897937+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.897911+00:00'
|
||||
|
|
@ -0,0 +1,166 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav molekulární genetiky
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: Akademie věd ČR - Ústav molekulární genetiky
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB110
|
||||
locations:
|
||||
- street_address: Flemingovo nám. 2
|
||||
postal_code: 166 37
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.369691+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
ghcid_uuid: 7f2b5df4-8c0a-5167-ba84-91432ff7ac7a
|
||||
ghcid_uuid_sha256: 5af43a46-4a31-8013-b858-d13011f05f40
|
||||
ghcid_numeric: 6553927431297822739
|
||||
record_id: 016cc72d-03a0-4072-9ae0-f0134a15c21c
|
||||
generation_timestamp: '2025-12-06T23:37:14.369691+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
ghcid_numeric: 6553927431297822739
|
||||
valid_from: '2025-12-08T11:21:31.754415+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚMG-akademie_ved_cr_ustav_molekularni_genetiky)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
ghcid_numeric: 160046891344919514
|
||||
valid_from: '2025-12-06T23:37:14.369691+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.754415+00:00'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 7f2b5df4-8c0a-5167-ba84-91432ff7ac7a
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 5af43a46-4a31-8013-b858-d13011f05f40
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '6553927431297822739'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 016cc72d-03a0-4072-9ae0-f0134a15c21c
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.800225+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav molekulární genetiky
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB110
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.369691+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 166 37
|
||||
street_address: Flemingovo nám. 2
|
||||
normalization_timestamp: '2025-12-09T20:41:04.955642+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:04.955627+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCUMG.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav molekulární genetiky
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB110
|
||||
locations:
|
||||
- street_address: Flemingovo nám. 2
|
||||
postal_code: 166 37
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.436939+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUMG
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚMG
|
||||
ghcid_uuid: 695bf0ce-d479-5c58-a6a8-1b631a99e4be
|
||||
ghcid_uuid_sha256: 27fc2512-1330-8d25-90f5-fcd2998c2f3a
|
||||
ghcid_numeric: 2881218621171850533
|
||||
record_id: c6c9b1f9-edfd-4e00-b83d-282e4bb3cd36
|
||||
generation_timestamp: '2025-12-06T23:35:06.436939+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUMG
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUMG
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUMG
|
||||
ghcid_numeric: 2881218621171850533
|
||||
valid_from: '2025-12-08T11:21:21.745302+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚMG)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚMG
|
||||
ghcid_numeric: 3450507008958569248
|
||||
valid_from: '2025-12-06T23:35:06.436939+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.745302+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav molekulární genetiky
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUMG
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 695bf0ce-d479-5c58-a6a8-1b631a99e4be
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 27fc2512-1330-8d25-90f5-fcd2998c2f3a
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2881218621171850533'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: c6c9b1f9-edfd-4e00-b83d-282e4bb3cd36
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.800225+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav molekulární genetiky
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB110
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800225+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.436939+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 166 37
|
||||
street_address: Flemingovo nám. 2
|
||||
normalization_timestamp: '2025-12-09T20:41:05.002294+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.002273+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCUPCSLON.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav pro českou a světovou literaturu - odd. neslovanských literatur
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB039
|
||||
locations:
|
||||
- street_address: nám. Republiky 1
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.547650+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUPCSLON
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚPČSLON
|
||||
ghcid_uuid: 612e600a-6c13-59da-bff4-07b2a007e568
|
||||
ghcid_uuid_sha256: 7537c6f3-aa62-86aa-8cbc-a34535e1b4bd
|
||||
ghcid_numeric: 8446438375994599082
|
||||
record_id: f2eec84d-8531-4e23-857e-6c002da3fec9
|
||||
generation_timestamp: '2025-12-06T23:37:25.547650+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUPCSLON
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUPCSLON
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUPCSLON
|
||||
ghcid_numeric: 8446438375994599082
|
||||
valid_from: '2025-12-08T11:21:22.066165+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚPČSLON)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚPČSLON
|
||||
ghcid_numeric: 6149932590907325350
|
||||
valid_from: '2025-12-06T23:37:25.547650+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:22.066165+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav pro českou a světovou literaturu - odd. neslovanských literatur
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUPCSLON
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 612e600a-6c13-59da-bff4-07b2a007e568
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 7537c6f3-aa62-86aa-8cbc-a34535e1b4bd
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '8446438375994599082'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: f2eec84d-8531-4e23-857e-6c002da3fec9
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046318+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046318+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav pro českou a světovou literaturu - odd. neslovanských literatur
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046318+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046318+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046318+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB039
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046318+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.547650+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: nám. Republiky 1
|
||||
normalization_timestamp: '2025-12-09T20:41:05.061486+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.061461+00:00'
|
||||
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB042
|
||||
locations:
|
||||
- street_address: Dolejškova 5
|
||||
postal_code: 182 02
|
||||
city: Praha 8
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.348347+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
ghcid_uuid: f377a24a-a2c3-5754-90e7-22295a9a5022
|
||||
ghcid_uuid_sha256: 2915a1fc-c781-8a6f-beb9-0629b4602eb7
|
||||
ghcid_numeric: 2960450437116938863
|
||||
record_id: fb760fd0-96d2-48eb-b2cb-e7974123d634
|
||||
generation_timestamp: '2025-12-06T23:37:14.348347+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
ghcid_numeric: 2960450437116938863
|
||||
valid_from: '2025-12-08T11:21:33.284297+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
ghcid_numeric: 16206133009206881437
|
||||
valid_from: '2025-12-06T23:37:14.348347+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:33.284297+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: f377a24a-a2c3-5754-90e7-22295a9a5022
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 2915a1fc-c781-8a6f-beb9-0629b4602eb7
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2960450437116938863'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: fb760fd0-96d2-48eb-b2cb-e7974123d634
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.798272+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 8
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB042
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.348347+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 182 02
|
||||
street_address: Dolejškova 5
|
||||
normalization_timestamp: '2025-12-09T20:41:05.107965+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.107942+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-AVCUPEK.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB042
|
||||
locations:
|
||||
- street_address: Dolejškova 5
|
||||
postal_code: 182 02
|
||||
city: Praha 8
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.339629+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCUPEK
|
||||
ghcid_original: CZ-HL-PRA-L-AVČÚPEK
|
||||
ghcid_uuid: 77caf563-ecf0-5f59-9224-55acb0d0ff4e
|
||||
ghcid_uuid_sha256: e7120276-cc52-8cfb-9757-fb99a4b53d77
|
||||
ghcid_numeric: 16650373481599474939
|
||||
record_id: 9d388e7a-9745-439a-a98a-886da098a45a
|
||||
generation_timestamp: '2025-12-06T23:35:06.339629+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCUPEK
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUPEK
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AVCUPEK
|
||||
ghcid_numeric: 16650373481599474939
|
||||
valid_from: '2025-12-08T11:21:40.439155+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚPEK)'
|
||||
- ghcid: CZ-HL-PRA-L-AVČÚPEK
|
||||
ghcid_numeric: 134491386058511181
|
||||
valid_from: '2025-12-06T23:35:06.339629+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:40.439155+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCUPEK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 77caf563-ecf0-5f59-9224-55acb0d0ff4e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: e7120276-cc52-8cfb-9757-fb99a4b53d77
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '16650373481599474939'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 9d388e7a-9745-439a-a98a-886da098a45a
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.798272+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 8
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB042
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.798272+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.339629+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 182 02
|
||||
street_address: Dolejškova 5
|
||||
normalization_timestamp: '2025-12-09T20:41:05.152929+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.152904+00:00'
|
||||
|
|
@ -0,0 +1,174 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Vývojové dílny
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB058
|
||||
locations:
|
||||
- street_address: Pod vodárenskou věží 4
|
||||
postal_code: 180 00
|
||||
city: Praha 8
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.351182+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCVD-akademie_ved_cr_vyvojove_dilny
|
||||
ghcid_original: CZ-XX-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny
|
||||
ghcid_uuid: ccd8c551-111d-5004-8893-af60cb91a370
|
||||
ghcid_uuid_sha256: c93d3ed6-02d1-864b-b032-a65e83112669
|
||||
ghcid_numeric: 14500815464094078539
|
||||
record_id: 168e29e2-f2d5-4d1b-8e85-f0f07f31731c
|
||||
generation_timestamp: '2025-12-06T23:37:14.351182+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:54.977370+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCVD-akademie_ved_cr_vyvojove_dilny
|
||||
ghcid_numeric: 14500815464094078539
|
||||
valid_from: '2025-12-08T11:21:41.358002+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny
|
||||
ghcid_numeric: 2362682881202093387
|
||||
valid_from: '2025-12-06T23:37:14.351182+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:41.358002+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny
|
||||
valid_from: '2025-12-07T11:42:54.977381+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Vývojové dílny
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCVD-akademie_ved_cr_vyvojove_dilny
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ccd8c551-111d-5004-8893-af60cb91a370
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: c93d3ed6-02d1-864b-b032-a65e83112669
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '14500815464094078539'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 168e29e2-f2d5-4d1b-8e85-f0f07f31731c
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.799317+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:48Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Vývojové dílny
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 8
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB058
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.351182+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: 'TYPE MISMATCH: The Google Place candidate is the Czech Academy of Sciences, a large research institution.
|
||||
Its Google Place Types are generic (''point_of_interest'', ''establishment'') and do not match any of the required heritage
|
||||
institution types (museum, library, gallery). Although there is a strong name and location match with the source, it does
|
||||
not fit the GLAM (Heritage Custodian) definition.'
|
||||
timestamp: '2025-12-08T19:49:48.074229+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Vývojové dílny official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:48.416534+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 180 00
|
||||
street_address: Pod vodárenskou věží 4
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.807961+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.807938+00:00'
|
||||
175
data/custodian.backup.20251230/CZ-10-PRA-L-AVCVD.yaml
Normal file
|
|
@ -0,0 +1,175 @@
|
|||
original_entry:
|
||||
name: Akademie věd ČR - Vývojové dílny
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB058
|
||||
locations:
|
||||
- street_address: Pod vodárenskou věží 4
|
||||
postal_code: 180 00
|
||||
city: Praha 8
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:35:06.382577+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVCVD
|
||||
ghcid_original: CZ-XX-PRA-L-AVČVD
|
||||
ghcid_uuid: ba3d163b-c510-5878-8885-7fedeec11661
|
||||
ghcid_uuid_sha256: bfd1764c-38e3-82d4-aac0-9cf26c2dfdd2
|
||||
ghcid_numeric: 13821958801120805588
|
||||
record_id: d803d048-5d6a-41f2-ae30-b243e288d372
|
||||
generation_timestamp: '2025-12-06T23:35:06.382577+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: '10'
|
||||
city_code: PRA
|
||||
method: GEONAMES_CITY_LOOKUP
|
||||
region_name: CZ-10
|
||||
geonames_id: 8629195
|
||||
resolution_timestamp: '2025-12-07T11:42:55.652198+00:00'
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVCVD
|
||||
ghcid_numeric: 13821958801120805588
|
||||
valid_from: '2025-12-08T11:21:37.438639+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČVD)'
|
||||
- ghcid: CZ-XX-PRA-L-AVČVD
|
||||
ghcid_numeric: 215626475134910548
|
||||
valid_from: '2025-12-06T23:35:06.382577+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:37.438639+00:00'
|
||||
- ghcid: CZ-10-PRA-L-AVČVD
|
||||
valid_from: '2025-12-07T11:42:55.652217+00:00'
|
||||
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie věd ČR - Vývojové dílny
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVCVD
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ba3d163b-c510-5878-8885-7fedeec11661
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: bfd1764c-38e3-82d4-aac0-9cf26c2dfdd2
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13821958801120805588'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: d803d048-5d6a-41f2-ae30-b243e288d372
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.799317+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:52Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie věd ČR - Vývojové dílny
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 8
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB058
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.799317+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.382577+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Czech Academy of Sciences
|
||||
rejection_reason: 'TYPE MISMATCH: The Google Maps candidate ''Czech Academy of Sciences'' is a national research institution,
|
||||
not a specific museum, library, or archive. Its Google types (''point_of_interest'', ''establishment'') are too general
|
||||
and do not match the expected heritage types (e.g., museum, library). The source name ''Akademie věd ČR - Vývojové dílny''
|
||||
(Academy of Sciences of the Czech Republic - Development Workshops) also suggests a research or technical department,
|
||||
further confirming it is not a public-facing heritage institution.'
|
||||
timestamp: '2025-12-08T19:49:52.339515+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Akademie věd ČR - Vývojové dílny official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:52.671667+00:00'
|
||||
location:
|
||||
city: Prague
|
||||
region: CZ-10
|
||||
region_code: '10'
|
||||
country: CZ
|
||||
postal_code: 180 00
|
||||
street_address: Pod vodárenskou věží 4
|
||||
geonames_id: 3067696
|
||||
normalization_timestamp: '2025-12-09T20:40:58.845752+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.845740+00:00'
|
||||
280
data/custodian.backup.20251230/CZ-10-PRA-L-AVSSK.yaml
Normal file
|
|
@@ -0,0 +1,280 @@
|
|||
original_entry:
|
||||
name: AMBIS vysoká škola, a.s - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABD030
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '61858307'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ61858307
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.ambis.cz
|
||||
identifier_url: https://www.ambis.cz
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114619496
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619496
|
||||
locations:
|
||||
- street_address: Lindnerova 575/1
|
||||
postal_code: 180 00
|
||||
city: Praha 8 - Libeň
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.111828
|
||||
longitude: 14.472999
|
||||
processing_timestamp: '2025-12-06T23:37:26.591017+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVSSK
|
||||
ghcid_original: CZ-HL-PRA-L-AVŠSK
|
||||
ghcid_uuid: a2942158-3abe-500e-a42e-47e3d802cffb
|
||||
ghcid_uuid_sha256: dbf6e478-f670-8c95-b621-5141084dc9f8
|
||||
ghcid_numeric: 15850107146758818965
|
||||
record_id: b7bdc0a9-70d9-484f-bff6-eef614485765
|
||||
generation_timestamp: '2025-12-06T23:37:26.591017+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVSSK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AVSSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AVSSK
|
||||
ghcid_numeric: 15850107146758818965
|
||||
valid_from: '2025-12-08T11:21:36.402400+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVŠSK)'
|
||||
- ghcid: CZ-HL-PRA-L-AVŠSK
|
||||
ghcid_numeric: 11709857600197536560
|
||||
valid_from: '2025-12-06T23:37:26.591017+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:36.402400+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AMBIS vysoká škola, a.s - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVSSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: a2942158-3abe-500e-a42e-47e3d802cffb
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: dbf6e478-f670-8c95-b621-5141084dc9f8
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15850107146758818965'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: b7bdc0a9-70d9-484f-bff6-eef614485765
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.055665+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-tDge/qPcehBzQ4ZL0RmrfaaoiFExgARmcRFJPWdLs/g=
|
||||
verified_at: '2025-12-28T19:58:23.178566+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619496
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.178587+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.055665+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AMBIS vysoká škola, a.s - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.055665+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.055665+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 8 - Libeň
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.055665+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114619496
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114619496
|
||||
timestamp: '2025-11-19T10:14:30.055665+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABD030
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.055665+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:26.591017+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114619496
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:00.684272+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P463
|
||||
wikidata_labels:
|
||||
cs: AMBIS vysoká škola, a.s, knihovna
|
||||
en: AMBIS,JSC, Scholarly Library
|
||||
wikidata_label_en: AMBIS,JSC, Scholarly Library
|
||||
wikidata_label_cs: AMBIS vysoká škola, a.s, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- Knihovna AMBIS
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q1622062
|
||||
label: university library
|
||||
description: general library at an institution of higher education
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2003486
|
||||
label: Libeň
|
||||
description: cadastral area and district of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_organization:
|
||||
member_of:
|
||||
- id: Q4809890
|
||||
label: Association of Library and Information Professionals of the Czech Republic
|
||||
description: organization
|
||||
wikidata_contact:
|
||||
email: mailto:knihovna@ambis.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-tDge/qPcehBzQ4ZL0RmrfaaoiFExgARmcRFJPWdLs/g=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.178566+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619496
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.178577+00:00'
|
||||
location:
|
||||
latitude: 50.111828
|
||||
longitude: 14.472999
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 8 - Libeň
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 180 00
|
||||
street_address: Lindnerova 575/1
|
||||
normalization_timestamp: '2025-12-09T06:50:47.561464+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:58:33.227071+00:00'
|
||||
source_url: https://www.ambis.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.ambis.cz/files/resize/16/16/ambis-favicon.png
|
||||
source_url: https://www.ambis.cz
|
||||
css_selector: '[document] > html.js.fonts-loaded > head > link:nth-of-type(3)'
|
||||
retrieved_on: '2025-12-23T21:58:33.227071+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/png
|
||||
favicon_sizes: 16x16
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 2
|
||||
277
data/custodian.backup.20251230/CZ-10-PRA-L-AVUVPK.yaml
Normal file
|
|
@@ -0,0 +1,277 @@
|
|||
original_entry:
|
||||
name: Akademie výtvarných umění v Praze - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABD021
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '60461446'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ60461446
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: http://opac.avu.cz/
|
||||
identifier_url: http://opac.avu.cz/
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617655
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617655
|
||||
locations:
|
||||
- street_address: U Akademie 172/4
|
||||
postal_code: 172 00
|
||||
city: Praha 7
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.102708
|
||||
longitude: 14.424
|
||||
processing_timestamp: '2025-12-06T23:37:14.758955+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AVUVPK
|
||||
ghcid_original: CZ-HL-PRA-L-AVUVPK
|
||||
ghcid_uuid: 52d6ae9a-c788-5ca6-9ac6-cdfcccee0d3a
|
||||
ghcid_uuid_sha256: 29049dce-349a-89bc-0e62-9fdf688e56ed
|
||||
ghcid_numeric: 2955660764433230268
|
||||
record_id: d7ec6f35-674c-4715-b658-c2e0d50ee1b8
|
||||
generation_timestamp: '2025-12-06T23:37:14.758955+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AVUVPK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-AVUVPK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-AVUVPK
|
||||
ghcid_numeric: 2955660764433230268
|
||||
valid_from: '2025-12-06T23:37:14.758955+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Akademie výtvarných umění v Praze - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AVUVPK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 52d6ae9a-c788-5ca6-9ac6-cdfcccee0d3a
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 29049dce-349a-89bc-0e62-9fdf688e56ed
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2955660764433230268'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: d7ec6f35-674c-4715-b658-c2e0d50ee1b8
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.805625+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-rbDX60Po9LRK18K3O5FR9eSZgfb5ph0+zBwk4or6wUQ=
|
||||
verified_at: '2025-12-28T19:58:23.199863+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617655
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.199880+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805625+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Akademie výtvarných umění v Praze - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805625+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805625+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 7
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805625+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617655
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617655
|
||||
timestamp: '2025-11-19T10:14:29.805625+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABD021
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.805625+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.758955+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617655
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:08.718822+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Akademie výtvarných umění v Praze, knihovna
|
||||
en: Academy of Fine Arts in Prague, Library
|
||||
wikidata_label_en: Academy of Fine Arts in Prague, Library
|
||||
wikidata_label_cs: Akademie výtvarných umění v Praze, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- AVU
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q1622062
|
||||
label: university library
|
||||
description: general library at an institution of higher education
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q1490045
|
||||
label: Prague 7
|
||||
description: municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:tomas.pergler@avu.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-rbDX60Po9LRK18K3O5FR9eSZgfb5ph0+zBwk4or6wUQ=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.199863+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617655
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.199872+00:00'
|
||||
location:
|
||||
latitude: 50.102708
|
||||
longitude: 14.424
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 7
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 172 00
|
||||
street_address: U Akademie 172/4
|
||||
normalization_timestamp: '2025-12-09T06:50:47.580972+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:58:40.363421+00:00'
|
||||
source_url: http://opac.avu.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: logo_url
|
||||
claim_value: http://opac.avu.cz/custom/design/logoavu2.png
|
||||
source_url: http://opac.avu.cz
|
||||
css_selector: '#portaro-classic-layout > kp-svelte-component-wrapper.kp-header-component-wrapper.ng-isolate-scope > header.kp-header.logo-stripe > div.logo-content-container.container > div.logo-search-row.row > div.customLogoArea.custom-logo-area > h1.unset-style > img'
|
||||
retrieved_on: '2025-12-23T21:58:40.363421+00:00'
|
||||
extraction_method: crawl4ai_header_logo
|
||||
detection_confidence: high
|
||||
alt_text: Akademie výtvarných umění
|
||||
- claim_type: favicon_url
|
||||
claim_value: http://opac.avu.cz/favicon.png?v=2.3.0-32021
|
||||
source_url: http://opac.avu.cz
|
||||
css_selector: '#ng-app > head > link:nth-of-type(9)'
|
||||
retrieved_on: '2025-12-23T21:58:40.363421+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/png
|
||||
favicon_sizes: 256x256
|
||||
summary:
|
||||
total_claims: 2
|
||||
has_primary_logo: true
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-AZN.yaml
Normal file
|
|
@@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: AVICENUM, zdravotnické nakladatelství
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE322
|
||||
locations:
|
||||
- street_address: Malostranské nám. 26
|
||||
postal_code: 118 02
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.610784+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-AZN
|
||||
ghcid_original: CZ-HL-PRA-L-AZN
|
||||
ghcid_uuid: 00c6dd57-92d9-54f7-9b20-9f93f29de9c7
|
||||
ghcid_uuid_sha256: fbb3f4e7-7051-8310-35c9-e0cefa4d0d26
|
||||
ghcid_numeric: 18137109399210111760
|
||||
record_id: 3a8d5f57-f602-4f19-abcc-a496565c9120
|
||||
generation_timestamp: '2025-12-06T23:37:15.610784+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-AZN
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-AZN
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-AZN
|
||||
ghcid_numeric: 18137109399210111760
|
||||
valid_from: '2025-12-06T23:37:15.610784+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: AVICENUM, zdravotnické nakladatelství
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-AZN
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 00c6dd57-92d9-54f7-9b20-9f93f29de9c7
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: fbb3f4e7-7051-8310-35c9-e0cefa4d0d26
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '18137109399210111760'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 3a8d5f57-f602-4f19-abcc-a496565c9120
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.912197+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.912197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: AVICENUM, zdravotnické nakladatelství
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.912197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.912197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.912197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE322
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.912197+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.610784+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 02
|
||||
street_address: Malostranské nám. 26
|
||||
normalization_timestamp: '2025-12-09T20:41:05.207401+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.207376+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-B.yaml
Normal file
|
|
@@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: BENZINA
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF008
|
||||
locations:
|
||||
- street_address: Spálená 5
|
||||
postal_code: 111 28
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.782203+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-B
|
||||
ghcid_original: CZ-HL-PRA-L-B
|
||||
ghcid_uuid: b53b98b7-2cbf-5d8d-b793-f0d601d14c77
|
||||
ghcid_uuid_sha256: 44a7a076-2f51-8988-83ea-7ba7c46b360e
|
||||
ghcid_numeric: 4947099145150265736
|
||||
record_id: 168aece8-08ed-4da9-adbc-8150b306ed07
|
||||
generation_timestamp: '2025-12-06T23:37:15.782203+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-B
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-B
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-B
|
||||
ghcid_numeric: 4947099145150265736
|
||||
valid_from: '2025-12-06T23:37:15.782203+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: BENZINA
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-B
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: b53b98b7-2cbf-5d8d-b793-f0d601d14c77
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 44a7a076-2f51-8988-83ea-7ba7c46b360e
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '4947099145150265736'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 168aece8-08ed-4da9-adbc-8150b306ed07
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914126+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914126+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: BENZINA
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914126+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914126+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914126+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF008
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914126+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.782203+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 111 28
|
||||
street_address: Spálená 5
|
||||
normalization_timestamp: '2025-12-09T20:41:05.253191+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.253168+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BL.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: Barvy a laky
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC057
|
||||
locations:
|
||||
- street_address: Českomoravská 29
|
||||
postal_code: 190 00
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.488435+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BL
|
||||
ghcid_original: CZ-HL-PRA-L-BL
|
||||
ghcid_uuid: 49bed947-bbe7-5811-9b7c-001675e2d6d6
|
||||
ghcid_uuid_sha256: 297bde07-c161-877f-a82a-5dd56cffbaba
|
||||
ghcid_numeric: 2989226902580967295
|
||||
record_id: 002a2a73-db8c-4ed7-aa4a-3228a1c76a97
|
||||
generation_timestamp: '2025-12-06T23:37:14.488435+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BL
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BL
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BL
|
||||
ghcid_numeric: 2989226902580967295
|
||||
valid_from: '2025-12-06T23:37:14.488435+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Barvy a laky
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BL
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 49bed947-bbe7-5811-9b7c-001675e2d6d6
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 297bde07-c161-877f-a82a-5dd56cffbaba
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2989226902580967295'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 002a2a73-db8c-4ed7-aa4a-3228a1c76a97
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.802491+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.802491+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Barvy a laky
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.802491+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.802491+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.802491+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC057
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.802491+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.488435+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 190 00
|
||||
street_address: Českomoravská 29
|
||||
normalization_timestamp: '2025-12-09T20:41:05.295510+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.295483+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BLZ.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: Barvy a laky - závod 5
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF511
|
||||
locations:
|
||||
- street_address: Přátelství 550
|
||||
postal_code: 104 00
|
||||
city: Praha 10 - Uhříněves
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.373440+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BLZ
|
||||
ghcid_original: CZ-HL-PRA-L-BLZ
|
||||
ghcid_uuid: 2c47abc8-7588-5b11-8963-71c9377d507e
|
||||
ghcid_uuid_sha256: 834799aa-8861-8c7d-6ce3-6d36f8dce2f0
|
||||
ghcid_numeric: 9459698500026625149
|
||||
record_id: 1e659114-a761-4f09-a898-91c128325318
|
||||
generation_timestamp: '2025-12-06T23:37:16.373440+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BLZ
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BLZ
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BLZ
|
||||
ghcid_numeric: 9459698500026625149
|
||||
valid_from: '2025-12-06T23:37:16.373440+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Barvy a laky - závod 5
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BLZ
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 2c47abc8-7588-5b11-8963-71c9377d507e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 834799aa-8861-8c7d-6ce3-6d36f8dce2f0
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '9459698500026625149'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 1e659114-a761-4f09-a898-91c128325318
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.918845+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918845+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Barvy a laky - závod 5
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918845+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918845+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 10 - Uhříněves
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918845+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF511
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918845+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.373440+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 104 00
|
||||
street_address: Přátelství 550
|
||||
normalization_timestamp: '2025-12-09T20:41:05.343604+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.343576+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BMSSR.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: Bristol-Myers Squibb, s.r.o.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF018
|
||||
locations:
|
||||
- street_address: Olivova 4/2096
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:24.186843+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BMSSR
|
||||
ghcid_original: CZ-HL-PRA-L-BMSSR
|
||||
ghcid_uuid: ae0d311b-9a87-57bb-a729-38ee3806551b
|
||||
ghcid_uuid_sha256: 732ffc3e-197f-8142-163d-2d91ea6f5289
|
||||
ghcid_numeric: 8300129981913076034
|
||||
record_id: 21e138b9-56a7-41e4-bd57-9f8da97e0d56
|
||||
generation_timestamp: '2025-12-06T23:37:24.186843+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BMSSR
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BMSSR
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BMSSR
|
||||
ghcid_numeric: 8300129981913076034
|
||||
valid_from: '2025-12-06T23:37:24.186843+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Bristol-Myers Squibb, s.r.o.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BMSSR
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ae0d311b-9a87-57bb-a729-38ee3806551b
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 732ffc3e-197f-8142-163d-2d91ea6f5289
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '8300129981913076034'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 21e138b9-56a7-41e4-bd57-9f8da97e0d56
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.034986+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.034986+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Bristol-Myers Squibb, s.r.o.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.034986+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.034986+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.034986+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF018
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.034986+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:24.186843+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Olivova 4/2096
|
||||
normalization_timestamp: '2025-12-09T20:41:05.399591+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.399563+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BOPS.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: Balírny obchodu Praha, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF237
|
||||
locations:
|
||||
- street_address: Karlovo nám. 18
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.187235+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BOPS
|
||||
ghcid_original: CZ-HL-PRA-L-BOPS
|
||||
ghcid_uuid: 7beaa98c-c3fb-574c-95cf-1df1778178ba
|
||||
ghcid_uuid_sha256: 6ab92454-360f-865a-41a9-a05c376f827f
|
||||
ghcid_numeric: 7690217782814639706
|
||||
record_id: dc435520-e235-436b-9023-4f66b4208c10
|
||||
generation_timestamp: '2025-12-06T23:37:16.187235+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BOPS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BOPS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BOPS
|
||||
ghcid_numeric: 7690217782814639706
|
||||
valid_from: '2025-12-06T23:37:16.187235+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Balírny obchodu Praha, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BOPS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 7beaa98c-c3fb-574c-95cf-1df1778178ba
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 6ab92454-360f-865a-41a9-a05c376f827f
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7690217782814639706'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: dc435520-e235-436b-9023-4f66b4208c10
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.917462+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917462+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Balírny obchodu Praha, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917462+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917462+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917462+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF237
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917462+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.187235+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Karlovo nám. 18
|
||||
normalization_timestamp: '2025-12-09T20:41:05.444789+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.444776+00:00'
|
||||
272
data/custodian.backup.20251230/CZ-10-PRA-L-BPSRPNM.yaml
Normal file
|
|
@ -0,0 +1,272 @@
|
|||
original_entry:
|
||||
name: Bärenreiter Praha s.r.o. - Půjčovna notových materiálů
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF314
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '40527352'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ40527352
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.baerenreiter.cz/
|
||||
identifier_url: https://www.baerenreiter.cz/
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617814
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617814
|
||||
locations:
|
||||
- street_address: Perunova 1412/10
|
||||
postal_code: 130 00
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.07673
|
||||
longitude: 14.454035
|
||||
processing_timestamp: '2025-12-06T23:37:16.222539+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BPSRPNM
|
||||
ghcid_original: CZ-HL-PRA-L-BPSRPNM
|
||||
ghcid_uuid: f517ef34-3199-5fa8-a64e-a3d6c6bc8b79
|
||||
ghcid_uuid_sha256: f1b94f67-64ea-8685-9497-237fda7ff180
|
||||
ghcid_numeric: 17418040339325408901
|
||||
record_id: b4884eee-e6aa-48b6-b13c-12711d7cd24b
|
||||
generation_timestamp: '2025-12-06T23:37:16.222539+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BPSRPNM
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-BPSRPNM
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-BPSRPNM
|
||||
ghcid_numeric: 17418040339325408901
|
||||
valid_from: '2025-12-06T23:37:16.222539+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Bärenreiter Praha s.r.o. - Půjčovna notových materiálů
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BPSRPNM
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: f517ef34-3199-5fa8-a64e-a3d6c6bc8b79
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: f1b94f67-64ea-8685-9497-237fda7ff180
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '17418040339325408901'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: b4884eee-e6aa-48b6-b13c-12711d7cd24b
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.917698+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-adAcvng7eL1uAzyjqBlS4toCs3qHqquBYX6PXZUcX/s=
|
||||
verified_at: '2025-12-28T19:58:23.277377+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617814
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.277402+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917698+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Bärenreiter Praha s.r.o. - Půjčovna notových materiálů
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917698+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917698+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 3
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917698+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617814
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617814
|
||||
timestamp: '2025-11-19T10:14:29.917698+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF314
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917698+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.222539+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617814
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:16.758067+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Bärenreiter Praha s.r.o., Půjčovna notových materiálů (knihovna)
|
||||
en: Bärenreiter Praha s.r.o., Hire Library
|
||||
wikidata_label_en: Bärenreiter Praha s.r.o., Hire Library
|
||||
wikidata_label_cs: Bärenreiter Praha s.r.o., Půjčovna notových materiálů (knihovna)
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q385994
|
||||
label: special library
|
||||
description: specialized library providing resources on a particular topic or discipline
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2598899
|
||||
label: Prague 3
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:hire@baerenreiter.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-adAcvng7eL1uAzyjqBlS4toCs3qHqquBYX6PXZUcX/s=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.277377+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617814
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.277390+00:00'
|
||||
location:
|
||||
latitude: 50.07673
|
||||
longitude: 14.454035
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 3
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 130 00
|
||||
street_address: Perunova 1412/10
|
||||
normalization_timestamp: '2025-12-09T06:50:47.749663+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:58:58.246784+00:00'
|
||||
source_url: https://www.baerenreiter.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.baerenreiter.cz/sites/default/files/favicon.ico
|
||||
source_url: https://www.baerenreiter.cz
|
||||
css_selector: '[document] > html.js > head > link'
|
||||
retrieved_on: '2025-12-23T21:58:58.246784+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/vnd.microsoft.icon
|
||||
favicon_sizes: ''
|
||||
- claim_type: og_image_url
|
||||
claim_value: https://www.baerenreiter.cz/sites/default/files/fb.png
|
||||
source_url: https://www.baerenreiter.cz
|
||||
css_selector: '[document] > html.js > head > meta:nth-of-type(10)'
|
||||
retrieved_on: '2025-12-23T21:58:58.246784+00:00'
|
||||
extraction_method: crawl4ai_meta_og
|
||||
summary:
|
||||
total_claims: 2
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: true
|
||||
favicon_count: 1
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BS.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: BALNEX, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE209
|
||||
locations:
|
||||
- street_address: Křižovnická 3
|
||||
postal_code: 110 01
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.404725+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BS
|
||||
ghcid_original: CZ-HL-PRA-L-BS
|
||||
ghcid_uuid: a6c010de-713b-583c-911c-7360c964f0c8
|
||||
ghcid_uuid_sha256: d5f79d41-d62a-81e5-fa61-a849a722c094
|
||||
ghcid_numeric: 15417964755417694693
|
||||
record_id: 5bab80da-b02b-44fb-90a4-4be4ae66abc6
|
||||
generation_timestamp: '2025-12-06T23:37:15.404725+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BS
|
||||
ghcid_numeric: 15417964755417694693
|
||||
valid_from: '2025-12-06T23:37:15.404725+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: BALNEX, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: a6c010de-713b-583c-911c-7360c964f0c8
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: d5f79d41-d62a-81e5-fa61-a849a722c094
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15417964755417694693'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 5bab80da-b02b-44fb-90a4-4be4ae66abc6
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.909359+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.909359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: BALNEX, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.909359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.909359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.909359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE209
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.909359+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.404725+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 01
|
||||
street_address: Křižovnická 3
|
||||
normalization_timestamp: '2025-12-09T20:41:05.487903+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.487890+00:00'
|
||||
161
data/custodian.backup.20251230/CZ-10-PRA-L-BTS.yaml
Normal file
|
|
@ -0,0 +1,161 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: BIJO TC, a.s.
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: BIJO TC, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC125
|
||||
locations:
|
||||
- street_address: Tiskařská 10/257
|
||||
postal_code: 108 28
|
||||
city: Praha 10 - Malešice
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.605316+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BTS
|
||||
ghcid_original: CZ-HL-PRA-L-BTS
|
||||
ghcid_uuid: 46fb3260-4257-5d4f-859e-32420cdf87b5
|
||||
ghcid_uuid_sha256: ca43ed37-dcf6-8c51-0af3-e33d300f2c68
|
||||
ghcid_numeric: 14574753643287153745
|
||||
record_id: 8748db8f-27f7-42d7-9ea4-9a141e04d05d
|
||||
generation_timestamp: '2025-12-06T23:37:14.605316+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BTS
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-BTS
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-BTS
|
||||
ghcid_numeric: 14574753643287153745
|
||||
valid_from: '2025-12-06T23:37:14.605316+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BTS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 46fb3260-4257-5d4f-859e-32420cdf87b5
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: ca43ed37-dcf6-8c51-0af3-e33d300f2c68
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '14574753643287153745'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 8748db8f-27f7-42d7-9ea4-9a141e04d05d
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.803741+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803741+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: BIJO TC, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803741+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803741+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 10 - Malešice
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803741+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC125
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803741+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.605316+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 108 28
|
||||
street_address: Tiskařská 10/257
|
||||
normalization_timestamp: '2025-12-09T20:41:05.534948+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.534935+00:00'
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BVCRSR.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: BUREAU VERITAS CZECH REPUBLIC, s.r.o.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF042
|
||||
locations:
|
||||
- street_address: Olbrachtova 1
|
||||
postal_code: 140 00
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.840934+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BVCRSR
|
||||
ghcid_original: CZ-HL-PRA-L-BVCRSR
|
||||
ghcid_uuid: b4d15574-7d65-5c95-b400-0cd921f8038f
|
||||
ghcid_uuid_sha256: adba6580-9cb8-8cba-3b07-d6bc76134ec4
|
||||
ghcid_numeric: 12518429717289270458
|
||||
record_id: 2cdae77e-3374-4bf4-9254-ea086fa284d1
|
||||
generation_timestamp: '2025-12-06T23:37:15.840934+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BVCRSR
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BVCRSR
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BVCRSR
|
||||
ghcid_numeric: 12518429717289270458
|
||||
valid_from: '2025-12-06T23:37:15.840934+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: BUREAU VERITAS CZECH REPUBLIC, s.r.o.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BVCRSR
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: b4d15574-7d65-5c95-b400-0cd921f8038f
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: adba6580-9cb8-8cba-3b07-d6bc76134ec4
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12518429717289270458'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 2cdae77e-3374-4bf4-9254-ea086fa284d1
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914590+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914590+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: BUREAU VERITAS CZECH REPUBLIC, s.r.o.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914590+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914590+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914590+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF042
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914590+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.840934+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 140 00
|
||||
street_address: Olbrachtova 1
|
||||
normalization_timestamp: '2025-12-09T20:41:05.573445+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.573432+00:00'
|
||||
277
data/custodian.backup.20251230/CZ-10-PRA-L-BZHMPK.yaml
Normal file
|
|
@ -0,0 +1,277 @@
|
|||
original_entry:
|
||||
name: Botanická zahrada hl. m. Prahy - knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE404
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '00064572'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ00064572
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://knihovna.botanicka.cz/
|
||||
identifier_url: https://knihovna.botanicka.cz/
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617803
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617803
|
||||
locations:
|
||||
- street_address: Trojská 800/196
|
||||
postal_code: 171 00
|
||||
city: Praha 7 - Troja
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.121983
|
||||
longitude: 14.412453
|
||||
processing_timestamp: '2025-12-06T23:37:15.700458+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BZHMPK
|
||||
ghcid_original: CZ-HL-PRA-L-BZHMPK
|
||||
ghcid_uuid: ddb28542-c287-5302-9f45-281bda8f18d1
|
||||
ghcid_uuid_sha256: 6de8d9c8-9d4d-8589-b9b1-7776aad03b6a
|
||||
ghcid_numeric: 7919819400386827657
|
||||
record_id: 965cbe79-ccdd-4561-8acc-7ae286cc3ef2
|
||||
generation_timestamp: '2025-12-06T23:37:15.700458+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BZHMPK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-BZHMPK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-BZHMPK
|
||||
ghcid_numeric: 7919819400386827657
|
||||
valid_from: '2025-12-06T23:37:15.700458+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Botanická zahrada hl. m. Prahy - knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BZHMPK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ddb28542-c287-5302-9f45-281bda8f18d1
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 6de8d9c8-9d4d-8589-b9b1-7776aad03b6a
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7919819400386827657'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 965cbe79-ccdd-4561-8acc-7ae286cc3ef2
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.913331+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-fnZfc5LlrZv1vMwhc7UMIawFYMK4eplZVEeWgDEpN0Y=
|
||||
verified_at: '2025-12-28T19:58:23.327120+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617803
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.327139+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913331+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Botanická zahrada hl. m. Prahy - knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913331+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913331+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 7 - Troja
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913331+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617803
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617803
|
||||
timestamp: '2025-11-19T10:14:29.913331+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE404
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.913331+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.700458+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617803
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:25.230947+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Botanická zahrada hl. m. Prahy, knihovna
|
||||
en: Prague Botanical Garden
|
||||
wikidata_label_en: Prague Botanical Garden
|
||||
wikidata_label_cs: Botanická zahrada hl. m. Prahy, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- BZ Praha
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q385994
|
||||
label: special library
|
||||
description: specialized library providing resources on a particular topic or discipline
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q84682501
|
||||
label: Troja
|
||||
description: cadastral area in Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:info@botanicka.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-fnZfc5LlrZv1vMwhc7UMIawFYMK4eplZVEeWgDEpN0Y=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.327120+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617803
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.327130+00:00'
|
||||
location:
|
||||
latitude: 50.121983
|
||||
longitude: 14.412453
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 7 - Troja
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 171 00
|
||||
street_address: Trojská 800/196
|
||||
normalization_timestamp: '2025-12-09T06:50:47.837815+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:59:10.228060+00:00'
|
||||
source_url: https://knihovna.botanicka.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: logo_url
|
||||
claim_value: https://knihovna.botanicka.cz/custom/design/logo.png
|
||||
source_url: https://knihovna.botanicka.cz
|
||||
css_selector: '#portaro-classic-layout > kp-svelte-component-wrapper.kp-header-component-wrapper.ng-isolate-scope > header.kp-header.logo-stripe > div.logo-content-container.container > div.logo-search-row.row > div.customLogoArea.custom-logo-area > div > a > img'
|
||||
retrieved_on: '2025-12-23T21:59:10.228060+00:00'
|
||||
extraction_method: crawl4ai_header_logo
|
||||
detection_confidence: high
|
||||
alt_text: Botanická zahrada Praha
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://knihovna.botanicka.cz/favicon.png?v=2.3.0-32021
|
||||
source_url: https://knihovna.botanicka.cz
|
||||
css_selector: '#ng-app > head > link:nth-of-type(9)'
|
||||
retrieved_on: '2025-12-23T21:59:10.228060+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/png
|
||||
favicon_sizes: 256x256
|
||||
summary:
|
||||
total_claims: 2
|
||||
has_primary_logo: true
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
160
data/custodian.backup.20251230/CZ-10-PRA-L-BZVPM.yaml
Normal file
|
|
@ -0,0 +1,160 @@
|
|||
original_entry:
|
||||
name: BARUM Zlín - výrobní podnik MITAS
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF219
|
||||
locations:
|
||||
- street_address: Komarovova 1900
|
||||
postal_code: 106 24
|
||||
city: Praha 10
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.166411+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-BZVPM
|
||||
ghcid_original: CZ-HL-PRA-L-BZVPM
|
||||
ghcid_uuid: 2bc7f6fb-4623-5d84-bc09-5af68d2b30d5
|
||||
ghcid_uuid_sha256: 7ee23211-b1ce-87fe-b290-acc47c3d2166
|
||||
ghcid_numeric: 9142925245094483966
|
||||
record_id: 8ce281e5-83f2-413e-83e0-286609a9437c
|
||||
generation_timestamp: '2025-12-06T23:37:16.166411+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-BZVPM
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-BZVPM
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-BZVPM
|
||||
ghcid_numeric: 9142925245094483966
|
||||
valid_from: '2025-12-06T23:37:16.166411+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: BARUM Zlín - výrobní podnik MITAS
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-BZVPM
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 2bc7f6fb-4623-5d84-bc09-5af68d2b30d5
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 7ee23211-b1ce-87fe-b290-acc47c3d2166
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '9142925245094483966'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 8ce281e5-83f2-413e-83e0-286609a9437c
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.917237+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917237+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: BARUM Zlín - výrobní podnik MITAS
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917237+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917237+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 10
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917237+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF219
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.917237+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.166411+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 106 24
|
||||
street_address: Komarovova 1900
|
||||
normalization_timestamp: '2025-12-09T20:41:05.614114+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.614068+00:00'
|
||||
233
data/custodian.backup.20251230/CZ-10-PRA-L-C.yaml
Normal file
|
|
@ -0,0 +1,233 @@
|
|||
original_entry:
|
||||
name: ČKAIT
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE121
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '45770743'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ45770743
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
|
||||
identifier_url: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
|
||||
locations:
|
||||
- street_address: Sokolská 15
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.073689
|
||||
longitude: 14.428981
|
||||
processing_timestamp: '2025-12-06T23:37:24.220409+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-C
|
||||
ghcid_original: CZ-HL-PRA-L-Č
|
||||
ghcid_uuid: fc03c3b2-07eb-59c3-8cec-76a02a126334
|
||||
ghcid_uuid_sha256: 69fa5c4f-fa31-8052-a145-b97bc584a8c4
|
||||
ghcid_numeric: 7636517616729829458
|
||||
record_id: e4649a2e-d847-4fff-988c-474675df99b5
|
||||
generation_timestamp: '2025-12-06T23:37:24.220409+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-C
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-C
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-C
|
||||
ghcid_numeric: 7636517616729829458
|
||||
valid_from: '2025-12-08T11:21:30.867239+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-Č)'
|
||||
- ghcid: CZ-HL-PRA-L-Č
|
||||
ghcid_numeric: 6753267552530310287
|
||||
valid_from: '2025-12-06T23:37:24.220409+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:30.867239+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČKAIT
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-C
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: fc03c3b2-07eb-59c3-8cec-76a02a126334
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 69fa5c4f-fa31-8052-a145-b97bc584a8c4
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '7636517616729829458'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: e4649a2e-d847-4fff-988c-474675df99b5
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- identifier_scheme: Wikidata
|
||||
identifier_value: Q114619194
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619194
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.035396+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-9flePgPRModhuSmKlHwWMjPrGn5nVoXBoUUCCLzfbjw=
|
||||
verified_at: '2025-12-28T19:58:23.355427+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619194
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.355445+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035396+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČKAIT
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035396+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035396+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035396+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE121
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.035396+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:24.220409+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
latitude: 50.073689
|
||||
longitude: 14.428981
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Sokolská 15
|
||||
normalization_timestamp: '2025-12-09T06:50:47.895364+00:00'
|
||||
wikidata_enrichment:
|
||||
wikidata_id: Q114619194
|
||||
wikidata_url: https://www.wikidata.org/wiki/Q114619194
|
||||
matched_by: sigla_identifier
|
||||
matched_sigla: ABE121
|
||||
wikidata_label: ČKAIT (knihovna)
|
||||
wikidata_description: knihovna v Praze
|
||||
enrichment_date: '2025-12-19T10:13:02.020089+00:00'
|
||||
enrichment_version: 2.1.0
|
||||
instance_of:
|
||||
- Q385994
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-9flePgPRModhuSmKlHwWMjPrGn5nVoXBoUUCCLzfbjw=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.355427+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619194
|
||||
generatedAtTime: '2025-12-19T10:13:02.020089+00:00'
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.355435+00:00'
|
||||
wikidata_entity_id: Q114619194
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:59:18.694376+00:00'
|
||||
source_url: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: og_image_url
|
||||
claim_value: https://www.ckait.cz/sites/default/files/logo_0.png
|
||||
source_url: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
|
||||
css_selector: '[document] > html.js > head > meta:nth-of-type(4)'
|
||||
retrieved_on: '2025-12-23T21:59:18.694376+00:00'
|
||||
extraction_method: crawl4ai_meta_og
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: false
|
||||
has_og_image: true
|
||||
favicon_count: 0
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CA.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Československé aerolinie
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF013
|
||||
locations:
|
||||
- street_address: Revoluční 1
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.757670+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CA
|
||||
ghcid_original: CZ-HL-PRA-L-ČA
|
||||
ghcid_uuid: ec8f4d30-cab6-563a-9e95-68ecde7d50d2
|
||||
ghcid_uuid_sha256: 0e090c6c-ff12-8a37-b143-e016cadec154
|
||||
ghcid_numeric: 1011353253596809783
|
||||
record_id: 2eef5618-dd76-4088-98bd-052f9a7f1daa
|
||||
generation_timestamp: '2025-12-06T23:37:25.757670+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CA
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CA
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CA
|
||||
ghcid_numeric: 1011353253596809783
|
||||
valid_from: '2025-12-08T11:21:40.305739+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČA)'
|
||||
- ghcid: CZ-HL-PRA-L-ČA
|
||||
ghcid_numeric: 14316824294466373603
|
||||
valid_from: '2025-12-06T23:37:25.757670+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:40.305739+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Československé aerolinie
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CA
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ec8f4d30-cab6-563a-9e95-68ecde7d50d2
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 0e090c6c-ff12-8a37-b143-e016cadec154
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '1011353253596809783'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 2eef5618-dd76-4088-98bd-052f9a7f1daa
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.047770+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047770+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Československé aerolinie
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047770+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047770+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047770+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF013
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047770+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.757670+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Revoluční 1
|
||||
normalization_timestamp: '2025-12-09T20:41:05.659406+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.659389+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CAPOC.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Česká agentura na podporu obchodu / CzechTrade
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC078
|
||||
locations:
|
||||
- street_address: Politických vězňů 20
|
||||
postal_code: 111 21
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:14.535093+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CAPOC
|
||||
ghcid_original: CZ-HL-PRA-L-ČAPOC
|
||||
ghcid_uuid: bd63e385-f54a-59f0-8fc2-9a816ff748dc
|
||||
ghcid_uuid_sha256: bf51eb6c-2cd1-8f54-9b13-21677ac27e17
|
||||
ghcid_numeric: 13786058784198733652
|
||||
record_id: 76cf57a2-2e08-4564-ab23-7d6a2916eb29
|
||||
generation_timestamp: '2025-12-06T23:37:14.535093+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CAPOC
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CAPOC
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CAPOC
|
||||
ghcid_numeric: 13786058784198733652
|
||||
valid_from: '2025-12-08T11:21:23.125025+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČAPOC)'
|
||||
- ghcid: CZ-HL-PRA-L-ČAPOC
|
||||
ghcid_numeric: 436287342414373497
|
||||
valid_from: '2025-12-06T23:37:14.535093+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:23.125025+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Česká agentura na podporu obchodu / CzechTrade
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CAPOC
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: bd63e385-f54a-59f0-8fc2-9a816ff748dc
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: bf51eb6c-2cd1-8f54-9b13-21677ac27e17
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13786058784198733652'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 76cf57a2-2e08-4564-ab23-7d6a2916eb29
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.803043+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803043+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Česká agentura na podporu obchodu / CzechTrade
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803043+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803043+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803043+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC078
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803043+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.535093+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 111 21
|
||||
street_address: Politických vězňů 20
|
||||
normalization_timestamp: '2025-12-09T20:41:05.703407+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.703378+00:00'
|
||||
173
data/custodian.backup.20251230/CZ-10-PRA-L-CAS.yaml
Normal file
|
|
@ -0,0 +1,173 @@
|
|||
original_entry:
|
||||
name: České aerolinie, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF014
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '45795908'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ45795908
|
||||
locations:
|
||||
- street_address: Ruzyně - letiště
|
||||
postal_code: 160 08
|
||||
city: Praha 6
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.792471+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CAS
|
||||
ghcid_original: CZ-HL-PRA-L-ČAS
|
||||
ghcid_uuid: e1fd3676-d4e0-5959-bb25-ff516014a9dc
|
||||
ghcid_uuid_sha256: b9edb1dd-c76f-853f-bd4b-3b74130c84ad
|
||||
ghcid_numeric: 13397560032588944703
|
||||
record_id: 6f5b847f-5531-45b6-994f-fde37203ff3b
|
||||
generation_timestamp: '2025-12-06T23:37:15.792471+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CAS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CAS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CAS
|
||||
ghcid_numeric: 13397560032588944703
|
||||
valid_from: '2025-12-08T11:21:21.233118+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČAS)'
|
||||
- ghcid: CZ-HL-PRA-L-ČAS
|
||||
ghcid_numeric: 16494477793071435589
|
||||
valid_from: '2025-12-06T23:37:15.792471+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.233118+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: České aerolinie, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CAS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: e1fd3676-d4e0-5959-bb25-ff516014a9dc
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b9edb1dd-c76f-853f-bd4b-3b74130c84ad
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '13397560032588944703'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 6f5b847f-5531-45b6-994f-fde37203ff3b
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914209+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914209+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: České aerolinie, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914209+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914209+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 6
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914209+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF014
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914209+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.792471+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 160 08
|
||||
street_address: Ruzyně - letiště
|
||||
normalization_timestamp: '2025-12-09T20:41:05.751437+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.751422+00:00'
|
||||
270
data/custodian.backup.20251230/CZ-10-PRA-L-CBSK.yaml
Normal file
|
|
@ -0,0 +1,270 @@
|
|||
original_entry:
|
||||
name: Česká botanická společnost - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB108
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '00444553'
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://katalog.botanospol.cz/
|
||||
identifier_url: https://katalog.botanospol.cz/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617585
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617585
|
||||
locations:
|
||||
- street_address: Benátská 2
|
||||
postal_code: 128 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.071464
|
||||
longitude: 14.420864
|
||||
processing_timestamp: '2025-12-06T23:35:06.433942+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CBSK
|
||||
ghcid_original: CZ-HL-PRA-L-ČBSK
|
||||
ghcid_uuid: 8c696492-e42e-5b10-a0b0-d67d7be12203
|
||||
ghcid_uuid_sha256: 9ec8dfa1-4d46-84b2-9a72-a4a7483f2012
|
||||
ghcid_numeric: 11441640737213916338
|
||||
record_id: a52123de-0f83-4b07-a53a-e96b07d5b66e
|
||||
generation_timestamp: '2025-12-06T23:35:06.433942+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CBSK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CBSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CBSK
|
||||
ghcid_numeric: 11441640737213916338
|
||||
valid_from: '2025-12-08T11:21:27.632820+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČBSK)'
|
||||
- ghcid: CZ-HL-PRA-L-ČBSK
|
||||
ghcid_numeric: 3946897244846925513
|
||||
valid_from: '2025-12-06T23:35:06.433942+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:27.632820+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Česká botanická společnost - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CBSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 8c696492-e42e-5b10-a0b0-d67d7be12203
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 9ec8dfa1-4d46-84b2-9a72-a4a7483f2012
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '11441640737213916338'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: a52123de-0f83-4b07-a53a-e96b07d5b66e
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.800191+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-oY4wPu7P3YKsrWpr2H3FmyYjbSLqUh+rxXz32uzFU2c=
|
||||
verified_at: '2025-12-28T19:58:23.405533+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617585
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.405552+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800191+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Česká botanická společnost - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800191+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800191+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800191+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617585
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617585
|
||||
timestamp: '2025-11-19T10:14:29.800191+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB108
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.800191+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.433942+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617585
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:33.397393+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Česká botanická společnost, knihovna
|
||||
en: Czech Botanical Society
|
||||
wikidata_label_en: Czech Botanical Society
|
||||
wikidata_label_cs: Česká botanická společnost, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- ČBS
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q385994
|
||||
label: special library
|
||||
description: specialized library providing resources on a particular topic or discipline
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2444636
|
||||
label: Prague 2
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:botspol@natur.cuni.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-oY4wPu7P3YKsrWpr2H3FmyYjbSLqUh+rxXz32uzFU2c=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.405533+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617585
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.405542+00:00'
|
||||
location:
|
||||
latitude: 50.071464
|
||||
longitude: 14.420864
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 128 00
|
||||
street_address: Benátská 2
|
||||
normalization_timestamp: '2025-12-09T06:50:47.995759+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T21:59:31.706423+00:00'
|
||||
source_url: https://katalog.botanospol.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://katalog.botanospol.cz/themes/root/images/vufind-favicon.ico
|
||||
source_url: https://katalog.botanospol.cz
|
||||
css_selector: '[document] > html > head > link:nth-of-type(7)'
|
||||
retrieved_on: '2025-12-23T21:59:31.706423+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/x-icon
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
161
data/custodian.backup.20251230/CZ-10-PRA-L-CBSPAC.yaml
Normal file
|
|
@ -0,0 +1,161 @@
|
|||
original_entry:
|
||||
name: Československá botanická společnost při AV ČR
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE333
|
||||
locations: []
|
||||
processing_timestamp: '2025-12-06T23:37:25.736072+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CBSPAC
|
||||
ghcid_original: XX-XX-XXX-L-ČBSPAČ
|
||||
ghcid_uuid: 1cd8e994-d075-551d-978c-51d7f78ce6cb
|
||||
ghcid_uuid_sha256: b1e65467-4d97-84c9-89ec-7d46502ff5d6
|
||||
ghcid_numeric: 12819026192016864457
|
||||
record_id: 5e1f9dc6-7fc7-4a56-83cd-9150353177d9
|
||||
generation_timestamp: '2025-12-06T23:37:25.736072+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: NAME_PATTERN_EXTRACTION
|
||||
resolution_timestamp: '2025-12-06T23:56:13.484431+00:00'
|
||||
resolution_notes: City 'AV ČR' extracted from institution name
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CBSPAC
|
||||
ghcid_numeric: 12819026192016864457
|
||||
valid_from: '2025-12-08T11:21:22.614180+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-ČBSPAČ)'
|
||||
- ghcid: XX-XX-XXX-L-ČBSPAČ
|
||||
ghcid_numeric: 10375321127213274876
|
||||
valid_from: '2025-12-06T23:37:25.736072+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:22.614180+00:00'
|
||||
- ghcid: CZ-XX-XXX-L-ČBSPAČ
|
||||
valid_from: '2025-12-06T23:56:13.484437+00:00'
|
||||
reason: 'Country resolved via Wikidata P17: XX→CZ'
|
||||
- ghcid: CZ-10-PRA-L-ČBSPAČ
|
||||
valid_from: '2025-12-07T12:38:22.576530+00:00'
|
||||
reason: 'Location resolved from institution name pattern: ''AV ČR'' → region 10, city PRA'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Československá botanická společnost při AV ČR
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CBSPAC
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 1cd8e994-d075-551d-978c-51d7f78ce6cb
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b1e65467-4d97-84c9-89ec-7d46502ff5d6
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12819026192016864457'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 5e1f9dc6-7fc7-4a56-83cd-9150353177d9
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.047635+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'Country resolved 2025-12-06T23:56:13Z: XX→CZ via Wikidata P17'
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:58Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047635+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Československá botanická společnost při AV ČR
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047635+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047635+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE333
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047635+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.736072+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Česká botanická společnost
|
||||
rejection_reason: 'NAME MATCH: Names are similar, likely referring to the same entity (''Československá'' vs ''Česká'' are close equivalents). LOCATION MATCH: Location is a strong match (both are in Prague, Czechia). TYPE MATCH: Google Place types (''point_of_interest'', ''establishment'') do not match the expected heritage institution types (museum, library, etc.). ENTITY TYPE: The entity is a learned society (a botanical society), not a heritage custodian institution like a museum, library, gallery, or archive. Therefore, it is not a GRP.HER.'
|
||||
timestamp: '2025-12-08T19:49:57.786616+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: Československá botanická společnost při AV ČR official
|
||||
youtube_search_timestamp: '2025-12-08T19:49:58.157855+00:00'
|
||||
location:
|
||||
region_code: 10
|
||||
country: CZ
|
||||
normalization_timestamp: '2025-12-09T20:40:58.889046+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
city: Prague
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.889035+00:00'
|
||||
164
data/custodian.backup.20251230/CZ-10-PRA-L-CBUUER.yaml
Normal file
|
|
@ -0,0 +1,164 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČSAV - Botanický ústav - úsek ekologie rostlin
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ČSAV - Botanický ústav - úsek ekologie rostlin
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: BOE031
|
||||
locations: []
|
||||
processing_timestamp: '2025-12-06T23:37:25.837611+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CBUUER
|
||||
ghcid_original: XX-XX-XXX-L-ČBÚÚER
|
||||
ghcid_uuid: ceb02c28-0dec-5d1d-aee9-843abb460aa7
|
||||
ghcid_uuid_sha256: 27d03f00-17d2-8dd1-be41-bdd369b306d4
|
||||
ghcid_numeric: 2868862232267226577
|
||||
record_id: d0f92280-9bec-4289-a47b-9a90766cd98f
|
||||
generation_timestamp: '2025-12-06T23:37:25.837611+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: NAME_PATTERN_EXTRACTION
|
||||
resolution_timestamp: '2025-12-06T23:56:13.163034+00:00'
|
||||
resolution_notes: City 'ČSAV' extracted from institution name
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CBUUER
|
||||
ghcid_numeric: 2868862232267226577
|
||||
valid_from: '2025-12-08T11:21:34.560987+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-ČBÚÚER)'
|
||||
- ghcid: XX-XX-XXX-L-ČBÚÚER
|
||||
ghcid_numeric: 8403561736984873728
|
||||
valid_from: '2025-12-06T23:37:25.837611+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:34.560987+00:00'
|
||||
- ghcid: CZ-XX-XXX-L-ČBÚÚER
|
||||
valid_from: '2025-12-06T23:56:13.163043+00:00'
|
||||
reason: 'Country resolved via Wikidata P17: XX→CZ'
|
||||
- ghcid: CZ-10-PRA-L-ČBÚÚER
|
||||
valid_from: '2025-12-07T12:38:22.577753+00:00'
|
||||
reason: 'Location resolved from institution name pattern: ''ČSAV'' → region 10, city PRA'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CBUUER
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ceb02c28-0dec-5d1d-aee9-843abb460aa7
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 27d03f00-17d2-8dd1-be41-bdd369b306d4
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2868862232267226577'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: d0f92280-9bec-4289-a47b-9a90766cd98f
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.048230+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'Country resolved 2025-12-06T23:56:13Z: XX→CZ via Wikidata P17'
|
||||
- 'YouTube/Google Maps enrichment 2025-12-08T19:50:03Z: Maps: rejected by LLM; YouTube: not found'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.048230+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČSAV - Botanický ústav - úsek ekologie rostlin
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.048230+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.048230+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-BOE031
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.048230+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.837611+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
google_maps_status: NO_MATCH
|
||||
google_maps_rejected:
|
||||
candidate_name: Oddělení vegetační ekologie, Botanický ústav AV ČR
|
||||
rejection_reason: 'REJECT: TYPE_MISMATCH. The Google Place candidate, ''Oddělení vegetační ekologie, Botanický ústav AV ČR'', is a department of a scientific research institute. Although the names and location indicate it is the correct institution, its function is scientific research, not heritage curation like a museum, library, or gallery. The Google Place types ''point_of_interest'' and ''establishment'' are too generic and do not support classification as a heritage institution.'
|
||||
timestamp: '2025-12-08T19:50:02.915435+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: ČSAV - Botanický ústav - úsek ekologie rostlin official
|
||||
youtube_search_timestamp: '2025-12-08T19:50:03.282523+00:00'
|
||||
location:
|
||||
region_code: 10
|
||||
country: CZ
|
||||
normalization_timestamp: '2025-12-09T20:40:58.926923+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
city: Prague
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.926901+00:00'
|
||||
163
data/custodian.backup.20251230/CZ-10-PRA-L-CCS.yaml
Normal file
|
|
@ -0,0 +1,163 @@
|
|||
original_entry:
|
||||
name: Českomoravský cement, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF505
|
||||
locations:
|
||||
- postal_code: 153 02
|
||||
city: Praha 5 - Radotín
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:16.356857+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CCS
|
||||
ghcid_original: CZ-HL-PRA-L-ČCS
|
||||
ghcid_uuid: 3a9e1eed-4461-5523-a050-78eba32bc1c6
|
||||
ghcid_uuid_sha256: 426cee88-c5b2-8ef0-bcef-963c5ef4d2c5
|
||||
ghcid_numeric: 4786462775187738352
|
||||
record_id: c1903e67-931d-4cc0-bc08-b53aa007c3c4
|
||||
generation_timestamp: '2025-12-06T23:37:16.356857+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CCS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CCS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CCS
|
||||
ghcid_numeric: 4786462775187738352
|
||||
valid_from: '2025-12-08T11:21:21.969957+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČCS)'
|
||||
- ghcid: CZ-HL-PRA-L-ČCS
|
||||
ghcid_numeric: 5999739163448361722
|
||||
valid_from: '2025-12-06T23:37:16.356857+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.969957+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Českomoravský cement, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CCS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 3a9e1eed-4461-5523-a050-78eba32bc1c6
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 426cee88-c5b2-8ef0-bcef-963c5ef4d2c5
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '4786462775187738352'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: c1903e67-931d-4cc0-bc08-b53aa007c3c4
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.918721+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918721+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Českomoravský cement, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918721+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918721+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 5 - Radotín
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918721+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF505
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.918721+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.356857+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 153 02
|
||||
normalization_timestamp: '2025-12-09T20:41:05.794726+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.794713+00:00'
|
||||
161
data/custodian.backup.20251230/CZ-10-PRA-L-CDOPES.yaml
Normal file
|
|
@ -0,0 +1,161 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Centrální dispečerská organizace propojených energetických soustav
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: Centrální dispečerská organizace propojených energetických soustav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE185
|
||||
locations:
|
||||
- street_address: Jungmannova 29
|
||||
postal_code: 111 32
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.370319+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDOPES
|
||||
ghcid_original: CZ-HL-PRA-L-CDOPES
|
||||
ghcid_uuid: 93df1df2-b55b-50e1-aa91-2f01d98c37d7
|
||||
ghcid_uuid_sha256: 0c623fff-4d22-8192-1ce2-ea63343a0f2f
|
||||
ghcid_numeric: 892346041916084626
|
||||
record_id: ef77331d-980d-417a-9abe-d32f177a8e14
|
||||
generation_timestamp: '2025-12-06T23:37:15.370319+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDOPES
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CDOPES
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CDOPES
|
||||
ghcid_numeric: 892346041916084626
|
||||
valid_from: '2025-12-06T23:37:15.370319+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDOPES
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 93df1df2-b55b-50e1-aa91-2f01d98c37d7
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 0c623fff-4d22-8192-1ce2-ea63343a0f2f
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '892346041916084626'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: ef77331d-980d-417a-9abe-d32f177a8e14
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.908755+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908755+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Centrální dispečerská organizace propojených energetických soustav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908755+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908755+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908755+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE185
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.908755+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.370319+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 111 32
|
||||
street_address: Jungmannova 29
|
||||
normalization_timestamp: '2025-12-09T20:41:05.845268+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.845243+00:00'
|
||||
166
data/custodian.backup.20251230/CZ-10-PRA-L-CDS.yaml
Normal file
|
|
@ -0,0 +1,166 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČKD DUKLA, a.s.
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ČKD DUKLA, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF021
|
||||
locations:
|
||||
- street_address: Českomoravská 21
|
||||
postal_code: 190 05
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.802317+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDS
|
||||
ghcid_original: CZ-HL-PRA-L-ČDS
|
||||
ghcid_uuid: 4717b660-a36b-5e93-86a5-bd2a0e2cbaf2
|
||||
ghcid_uuid_sha256: 941ae1c7-9bb9-8f34-904d-a441c6c32476
|
||||
ghcid_numeric: 10672090514435149620
|
||||
record_id: 1d1b204d-f167-46b7-829f-1fbfea2ea1f3
|
||||
generation_timestamp: '2025-12-06T23:37:15.802317+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDS
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CDS
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CDS
|
||||
ghcid_numeric: 10672090514435149620
|
||||
valid_from: '2025-12-08T11:21:26.723638+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDS)'
|
||||
- ghcid: CZ-HL-PRA-L-ČDS
|
||||
ghcid_numeric: 8990548551803364279
|
||||
valid_from: '2025-12-06T23:37:15.802317+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:26.723638+00:00'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 4717b660-a36b-5e93-86a5-bd2a0e2cbaf2
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 941ae1c7-9bb9-8f34-904d-a441c6c32476
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '10672090514435149620'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 1d1b204d-f167-46b7-829f-1fbfea2ea1f3
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914285+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914285+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČKD DUKLA, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914285+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914285+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914285+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF021
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914285+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.802317+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 190 05
|
||||
street_address: Českomoravská 21
|
||||
normalization_timestamp: '2025-12-09T20:41:05.913228+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.913214+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CDSCLS.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: ČKD Dopravní systémy - ČKD Lokomotivka, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC002
|
||||
locations:
|
||||
- street_address: Českomoravská 205
|
||||
postal_code: 190 00
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.476508+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDSCLS
|
||||
ghcid_original: CZ-HL-PRA-L-ČDSČLS
|
||||
ghcid_uuid: 03c112af-44cd-58c6-9d03-6f1c8c533159
|
||||
ghcid_uuid_sha256: f4f8a01e-c0d3-87e4-ab3b-577f2490c2ec
|
||||
ghcid_numeric: 17652034793423189988
|
||||
record_id: 6415bb37-6c79-4475-9db9-1e6d9f3fba0c
|
||||
generation_timestamp: '2025-12-06T23:37:25.476508+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDSCLS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CDSCLS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CDSCLS
|
||||
ghcid_numeric: 17652034793423189988
|
||||
valid_from: '2025-12-08T11:21:39.771994+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSČLS)'
|
||||
- ghcid: CZ-HL-PRA-L-ČDSČLS
|
||||
ghcid_numeric: 14907291261378136606
|
||||
valid_from: '2025-12-06T23:37:25.476508+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:39.771994+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČKD Dopravní systémy - ČKD Lokomotivka, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDSCLS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 03c112af-44cd-58c6-9d03-6f1c8c533159
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: f4f8a01e-c0d3-87e4-ab3b-577f2490c2ec
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '17652034793423189988'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 6415bb37-6c79-4475-9db9-1e6d9f3fba0c
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.045814+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.045814+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČKD Dopravní systémy - ČKD Lokomotivka, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.045814+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.045814+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.045814+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC002
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.045814+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.476508+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 190 00
|
||||
street_address: Českomoravská 205
|
||||
normalization_timestamp: '2025-12-09T20:41:05.964036+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:05.964008+00:00'
|
||||
166
data/custodian.backup.20251230/CZ-10-PRA-L-CDSCTS.yaml
Normal file
|
|
@ -0,0 +1,166 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČKD Dopravní systémy - ČKD Trakce, a.s.
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ČKD Dopravní systémy - ČKD Trakce, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF026
|
||||
locations:
|
||||
- street_address: Kolbenova 40
|
||||
postal_code: 190 00
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.762860+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDSCTS
|
||||
ghcid_original: CZ-HL-PRA-L-ČDSČTS
|
||||
ghcid_uuid: 939b075a-63fe-5905-951a-74d436d37d64
|
||||
ghcid_uuid_sha256: 6f504902-f4f7-83b0-8cd9-7ac2b03daa22
|
||||
ghcid_numeric: 8020991213395477424
|
||||
record_id: e25f2850-1721-40b3-ab48-c0631658ca67
|
||||
generation_timestamp: '2025-12-06T23:37:25.762860+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDSCTS
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CDSCTS
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CDSCTS
|
||||
ghcid_numeric: 8020991213395477424
|
||||
valid_from: '2025-12-08T11:21:33.916666+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSČTS)'
|
||||
- ghcid: CZ-HL-PRA-L-ČDSČTS
|
||||
ghcid_numeric: 10608205474449462001
|
||||
valid_from: '2025-12-06T23:37:25.762860+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:33.916666+00:00'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDSCTS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 939b075a-63fe-5905-951a-74d436d37d64
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 6f504902-f4f7-83b0-8cd9-7ac2b03daa22
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '8020991213395477424'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: e25f2850-1721-40b3-ab48-c0631658ca67
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.047807+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047807+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČKD Dopravní systémy - ČKD Trakce, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047807+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047807+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047807+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF026
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047807+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.762860+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 190 00
|
||||
street_address: Kolbenova 40
|
||||
normalization_timestamp: '2025-12-09T20:41:06.020234+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.020205+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CDSS.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: ČKD Dopravní systémy, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF025
|
||||
locations:
|
||||
- street_address: Chabská 6
|
||||
postal_code: 155 41
|
||||
city: Praha 5 - Zličín
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.811932+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDSS
|
||||
ghcid_original: CZ-HL-PRA-L-ČDSS
|
||||
ghcid_uuid: c8a463f0-4628-5712-a4d2-03fbb449c631
|
||||
ghcid_uuid_sha256: 34e3d3cc-a477-8d52-892a-988fadd5ba4b
|
||||
ghcid_numeric: 3811122585571683666
|
||||
record_id: 547d3bb1-b8b1-4574-821f-0bcb62163961
|
||||
generation_timestamp: '2025-12-06T23:37:15.811932+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDSS
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CDSS
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CDSS
|
||||
ghcid_numeric: 3811122585571683666
|
||||
valid_from: '2025-12-08T11:21:21.353542+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSS)'
|
||||
- ghcid: CZ-HL-PRA-L-ČDSS
|
||||
ghcid_numeric: 13591157510468734875
|
||||
valid_from: '2025-12-06T23:37:15.811932+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.353542+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČKD Dopravní systémy, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDSS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: c8a463f0-4628-5712-a4d2-03fbb449c631
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 34e3d3cc-a477-8d52-892a-988fadd5ba4b
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '3811122585571683666'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 547d3bb1-b8b1-4574-821f-0bcb62163961
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914362+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914362+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČKD Dopravní systémy, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914362+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914362+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 5 - Zličín
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914362+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF025
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914362+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.811932+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 155 41
|
||||
street_address: Chabská 6
|
||||
normalization_timestamp: '2025-12-09T20:41:06.063750+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.063728+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CDSVZPOKV.yaml
Normal file
|
|
@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: České dráhy - Správa výstavby železnic Praha - odbor koncepce a výhledu
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE103
|
||||
locations:
|
||||
- street_address: Italská 45
|
||||
postal_code: 121 31
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.256615+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDSVZPOKV
|
||||
ghcid_original: CZ-HL-PRA-L-ČDSVŽPOKV
|
||||
ghcid_uuid: d184e724-ad62-5b75-a412-96575f9db4d6
|
||||
ghcid_uuid_sha256: df9f1b2b-ea42-8e8c-bfae-51de5d291665
|
||||
ghcid_numeric: 16113627867182718604
|
||||
record_id: 002b54b4-5382-4745-bc71-fc9577a3d54f
|
||||
generation_timestamp: '2025-12-06T23:37:15.256615+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDSVZPOKV
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CDSVZPOKV
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CDSVZPOKV
|
||||
ghcid_numeric: 16113627867182718604
|
||||
valid_from: '2025-12-08T11:21:26.472989+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSVŽPOKV)'
|
||||
- ghcid: CZ-HL-PRA-L-ČDSVŽPOKV
|
||||
ghcid_numeric: 12616989030595101585
|
||||
valid_from: '2025-12-06T23:37:15.256615+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:26.472989+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: České dráhy - Správa výstavby železnic Praha - odbor koncepce a výhledu
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDSVZPOKV
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: d184e724-ad62-5b75-a412-96575f9db4d6
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: df9f1b2b-ea42-8e8c-bfae-51de5d291665
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '16113627867182718604'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 002b54b4-5382-4745-bc71-fc9577a3d54f
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.907676+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.907676+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: České dráhy - Správa výstavby železnic Praha - odbor koncepce a výhledu
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.907676+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.907676+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.907676+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE103
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.907676+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.256615+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 121 31
|
||||
street_address: Italská 45
|
||||
normalization_timestamp: '2025-12-09T20:41:06.110303+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.110275+00:00'
|
||||
258
data/custodian.backup.20251230/CZ-10-PRA-L-CDZUKCD.yaml
Normal file
|
|
@ -0,0 +1,258 @@
|
|||
original_entry:
|
||||
name: Cesta domů, z.ú. - Knihovna Cesty domů
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE288
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: '26528843'
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ26528843
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://raks.mlp.cz/library/cestydomu?device=1308
|
||||
identifier_url: https://raks.mlp.cz/library/cestydomu?device=1308
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114619385
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114619385
|
||||
locations:
|
||||
- street_address: Heleny Kočvarové 1
|
||||
postal_code: 140 00
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.053757
|
||||
longitude: 14.454272
|
||||
processing_timestamp: '2025-12-06T23:37:26.110340+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CDZUKCD
|
||||
ghcid_original: CZ-HL-PRA-L-CDZÚKCD
|
||||
ghcid_uuid: 7d9842ae-57b4-5afc-ac24-4e92b583320e
|
||||
ghcid_uuid_sha256: b0479407-ba06-8eff-8765-3748b7af304a
|
||||
ghcid_numeric: 12702284034928430847
|
||||
record_id: 569a8cc3-8623-4865-b078-f5f358b8dac8
|
||||
generation_timestamp: '2025-12-06T23:37:26.110340+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CDZUKCD
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CDZUKCD
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CDZUKCD
|
||||
ghcid_numeric: 12702284034928430847
|
||||
valid_from: '2025-12-08T11:21:37.155204+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-CDZÚKCD)'
|
||||
- ghcid: CZ-HL-PRA-L-CDZÚKCD
|
||||
ghcid_numeric: 1611998794903873172
|
||||
valid_from: '2025-12-06T23:37:26.110340+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:37.155204+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Cesta domů, z.ú. - Knihovna Cesty domů
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CDZUKCD
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 7d9842ae-57b4-5afc-ac24-4e92b583320e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b0479407-ba06-8eff-8765-3748b7af304a
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12702284034928430847'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 569a8cc3-8623-4865-b078-f5f358b8dac8
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.050046+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-uyxljZX/ehXA7EicsBnW5SpiqATAabtCWAOUuIFLxJo=
|
||||
verified_at: '2025-12-28T19:58:23.516984+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619385
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.517020+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.050046+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Cesta domů, z.ú. - Knihovna Cesty domů
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.050046+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.050046+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 4
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.050046+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114619385
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114619385
|
||||
timestamp: '2025-11-19T10:14:30.050046+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE288
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.050046+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:26.110340+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114619385
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:41.458852+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
- P463
|
||||
wikidata_labels:
|
||||
cs: Cesta domů, z.ú., Knihovna Cesty domů
|
||||
en: Home Care Hospice Cesta domů and Advisory Centre, Library
|
||||
wikidata_label_en: Home Care Hospice Cesta domů and Advisory Centre, Library
|
||||
wikidata_label_cs: Cesta domů, z.ú., Knihovna Cesty domů
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q6150991
|
||||
label: medical library
|
||||
description: library that focuses on medical or health information
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q2686587
|
||||
label: Prague 4
|
||||
description: municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_organization:
|
||||
member_of:
|
||||
- id: Q4809890
|
||||
label: Association of Library and Information Professionals of the Czech Republic
|
||||
description: organization
|
||||
wikidata_contact:
|
||||
email: mailto:knihovna@cestadomu.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-uyxljZX/ehXA7EicsBnW5SpiqATAabtCWAOUuIFLxJo=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.516984+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619385
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.517010+00:00'
|
||||
location:
|
||||
latitude: 50.053757
|
||||
longitude: 14.454272
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 4
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 140 00
|
||||
street_address: Heleny Kočvarové 1
|
||||
normalization_timestamp: '2025-12-09T06:50:48.166008+00:00'
|
||||
266
data/custodian.backup.20251230/CZ-10-PRA-L-CEKJK.yaml
Normal file
|
|
@ -0,0 +1,266 @@
|
|||
original_entry:
|
||||
name: CERGE-EI knihovna Jana Kmenty
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB005
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00216208
|
||||
- &id003
|
||||
identifier_scheme: DIČ
|
||||
identifier_value: CZ00216208
|
||||
- &id004
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://www.cerge-ei.cz/cs/knihovna
|
||||
identifier_url: https://www.cerge-ei.cz/cs/knihovna
|
||||
- &id005
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617537
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617537
|
||||
locations:
|
||||
- street_address: Politických vězňů 7
|
||||
postal_code: 111 21
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.083233
|
||||
longitude: 14.429039
|
||||
processing_timestamp: '2025-12-06T23:35:06.269914+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CEKJK
|
||||
ghcid_original: CZ-HL-PRA-L-CEKJK
|
||||
ghcid_uuid: 7c1d5854-eedc-5e41-830d-5c1a3f22bee1
|
||||
ghcid_uuid_sha256: b3925072-f548-81b2-f37c-a560d63bf363
|
||||
ghcid_numeric: 12939493134060507570
|
||||
record_id: 2d687a48-630e-4b43-8179-1027f3fe1a8b
|
||||
generation_timestamp: '2025-12-06T23:35:06.269914+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CEKJK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CEKJK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CEKJK
|
||||
ghcid_numeric: 12939493134060507570
|
||||
valid_from: '2025-12-06T23:35:06.269914+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: CERGE-EI knihovna Jana Kmenty
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CEKJK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 7c1d5854-eedc-5e41-830d-5c1a3f22bee1
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b3925072-f548-81b2-f37c-a560d63bf363
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12939493134060507570'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 2d687a48-630e-4b43-8179-1027f3fe1a8b
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
- *id005
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.796972+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-8mQ/wjiQEl+5oUWSFyMM+Cr/yQYAW/e5lI7abSoyAXU=
|
||||
verified_at: '2025-12-28T19:58:23.537241+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617537
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.537260+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796972+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: CERGE-EI knihovna Jana Kmenty
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796972+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796972+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796972+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617537
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617537
|
||||
timestamp: '2025-11-19T10:14:29.796972+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB005
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.796972+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:35:06.269914+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617537
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:49.559323+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: CERGE-EI knihovna Jana Kmenty
|
||||
en: Jan Kmenta CERGE-EI Library
|
||||
wikidata_label_en: Jan Kmenta CERGE-EI Library
|
||||
wikidata_label_cs: CERGE-EI knihovna Jana Kmenty
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id006
|
||||
- id: Q1438040
|
||||
label: research library
|
||||
description: library that collects materials on one or more subjects to support scholarly or scientific research
|
||||
wikidata_instance_of: *id006
|
||||
wikidata_location:
|
||||
country: &id007
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id007
|
||||
wikidata_contact:
|
||||
email: mailto:library@cerge-ei.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-8mQ/wjiQEl+5oUWSFyMM+Cr/yQYAW/e5lI7abSoyAXU=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.537241+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617537
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.537250+00:00'
|
||||
location:
|
||||
latitude: 50.083233
|
||||
longitude: 14.429039
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 111 21
|
||||
street_address: Politických vězňů 7
|
||||
normalization_timestamp: '2025-12-09T06:50:48.191069+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T22:00:01.805561+00:00'
|
||||
source_url: https://www.cerge-ei.cz/cs/knihovna
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://www.cerge-ei.cz/favicon.ico
|
||||
source_url: https://www.cerge-ei.cz/cs/knihovna
|
||||
css_selector: '[document] > html > head > link:nth-of-type(3)'
|
||||
retrieved_on: '2025-12-23T22:00:01.805561+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/vnd.microsoft.icon
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CES.yaml
Normal file
|
|
@@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: ČKD Elektrotechnika, a.s.
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF023
|
||||
locations:
|
||||
- street_address: U Kolbenky 159
|
||||
postal_code: 190 00
|
||||
city: Praha 9
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.760334+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CES
|
||||
ghcid_original: CZ-HL-PRA-L-ČES
|
||||
ghcid_uuid: 3008a3a2-1521-5246-a2cd-e7329405be87
|
||||
ghcid_uuid_sha256: faa39735-7625-842a-8573-45cc88b567d4
|
||||
ghcid_numeric: 18060445186557060138
|
||||
record_id: 94c290d6-fca3-4ebd-b390-c83112c6721d
|
||||
generation_timestamp: '2025-12-06T23:37:25.760334+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CES
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CES
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CES
|
||||
ghcid_numeric: 18060445186557060138
|
||||
valid_from: '2025-12-08T11:21:23.607373+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČES)'
|
||||
- ghcid: CZ-HL-PRA-L-ČES
|
||||
ghcid_numeric: 1379789753405388328
|
||||
valid_from: '2025-12-06T23:37:25.760334+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:23.607373+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČKD Elektrotechnika, a.s.
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CES
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 3008a3a2-1521-5246-a2cd-e7329405be87
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: faa39735-7625-842a-8573-45cc88b567d4
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '18060445186557060138'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 94c290d6-fca3-4ebd-b390-c83112c6721d
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.047788+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047788+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČKD Elektrotechnika, a.s.
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047788+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047788+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 9
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047788+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF023
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.047788+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.760334+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 190 00
|
||||
street_address: U Kolbenky 159
|
||||
normalization_timestamp: '2025-12-09T20:41:06.157711+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.157684+00:00'
|
||||
166
data/custodian.backup.20251230/CZ-10-PRA-L-CEU.yaml
Normal file
|
|
@@ -0,0 +1,166 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČSAV - Entomologický ústav
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ČSAV - Entomologický ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB008
|
||||
locations:
|
||||
- street_address: Mánesova 55
|
||||
postal_code: 120 00
|
||||
city: Praha 2
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.516148+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CEU
|
||||
ghcid_original: CZ-HL-PRA-L-ČEÚ
|
||||
ghcid_uuid: 504431ec-ac7b-5800-a430-bd4bd2d6cf07
|
||||
ghcid_uuid_sha256: 0354bb33-d832-8ba5-a76e-4e3210db321c
|
||||
ghcid_numeric: 240022511502416805
|
||||
record_id: db702a6d-f686-4857-b3df-0462837fd430
|
||||
generation_timestamp: '2025-12-06T23:37:25.516148+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CEU
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CEU
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CEU
|
||||
ghcid_numeric: 240022511502416805
|
||||
valid_from: '2025-12-08T11:21:23.566900+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČEÚ)'
|
||||
- ghcid: CZ-HL-PRA-L-ČEÚ
|
||||
ghcid_numeric: 10740319947005259027
|
||||
valid_from: '2025-12-06T23:37:25.516148+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:23.566900+00:00'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CEU
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 504431ec-ac7b-5800-a430-bd4bd2d6cf07
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 0354bb33-d832-8ba5-a76e-4e3210db321c
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '240022511502416805'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: db702a6d-f686-4857-b3df-0462837fd430
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046042+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046042+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČSAV - Entomologický ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046042+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046042+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 2
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046042+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB008
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046042+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.516148+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 120 00
|
||||
street_address: Mánesova 55
|
||||
normalization_timestamp: '2025-12-09T20:41:06.197560+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.197537+00:00'
|
||||
163
data/custodian.backup.20251230/CZ-10-PRA-L-CEUEML.yaml
Normal file
|
|
@@ -0,0 +1,163 @@
|
|||
original_entry:
|
||||
name: ČSAV - Ekonomický ústav - Ekonomicko-matematická laboratoř
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB006
|
||||
locations:
|
||||
- postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.500404+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CEUEML
|
||||
ghcid_original: CZ-HL-PRA-L-ČEÚEML
|
||||
ghcid_uuid: b3ecd9fb-08cb-5b6d-9620-01ecd8ec398e
|
||||
ghcid_uuid_sha256: 8e609de9-9345-8dd4-a723-695e6ce433dd
|
||||
ghcid_numeric: 10259373577673739732
|
||||
record_id: 11c7b490-8879-47e4-bdc8-4c3f778228c1
|
||||
generation_timestamp: '2025-12-06T23:37:25.500404+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CEUEML
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CEUEML
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CEUEML
|
||||
ghcid_numeric: 10259373577673739732
|
||||
valid_from: '2025-12-08T11:21:41.451434+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČEÚEML)'
|
||||
- ghcid: CZ-HL-PRA-L-ČEÚEML
|
||||
ghcid_numeric: 1512922320452027432
|
||||
valid_from: '2025-12-06T23:37:25.500404+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:41.451434+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČSAV - Ekonomický ústav - Ekonomicko-matematická laboratoř
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CEUEML
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: b3ecd9fb-08cb-5b6d-9620-01ecd8ec398e
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 8e609de9-9345-8dd4-a723-695e6ce433dd
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '10259373577673739732'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 11c7b490-8879-47e4-bdc8-4c3f778228c1
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046001+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046001+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČSAV - Ekonomický ústav - Ekonomicko-matematická laboratoř
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046001+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046001+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046001+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB006
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046001+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.500404+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
normalization_timestamp: '2025-12-09T20:41:06.242627+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.242612+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CFU.yaml
Normal file
|
|
@@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Československý filmový ústav
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC117
|
||||
locations:
|
||||
- street_address: Národní 40
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.611839+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CFU
|
||||
ghcid_original: CZ-HL-PRA-L-ČFÚ
|
||||
ghcid_uuid: ecf730c1-9b99-5072-9d7d-db744bc75a6d
|
||||
ghcid_uuid_sha256: 0fbc3c26-a026-88c1-beae-a8f2a427a7d7
|
||||
ghcid_numeric: 1133847342783834305
|
||||
record_id: d88ba827-90e8-4d79-8758-6f99098470a1
|
||||
generation_timestamp: '2025-12-06T23:37:25.611839+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CFU
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CFU
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CFU
|
||||
ghcid_numeric: 1133847342783834305
|
||||
valid_from: '2025-12-08T11:21:31.115869+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČFÚ)'
|
||||
- ghcid: CZ-HL-PRA-L-ČFÚ
|
||||
ghcid_numeric: 14053249839522649669
|
||||
valid_from: '2025-12-06T23:37:25.611839+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:31.115869+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Československý filmový ústav
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CFU
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: ecf730c1-9b99-5072-9d7d-db744bc75a6d
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 0fbc3c26-a026-88c1-beae-a8f2a427a7d7
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '1133847342783834305'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: d88ba827-90e8-4d79-8758-6f99098470a1
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046736+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046736+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Československý filmový ústav
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046736+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046736+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046736+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC117
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046736+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.611839+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Národní 40
|
||||
normalization_timestamp: '2025-12-09T20:41:06.286052+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.286029+00:00'
|
||||
276
data/custodian.backup.20251230/CZ-10-PRA-L-CFUPVVSV.yaml
Normal file
|
|
@@ -0,0 +1,276 @@
|
|||
original_entry:
|
||||
name: CEFRES - Francouzský ústav pro výzkum ve společenských vědách
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC163
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: neuvedeno
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://cefres.kpsys.cz/#!/
|
||||
identifier_url: https://cefres.kpsys.cz/#!/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617627
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617627
|
||||
locations:
|
||||
- street_address: Na Florenci 3
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.088931
|
||||
longitude: 14.433407
|
||||
processing_timestamp: '2025-12-06T23:37:14.629294+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CFUPVVSV
|
||||
ghcid_original: CZ-HL-PRA-L-CFÚPVVSV
|
||||
ghcid_uuid: d4761e5f-b824-5577-9836-cccfeab220cc
|
||||
ghcid_uuid_sha256: 215b886d-e14d-839c-88e9-1c5dc4a47909
|
||||
ghcid_numeric: 2403664831645053852
|
||||
record_id: 75be2349-64a5-400a-b210-15b115273345
|
||||
generation_timestamp: '2025-12-06T23:37:14.629294+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CFUPVVSV
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CFUPVVSV
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CFUPVVSV
|
||||
ghcid_numeric: 2403664831645053852
|
||||
valid_from: '2025-12-08T11:21:29.529071+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-CFÚPVVSV)'
|
||||
- ghcid: CZ-HL-PRA-L-CFÚPVVSV
|
||||
ghcid_numeric: 1416280962864530885
|
||||
valid_from: '2025-12-06T23:37:14.629294+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:29.529071+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: CEFRES - Francouzský ústav pro výzkum ve společenských vědách
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CFUPVVSV
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: d4761e5f-b824-5577-9836-cccfeab220cc
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 215b886d-e14d-839c-88e9-1c5dc4a47909
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '2403664831645053852'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 75be2349-64a5-400a-b210-15b115273345
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.803999+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-6nroyhIMUTL56XzH1D+9ek2r604EZrx9EOMKi2yRxSo=
|
||||
verified_at: '2025-12-28T19:58:23.597078+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617627
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.597094+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803999+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: CEFRES - Francouzský ústav pro výzkum ve společenských vědách
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803999+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803999+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803999+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617627
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617627
|
||||
timestamp: '2025-11-19T10:14:29.803999+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC163
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.803999+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.629294+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617627
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:27:57.590410+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: CEFRES, Francouzský ústav pro výzkum ve společenských vědách, knihovna
|
||||
fr: bibliothèque du CEFRES
|
||||
mul: Bibliothèque du CEFRES
|
||||
wikidata_label_fr: bibliothèque du CEFRES
|
||||
wikidata_label_cs: CEFRES, Francouzský ústav pro výzkum ve společenských vědách, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
fr: bibliothèque scientifique à Prague
|
||||
en: library in Prague
|
||||
wikidata_description_en: library in Prague
|
||||
wikidata_aliases:
|
||||
fr:
|
||||
- bibliothèque du Centre français de recherche en sciences sociales
|
||||
mul:
|
||||
- Bibliothèque du Centre français de recherche en sciences sociales
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q1438040
|
||||
label: research library
|
||||
description: library that collects materials on one or more subjects to support scholarly or scientific research
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:claire@cefres.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-6nroyhIMUTL56XzH1D+9ek2r604EZrx9EOMKi2yRxSo=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.597078+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617627
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.597085+00:00'
|
||||
location:
|
||||
latitude: 50.088931
|
||||
longitude: 14.433407
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Na Florenci 3
|
||||
normalization_timestamp: '2025-12-09T06:50:48.297317+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T22:00:16.423383+00:00'
|
||||
source_url: https://cefres.kpsys.cz/#!
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://cefres.kpsys.cz/favicon.png?v=2.3.0-32021
|
||||
source_url: https://cefres.kpsys.cz/#!
|
||||
css_selector: '#ng-app > head > link:nth-of-type(9)'
|
||||
retrieved_on: '2025-12-23T22:00:16.423383+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: image/png
|
||||
favicon_sizes: 256x256
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 1
|
||||
267
data/custodian.backup.20251230/CZ-10-PRA-L-CGSK.yaml
Normal file
@@ -0,0 +1,267 @@
original_entry:
|
||||
name: Česká geologická služba - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABC023
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00025798
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
|
||||
identifier_url: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617601
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617601
|
||||
locations:
|
||||
- street_address: Klárov 3/131
|
||||
postal_code: 118 21
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.091722
|
||||
longitude: 14.409786
|
||||
processing_timestamp: '2025-12-06T23:37:14.421598+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CGSK
|
||||
ghcid_original: CZ-HL-PRA-L-ČGSK
|
||||
ghcid_uuid: 48ef0c1d-6c9e-51e7-94e0-c2d1d898ea88
|
||||
ghcid_uuid_sha256: d8231fa2-9b99-8b7d-aa55-30380a785fe6
|
||||
ghcid_numeric: 15574326719633001341
|
||||
record_id: bd17fa8c-fd47-46f5-98be-66c76dedb878
|
||||
generation_timestamp: '2025-12-06T23:37:14.421598+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CGSK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CGSK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CGSK
|
||||
ghcid_numeric: 15574326719633001341
|
||||
valid_from: '2025-12-08T11:21:30.084883+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČGSK)'
|
||||
- ghcid: CZ-HL-PRA-L-ČGSK
|
||||
ghcid_numeric: 10769415008464129773
|
||||
valid_from: '2025-12-06T23:37:14.421598+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:30.084883+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Česká geologická služba - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CGSK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 48ef0c1d-6c9e-51e7-94e0-c2d1d898ea88
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: d8231fa2-9b99-8b7d-aa55-30380a785fe6
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '15574326719633001341'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: bd17fa8c-fd47-46f5-98be-66c76dedb878
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.801378+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-oj3JlTi5DQWD5zLxOMay+soc3UFUP0XsdO0Fu8qh3Eg=
|
||||
verified_at: '2025-12-28T19:58:23.618005+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617601
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.618025+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.801378+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Česká geologická služba - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.801378+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.801378+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.801378+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617601
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617601
|
||||
timestamp: '2025-11-19T10:14:29.801378+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABC023
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.801378+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:14.421598+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617601
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:28:05.533308+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Česká geologická služba, knihovna
|
||||
en: Czech Geological Survey
|
||||
wikidata_label_en: Czech Geological Survey
|
||||
wikidata_label_cs: Česká geologická služba, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q1438040
|
||||
label: research library
|
||||
description: library that collects materials on one or more subjects to support scholarly or scientific research
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q973974
|
||||
label: Prague 1
|
||||
description: administrative district, municipal district and municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:knihovna@geology.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-oj3JlTi5DQWD5zLxOMay+soc3UFUP0XsdO0Fu8qh3Eg=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.618005+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617601
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.618015+00:00'
|
||||
location:
|
||||
latitude: 50.091722
|
||||
longitude: 14.409786
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 118 21
|
||||
street_address: Klárov 3/131
|
||||
normalization_timestamp: '2025-12-09T06:50:48.318122+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T22:00:25.647871+00:00'
|
||||
source_url: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://cgs.gov.cz/themes/star_cgs/assets/img/favicon/safari-pinned-tab.svg
|
||||
source_url: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
|
||||
css_selector: '[document] > html.js > head > link:nth-of-type(9)'
|
||||
retrieved_on: '2025-12-23T22:00:25.647871+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: ''
|
||||
summary:
|
||||
total_claims: 1
|
||||
has_primary_logo: false
|
||||
has_favicon: true
|
||||
has_og_image: false
|
||||
favicon_count: 4
|
||||
152
data/custodian.backup.20251230/CZ-10-PRA-L-CGUMS.yaml
Normal file
@@ -0,0 +1,152 @@
original_entry:
|
||||
name: ČSAV - Geografický ústav - Mapová sbírka
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB014
|
||||
locations: []
|
||||
processing_timestamp: '2025-12-06T23:37:25.519173+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CGUMS
|
||||
ghcid_original: XX-XX-XXX-L-ČGÚMS
|
||||
ghcid_uuid: 59693ad2-a909-598f-aeb0-4676e4c92ac9
|
||||
ghcid_uuid_sha256: b3d06caa-9df4-8f35-87e4-f49890fe973e
|
||||
ghcid_numeric: 12956975607995236149
|
||||
record_id: 6b124526-af21-4124-8ca8-0ca145b8a250
|
||||
generation_timestamp: '2025-12-06T23:37:25.519173+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: NAME_PATTERN_EXTRACTION
|
||||
resolution_timestamp: '2025-12-06T23:56:13.377356+00:00'
|
||||
resolution_notes: City 'ČSAV' extracted from institution name
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CGUMS
|
||||
ghcid_numeric: 12956975607995236149
|
||||
valid_from: '2025-12-08T11:21:21.983376+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-ČGÚMS)'
|
||||
- ghcid: XX-XX-XXX-L-ČGÚMS
|
||||
ghcid_numeric: 12855336241998218597
|
||||
valid_from: '2025-12-06T23:37:25.519173+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:21.983376+00:00'
|
||||
- ghcid: CZ-XX-XXX-L-ČGÚMS
|
||||
valid_from: '2025-12-06T23:56:13.377367+00:00'
|
||||
reason: 'Country resolved via Wikidata P17: XX→CZ'
|
||||
- ghcid: CZ-10-PRA-L-ČGÚMS
|
||||
valid_from: '2025-12-07T12:38:22.578582+00:00'
|
||||
reason: 'Location resolved from institution name pattern: ''ČSAV'' → region 10, city PRA'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČSAV - Geografický ústav - Mapová sbírka
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CGUMS
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 59693ad2-a909-598f-aeb0-4676e4c92ac9
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: b3d06caa-9df4-8f35-87e4-f49890fe973e
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12956975607995236149'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 6b124526-af21-4124-8ca8-0ca145b8a250
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046059+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- 'Country resolved 2025-12-06T23:56:13Z: XX→CZ via Wikidata P17'
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
|
||||
- 'YouTube/Google Maps enrichment 2025-12-09T09:28:27Z: YouTube: not found'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046059+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČSAV - Geografický ústav - Mapová sbírka
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046059+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046059+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB014
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046059+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.519173+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
region_code: 10
|
||||
country: CZ
|
||||
normalization_timestamp: '2025-12-09T20:40:58.958925+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
city: Prague
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:40:58.958906+00:00'
|
||||
youtube_status: NOT_FOUND
|
||||
youtube_search_query: ČSAV - Geografický ústav - Mapová sbírka official
|
||||
youtube_search_timestamp: '2025-12-09T09:28:27.414759+00:00'
|
||||
284
data/custodian.backup.20251230/CZ-10-PRA-L-CHVDISK.yaml
Normal file
@@ -0,0 +1,284 @@
original_entry:
|
||||
name: Český helsinský výbor - Dokumentační a informační středisko - Knihovna
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABE176
|
||||
- &id002
|
||||
identifier_scheme: IČO
|
||||
identifier_value: 00539708
|
||||
- &id003
|
||||
identifier_scheme: Website
|
||||
identifier_value: https://helcom.cz/
|
||||
identifier_url: https://helcom.cz/
|
||||
- &id004
|
||||
identifier_scheme: Wikidata
|
||||
identifier_value: Q114617893
|
||||
identifier_url: https://www.wikidata.org/wiki/Q114617893
|
||||
locations:
|
||||
- street_address: Štefánikova 216 (budova Úřadu práce MČ P-5)
|
||||
postal_code: 150 00
|
||||
city: Praha 5
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
latitude: 50.074003
|
||||
longitude: 14.403544
|
||||
processing_timestamp: '2025-12-06T23:37:16.685316+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CHVDISK
|
||||
ghcid_original: CZ-HL-PRA-L-ČHVDISK
|
||||
ghcid_uuid: 43a6b403-88de-5b82-bd5a-e4f57a21499f
|
||||
ghcid_uuid_sha256: ad1d63b6-02f3-85bd-a40f-7937f5633624
|
||||
ghcid_numeric: 12474236176270820797
|
||||
record_id: 7b2f3e1f-bcaa-48bf-bbf3-4581fc90812f
|
||||
generation_timestamp: '2025-12-06T23:37:16.685316+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CHVDISK
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to:
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CHVDISK
|
||||
valid_from:
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CHVDISK
|
||||
ghcid_numeric: 12474236176270820797
|
||||
valid_from: '2025-12-08T11:21:29.433582+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČHVDISK)'
|
||||
- ghcid: CZ-HL-PRA-L-ČHVDISK
|
||||
ghcid_numeric: 16044111670572416104
|
||||
valid_from: '2025-12-06T23:37:16.685316+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:29.433582+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Český helsinský výbor - Dokumentační a informační středisko - Knihovna
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CHVDISK
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 43a6b403-88de-5b82-bd5a-e4f57a21499f
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: ad1d63b6-02f3-85bd-a40f-7937f5633624
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '12474236176270820797'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 7b2f3e1f-bcaa-48bf-bbf3-4581fc90812f
|
||||
- *id001
|
||||
- *id002
|
||||
- *id003
|
||||
- *id004
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.924463+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
schema_version: 2.0.0
|
||||
enrichment_provenance:
|
||||
wikidata_enrichment:
|
||||
content_hash: sha256-Y14TjAbdy6m5GqNpEVxY2uzIEqVxK6JjuvP/gG/ogFY=
|
||||
verified_at: '2025-12-28T19:58:23.649479+00:00'
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617893
|
||||
provenance_schema_version: '2.0'
|
||||
standards_compliance:
|
||||
- W3C PROV-O
|
||||
- W3C SRI (content hashes)
|
||||
generated_at: '2025-12-28T19:58:23.649498+00:00'
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.924463+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Český helsinský výbor - Dokumentační a informační středisko - Knihovna
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.924463+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.924463+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 5
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.924463+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: wikidata_id
|
||||
claim_value: Q114617893
|
||||
property_uri: owl:sameAs
|
||||
provenance:
|
||||
namespace: wikidata
|
||||
path: /entity/Q114617893
|
||||
timestamp: '2025-11-19T10:14:29.924463+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.98
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABE176
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.924463+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:16.685316+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
wikidata_enrichment:
|
||||
wikidata_entity_id: Q114617893
|
||||
api_metadata:
|
||||
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
fetch_timestamp: '2025-12-18T22:28:13.493073+00:00'
|
||||
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
|
||||
enrichment_version: 2.1_generic
|
||||
properties_found:
|
||||
- P31
|
||||
- P17
|
||||
- P159
|
||||
- P968
|
||||
wikidata_labels:
|
||||
cs: Český helsinský výbor, Dokumentační a informační středisko, knihovna
|
||||
en: Czech Helsinki Committee, Documentation and Information Centre, Library
|
||||
wikidata_label_en: Czech Helsinki Committee, Documentation and Information Centre, Library
|
||||
wikidata_label_cs: Český helsinský výbor, Dokumentační a informační středisko, knihovna
|
||||
wikidata_descriptions:
|
||||
cs: knihovna v Praze
|
||||
wikidata_aliases:
|
||||
cs:
|
||||
- DIS ČHV
|
||||
wikidata_classification:
|
||||
instance_of: &id005
|
||||
- id: Q385994
|
||||
label: special library
|
||||
description: specialized library providing resources on a particular topic or discipline
|
||||
wikidata_instance_of: *id005
|
||||
wikidata_location:
|
||||
country: &id006
|
||||
id: Q213
|
||||
label: Czech Republic
|
||||
description: country in Central Europe
|
||||
headquarters_location:
|
||||
id: Q577602
|
||||
label: Prague 5
|
||||
description: municipal part of Prague
|
||||
wikidata_country: *id006
|
||||
wikidata_contact:
|
||||
email: mailto:info@helcom.cz
|
||||
_provenance:
|
||||
content_hash:
|
||||
algorithm: sha256
|
||||
value: sha256-Y14TjAbdy6m5GqNpEVxY2uzIEqVxK6JjuvP/gG/ogFY=
|
||||
scope: enrichment_section
|
||||
computed_at: '2025-12-28T19:58:23.649479+00:00'
|
||||
prov:
|
||||
wasGeneratedBy:
|
||||
'@type': prov:Activity
|
||||
name: wikidata_api_fetch
|
||||
used: https://www.wikidata.org/w/rest.php/wikibase/v1
|
||||
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617893
|
||||
verification:
|
||||
status: verified
|
||||
last_verified: '2025-12-28T19:58:23.649489+00:00'
|
||||
location:
|
||||
latitude: 50.074003
|
||||
longitude: 14.403544
|
||||
coordinate_provenance:
|
||||
source_type: ORIGINAL_ENTRY
|
||||
source_path: original_entry.locations[0]
|
||||
city: Praha 5
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 150 00
|
||||
street_address: Štefánikova 216 (budova Úřadu práce MČ P-5)
|
||||
normalization_timestamp: '2025-12-09T06:50:48.344961+00:00'
|
||||
logo_enrichment:
|
||||
enrichment_timestamp: '2025-12-23T22:00:34.684584+00:00'
|
||||
source_url: https://helcom.cz
|
||||
extraction_method: crawl4ai
|
||||
claims:
|
||||
- claim_type: logo_url
|
||||
claim_value: https://helcom.cz/wp-content/uploads/2022/12/Logo_CHV_motto_cz.svg
|
||||
source_url: https://helcom.cz
|
||||
css_selector: '#et-boc > header.et-l.et-l--header > div.et_builder_inner_content.et_pb_gutters3 > div.et_pb_section.et_pb_section_0_tb_header > div.et_pb_with_border.et_pb_row > div.et_pb_column.et_pb_column_1_3 > div.et_pb_module.et_pb_image > a > span.et_pb_image_wrap > img.wp-image-111102'
|
||||
retrieved_on: '2025-12-23T22:00:34.684584+00:00'
|
||||
extraction_method: crawl4ai_header_logo
|
||||
detection_confidence: high
|
||||
alt_text: flexile-white-logo
|
||||
- claim_type: favicon_url
|
||||
claim_value: https://helcom.cz/wp-content/uploads/2022/09/favicon-1.png
|
||||
source_url: https://helcom.cz
|
||||
css_selector: '[document] > html.js > head > link:nth-of-type(19)'
|
||||
retrieved_on: '2025-12-23T22:00:34.684584+00:00'
|
||||
extraction_method: crawl4ai_link_rel
|
||||
favicon_type: ''
|
||||
favicon_sizes: 32x32
|
||||
- claim_type: og_image_url
|
||||
claim_value: https://helcom.cz/wp-content/uploads/2022/08/Logo_CHV_cz.jpg
|
||||
source_url: https://helcom.cz
|
||||
css_selector: '[document] > html.js > head > meta:nth-of-type(12)'
|
||||
retrieved_on: '2025-12-23T22:00:34.684584+00:00'
|
||||
extraction_method: crawl4ai_meta_og
|
||||
summary:
|
||||
total_claims: 3
|
||||
has_primary_logo: true
|
||||
has_favicon: true
|
||||
has_og_image: true
|
||||
favicon_count: 1
|
||||
166
data/custodian.backup.20251230/CZ-10-PRA-L-CKCJ.yaml
Normal file
|
|
@@ -0,0 +1,166 @@
|
|||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: ČSAV - Kabinet cizích jazyků
|
||||
source_type: ch_annotator
|
||||
original_entry:
|
||||
name: ČSAV - Kabinet cizích jazyků
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABB021
|
||||
locations:
|
||||
- street_address: Valentinská 1
|
||||
postal_code: 110 00
|
||||
city: Praha 1
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:25.524014+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CKCJ
|
||||
ghcid_original: CZ-HL-PRA-L-ČKCJ
|
||||
ghcid_uuid: 78f22202-db45-5112-8927-dbbfc3eca42c
|
||||
ghcid_uuid_sha256: 4878a8ef-7251-87c1-980e-2a787b8ca612
|
||||
ghcid_numeric: 5222109514304681921
|
||||
record_id: 9bc9afc6-3356-4113-a8ae-2e4aa26e8363
|
||||
generation_timestamp: '2025-12-06T23:37:25.524014+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CKCJ
|
||||
valid_from: '2025-12-10T09:46:58Z'
|
||||
valid_to: null
|
||||
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
|
||||
per ISO 3166-2:CZ
|
||||
- ghcid: CZ-HL-PRA-L-CKCJ
|
||||
valid_from: null
|
||||
valid_to: '2025-12-10T09:46:58Z'
|
||||
reason: Previous GHCID with incorrect region code
|
||||
- ghcid: CZ-HL-PRA-L-CKCJ
|
||||
ghcid_numeric: 5222109514304681921
|
||||
valid_from: '2025-12-08T11:21:23.154226+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČKCJ)'
|
||||
- ghcid: CZ-HL-PRA-L-ČKCJ
|
||||
ghcid_numeric: 5098596963913306570
|
||||
valid_from: '2025-12-06T23:37:25.524014+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:23.154226+00:00'
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CKCJ
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 78f22202-db45-5112-8927-dbbfc3eca42c
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 4878a8ef-7251-87c1-980e-2a787b8ca612
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '5222109514304681921'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 9bc9afc6-3356-4113-a8ae-2e4aa26e8363
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:30.046139+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046139+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date: null
|
||||
verified_by: null
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: ČSAV - Kabinet cizích jazyků
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046139+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046139+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 1
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046139+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABB021
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:30.046139+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:25.524014+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 110 00
|
||||
street_address: Valentinská 1
|
||||
normalization_timestamp: '2025-12-09T20:41:06.322739+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.322714+00:00'
|
||||
165
data/custodian.backup.20251230/CZ-10-PRA-L-CKP.yaml
Normal file
|
|
@@ -0,0 +1,165 @@
|
|||
original_entry:
|
||||
name: Československý kamenoprůmysl Praha
|
||||
institution_type: LIBRARY
|
||||
source: CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
identifiers:
|
||||
- &id001
|
||||
identifier_scheme: Sigla
|
||||
identifier_value: ABF020
|
||||
locations:
|
||||
- street_address: Liebknechtova 61
|
||||
postal_code: 150 00
|
||||
city: Praha 5
|
||||
region: Hlavní město Praha
|
||||
country: CZ
|
||||
processing_timestamp: '2025-12-06T23:37:15.799986+00:00'
|
||||
ghcid:
|
||||
ghcid_current: CZ-10-PRA-L-CKP
|
||||
ghcid_original: CZ-HL-PRA-L-ČKP
|
||||
ghcid_uuid: 00ac345f-83fb-5055-9659-92cffb54a22c
|
||||
ghcid_uuid_sha256: 3ca6be9a-da46-86de-91ae-070f72f5bc49
|
||||
ghcid_numeric: 4370390060705949406
|
||||
record_id: 5d12a8e8-7773-4093-99b0-3c6de6d30aa8
|
||||
generation_timestamp: '2025-12-06T23:37:15.799986+00:00'
|
||||
location_resolution:
|
||||
country_code: CZ
|
||||
region_code: 10
|
||||
city_code: PRA
|
||||
method: CH_ANNOTATOR_SOURCE
|
||||
ghcid_history:
|
||||
- ghcid: CZ-10-PRA-L-CKP
|
||||
valid_from: "2025-12-10T09:46:58Z"
|
||||
valid_to: null
|
||||
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
|
||||
- ghcid: CZ-HL-PRA-L-CKP
|
||||
valid_from: null
|
||||
valid_to: "2025-12-10T09:46:58Z"
|
||||
reason: "Previous GHCID with incorrect region code"
|
||||
- ghcid: CZ-HL-PRA-L-CKP
|
||||
ghcid_numeric: 4370390060705949406
|
||||
valid_from: '2025-12-08T11:21:23.110362+00:00'
|
||||
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČKP)'
|
||||
- ghcid: CZ-HL-PRA-L-ČKP
|
||||
ghcid_numeric: 12709429357571624434
|
||||
valid_from: '2025-12-06T23:37:15.799986+00:00'
|
||||
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
|
||||
valid_to: '2025-12-08T11:21:23.110362+00:00'
|
||||
custodian_name:
|
||||
claim_type: custodian_name
|
||||
claim_value: Československý kamenoprůmysl Praha
|
||||
source_type: ch_annotator
|
||||
identifiers:
|
||||
- identifier_scheme: GHCID
|
||||
identifier_value: CZ-10-PRA-L-CKP
|
||||
- identifier_scheme: GHCID_UUID
|
||||
identifier_value: 00ac345f-83fb-5055-9659-92cffb54a22c
|
||||
- identifier_scheme: GHCID_UUID_SHA256
|
||||
identifier_value: 3ca6be9a-da46-86de-91ae-070f72f5bc49
|
||||
- identifier_scheme: GHCID_NUMERIC
|
||||
identifier_value: '4370390060705949406'
|
||||
- identifier_scheme: RECORD_ID
|
||||
identifier_value: 5d12a8e8-7773-4093-99b0-3c6de6d30aa8
|
||||
- *id001
|
||||
provenance:
|
||||
data_source: API_SCRAPING
|
||||
data_tier: TIER_1_AUTHORITATIVE
|
||||
extraction_date: '2025-11-19T10:14:29.914267+00:00'
|
||||
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
|
||||
confidence_score: 0.95
|
||||
notes:
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
|
||||
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
|
||||
ch_annotator:
|
||||
convention_id: ch_annotator-v1_7_0
|
||||
convention_version: 1.7.0
|
||||
entity_classification:
|
||||
hypernym: GRP
|
||||
hypernym_label: GROUP
|
||||
subtype: GRP.HER.LIB
|
||||
subtype_label: LIBRARY
|
||||
ontology_class: schema:Library
|
||||
alternative_classes:
|
||||
- org:FormalOrganization
|
||||
- rov:RegisteredOrganization
|
||||
- glam:HeritageCustodian
|
||||
extraction_provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914267+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
annotation_provenance:
|
||||
annotation_agent: opencode-claude-sonnet-4
|
||||
annotation_date: '2025-12-06T21:12:32.389491+00:00'
|
||||
annotation_method: retroactive CH-Annotator application via batch script
|
||||
source_file: czech_unified.yaml
|
||||
annotation_metadata:
|
||||
confidence_score: 0.95
|
||||
verified: false
|
||||
verification_date:
|
||||
verified_by:
|
||||
entity_claims:
|
||||
- claim_type: full_name
|
||||
claim_value: Československý kamenoprůmysl Praha
|
||||
property_uri: skos:prefLabel
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914267+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: institution_type
|
||||
claim_value: LIBRARY
|
||||
property_uri: rdf:type
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914267+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.95
|
||||
- claim_type: located_in_city
|
||||
claim_value: Praha 5
|
||||
property_uri: schema:addressLocality
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914267+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 0.9
|
||||
- claim_type: ghcid
|
||||
claim_value: CZ-ABF020
|
||||
property_uri: glam:ghcid
|
||||
provenance:
|
||||
namespace: glam
|
||||
path: /files/czech_unified.yaml
|
||||
timestamp: '2025-11-19T10:14:29.914267+00:00'
|
||||
agent: claude-conversation
|
||||
context_convention: ch_annotator-v1_7_0
|
||||
confidence: 1.0
|
||||
integration_note:
|
||||
created_from: czech_unified_ch_annotator.yaml
|
||||
creation_date: '2025-12-06T23:37:15.799986+00:00'
|
||||
creation_method: create_custodian_from_ch_annotator.py
|
||||
location:
|
||||
city: Prague
|
||||
region: Hlavní město Praha
|
||||
region_code: 10
|
||||
country: CZ
|
||||
postal_code: 150 00
|
||||
street_address: Liebknechtova 61
|
||||
normalization_timestamp: '2025-12-09T20:41:06.370465+00:00'
|
||||
latitude: 50.08804
|
||||
longitude: 14.42076
|
||||
geonames_id: 3067696
|
||||
geonames_name: Prague
|
||||
feature_code: PPLC
|
||||
coordinate_provenance:
|
||||
source_type: GEONAMES_GHCID_CITY_CODE
|
||||
source_path: data/reference/geonames.db
|
||||
entity_id: 3067696
|
||||
city_code: PRA
|
||||
original_timestamp: '2025-12-09T20:41:06.370441+00:00'
|
||||
Some files were not shown because too many files have changed in this diff