backup CZ and JP

This commit is contained in:
kempersc 2025-12-30 23:19:38 +01:00
parent 45e873ec0a
commit b42d6bf5d2
10000 changed files with 2462216 additions and 87 deletions

View file

@ -0,0 +1,235 @@
# Migration Specification: `agent: claude-conversation` → Proper Provenance
**Created**: 2025-12-30
**Status**: SPECIFICATION (Not Yet Implemented)
**Related**: PROVENANCE_TIMESTAMP_RULES.md, WEB_OBSERVATION_PROVENANCE_RULES.md
## Problem Statement
24,328 custodian YAML files in `data/custodian/` have provenance statements with:
- `agent: claude-conversation` (vague, non-specific agent identifier)
- Single `timestamp` field (violates Rule 35: dual timestamp requirement)
- No distinction between statement creation and source archival
## Affected Files
All files matching:
```bash
grep -l "agent: claude-conversation" data/custodian/*.yaml
# Result: 24,328 files
```
### Provenance Locations in Each File
1. **`ch_annotator.extraction_provenance.agent`** - Top-level extraction agent
2. **`ch_annotator.entity_claims[].provenance.agent`** - Per-claim provenance (multiple instances)
## Source Data Categories
The 24,328 files come from different original sources, requiring different migration strategies:
### Category 1: ISIL Registry / CSV Sources (~18,000 files)
**Examples**: Japan, Austria, Switzerland, Czech, Bulgarian, Belgian ISIL registries
**Characteristics**:
- `path: /files/{country}_complete.yaml`
- Data originated from authoritative CSV registries
- The CSV files are already archived in `data/instances/`
**Migration Strategy** (Scripted):
```yaml
# BEFORE
extraction_provenance:
path: /files/japan_complete.yaml
timestamp: '2025-11-18T14:46:40.580095+00:00'
agent: claude-conversation # ← INVALID
# AFTER
extraction_provenance:
source_type: isil_registry_csv
source_path: /files/japan_complete.yaml
source_archived_at: '2025-11-18T14:46:40.580095+00:00' # When CSV was processed
statement_created_at: '2025-12-06T21:13:31.304940+00:00' # From annotation_date
agent: batch-script-create-custodian-from-ch-annotator
context_convention: ch_annotator-v1_7_0
```
### Category 2: Conversation-Extracted Data (~4,000 files)
**Examples**: Palestinian heritage custodians, some Latin American institutions
**Characteristics**:
- `path: /conversations/{uuid}`
- Data extracted from Claude conversation exports
- Need to trace back to original sources mentioned IN the conversation
**Migration Strategy** (Requires GLM4.7 + Manual Review):
1. Load the conversation JSON file
2. Use GLM4.7 to identify the ACTUAL sources mentioned in conversation
3. For each source type:
- **Web sources**: Use web-reader to archive + extract with XPath
- **Wikidata**: Add Wikidata entity provenance
- **Academic sources**: Add DOI/citation provenance
### Category 3: Web-Enriched Data (~2,000 files)
**Examples**: Institutions with `web_enrichment`, `google_maps_enrichment`
**Characteristics**:
- Have web-scraped data that needs XPath provenance
- May have Google Maps or OSM enrichment
**Migration Strategy** (Requires web-reader + Playwright):
1. Re-archive source websites using Playwright
2. Use web-reader to extract claims with XPath provenance
3. Generate dual timestamps from archival metadata
## Migration Pipeline Architecture
```
┌─────────────────────────────────────────────────────────────────────┐
│ MIGRATION PIPELINE │
├─────────────────────────────────────────────────────────────────────┤
│ │
│ ┌─────────────┐ ┌──────────────────┐ ┌─────────────────┐ │
│ │ Categorizer │ ──▶ │ Source Resolver │ ──▶ │ Provenance │ │
│ │ │ │ │ │ Generator │ │
│ │ - Detect │ │ - CSV Registry │ │ │ │
│ │ source │ │ - Conversation │ │ - Dual │ │
│ │ type │ │ - Web Archive │ │ timestamps │ │
│ │ - Route to │ │ - Wikidata │ │ - Valid agent │ │
│ │ handler │ │ │ │ - Source refs │ │
│ └─────────────┘ └──────────────────┘ └─────────────────┘ │
│ │ │ │ │
│ ▼ ▼ ▼ │
│ ┌─────────────────────────────────────────────────────────────┐ │
│ │ Source-Specific Handlers │ │
│ ├─────────────────────────────────────────────────────────────┤ │
│ │ │ │
│ │ ┌────────────────┐ ┌────────────────┐ ┌───────────────┐ │ │
│ │ │ ISIL/CSV │ │ Conversation │ │ Web Archive │ │ │
│ │ │ Handler │ │ Handler │ │ Handler │ │ │
│ │ │ │ │ │ │ │ │ │
│ │ │ - Read CSV │ │ - Parse JSON │ │ - Playwright │ │ │
│ │ │ - Map to │ │ - GLM4.7 │ │ - web-reader │ │ │
│ │ │ timestamps │ │ analysis │ │ - XPath │ │ │
│ │ │ - Update │ │ - Source │ │ extraction │ │ │
│ │ │ provenance │ │ tracing │ │ │ │ │
│ │ └────────────────┘ └────────────────┘ └───────────────┘ │ │
│ │ │ │
│ └─────────────────────────────────────────────────────────────┘ │
│ │
│ ┌─────────────────────────────────────────────────────────────┐ │
│ │ Validation Layer │ │
│ ├─────────────────────────────────────────────────────────────┤ │
│ │ - Dual timestamp check (Rule 35) │ │
│ │ - Agent identifier validation │ │
│ │ - source_archived_at <= statement_created_at │ │
│ │ - XPath verification (where applicable) │ │
│ └─────────────────────────────────────────────────────────────┘ │
│ │
└─────────────────────────────────────────────────────────────────────┘
```
## Implementation Phases
### Phase 1: Category 1 - ISIL/CSV Sources (Scripted, No LLM Required)
**Scope**: ~18,000 files
**Effort**: 1-2 days scripting
**Tools**: Python script only
Files where `path` matches `/files/*.yaml` or `/files/*.csv`:
- Parse the annotation_date to get `statement_created_at`
- Use the original file's processing timestamp for `source_archived_at`
- Replace `agent: claude-conversation` with source-specific agent
### Phase 2: Category 2 - Conversation Sources (GLM4.7 Required)
**Scope**: ~4,000 files
**Effort**: 3-5 days with LLM processing
**Tools**: GLM4.7 API, conversation JSON parser
For each file with `path: /conversations/{uuid}`:
1. Load conversation JSON from archive (if available)
2. Send to GLM4.7 with prompt to identify actual data sources
3. Update provenance based on source analysis
### Phase 3: Category 3 - Web Sources (web-reader + Playwright)
**Scope**: ~2,000 files
**Effort**: 5-10 days with web archival
**Tools**: Playwright, web-reader MCP, GLM4.7
For files with web-derived claims:
1. Archive source URLs using Playwright
2. Extract claims with XPath using web-reader
3. Generate dual timestamps from archival metadata
## File Updates
### Per-File Changes
For each of the 24,328 files:
1. **Update `ch_annotator.extraction_provenance`**:
```yaml
extraction_provenance:
# Existing fields retained
namespace: glam
path: /files/japan_complete.yaml
context_convention: ch_annotator-v1_7_0
# NEW: Dual timestamps
source_archived_at: '2025-11-18T14:46:40.580095+00:00'
statement_created_at: '2025-12-06T21:13:31.304940+00:00'
# NEW: Valid agent identifier
agent: batch-script-create-custodian-from-ch-annotator
# NEW: Source classification
source_type: isil_registry_csv
# NEW: Migration tracking
migration_note: 'Migrated from agent:claude-conversation on 2025-12-30'
```
2. **Update each `ch_annotator.entity_claims[].provenance`**:
```yaml
provenance:
namespace: glam
path: /files/japan_complete.yaml
context_convention: ch_annotator-v1_7_0
# NEW: Dual timestamps (inherited from parent)
source_archived_at: '2025-11-18T14:46:40.580095+00:00'
statement_created_at: '2025-12-06T21:13:31.304940+00:00'
# NEW: Valid agent
agent: batch-script-create-custodian-from-ch-annotator
```
## Validation Criteria
After migration, every provenance block MUST pass:
1. ✅ `statement_created_at` is present (ISO 8601)
2. ✅ `source_archived_at` is present (ISO 8601)
3. ✅ `source_archived_at <= statement_created_at`
4. ✅ `agent` is NOT `claude-conversation`, `claude`, `ai`, `opencode`, or `llm`
5. ✅ `agent` follows format `{tool}-{model}-{version}` or `{script-name}`
## Rollback Strategy
Before migration:
1. Create timestamped backup: `data/custodian.backup.2025-12-30/`
2. Store original provenance in `_migration_backup` field
3. Generate diff report for manual review
## References
- Rule 35: `.opencode/PROVENANCE_TIMESTAMP_RULES.md`
- Rule 6: `.opencode/WEB_OBSERVATION_PROVENANCE_RULES.md`
- CH-Annotator: `data/entity_annotation/ch_annotator-v1_7_0.yaml`
- web-reader script: `scripts/add_web_claim_provenance.py`

View file

@ -0,0 +1,477 @@
# GLM4.7 Prompts for Category 2: Conversation Source Analysis
**Created**: 2025-12-30
**Status**: SPECIFICATION
**Related**: CLAUDE_CONVERSATION_MIGRATION_SPEC.md, PROVENANCE_TIMESTAMP_RULES.md
## Purpose
Category 2 files (~4,000) have provenance paths like `/conversations/{uuid}` which reference Claude conversation exports. The actual data sources (Wikidata, websites, registries, academic papers) are mentioned WITHIN the conversation text.
GLM4.7 is used to:
1. Parse conversation JSON files
2. Identify the REAL data sources mentioned
3. Extract source metadata (URLs, timestamps, identifiers)
4. Generate proper dual-timestamp provenance
## Workflow Overview
```
┌─────────────────────────────────────────────────────────────────┐
│ CATEGORY 2 MIGRATION WORKFLOW │
├─────────────────────────────────────────────────────────────────┤
│ │
│ ┌──────────────┐ ┌──────────────┐ ┌─────────────────┐ │
│ │ Custodian │ │ Conversation │ │ GLM4.7 │ │
│ │ YAML File │ ──▶ │ JSON Archive │ ──▶ │ Source Analysis │ │
│ │ │ │ │ │ │ │
│ │ path: /conv/ │ │ Full text of │ │ Identify: │ │
│ │ {uuid} │ │ messages │ │ - URLs │ │
│ └──────────────┘ └──────────────┘ │ - Wikidata IDs │ │
│ │ - Registry refs │ │
│ │ - API calls │ │
│ └─────────────────┘ │
│ │ │
│ ▼ │
│ ┌─────────────────────────────────┐ │
│ │ Source-Specific Handlers │ │
│ ├─────────────────────────────────┤ │
│ │ │ │
│ │ ┌───────────┐ ┌───────────────┐ │ │
│ │ │ Web URLs │ │ Wikidata IDs │ │ │
│ │ │ │ │ │ │ │
│ │ │ Playwright│ │ SPARQL query │ │ │
│ │ │ archive + │ │ to verify │ │ │
│ │ │ web-reader│ │ claims │ │ │
│ │ └───────────┘ └───────────────┘ │ │
│ │ │ │
│ │ ┌───────────┐ ┌───────────────┐ │ │
│ │ │ Registry │ │ Academic │ │ │
│ │ │ References│ │ Citations │ │ │
│ │ │ │ │ │ │ │
│ │ │ Map to │ │ DOI lookup │ │ │
│ │ │ CSV files │ │ CrossRef API │ │ │
│ │ └───────────┘ └───────────────┘ │ │
│ └─────────────────────────────────┘ │
│ │
└─────────────────────────────────────────────────────────────────┘
```
## GLM4.7 Prompts
### Prompt 1: Source Identification
```markdown
# Task: Identify Data Sources in Heritage Custodian Conversation
You are analyzing a Claude conversation that was used to extract heritage institution data. Your task is to identify ALL data sources mentioned or used in this conversation.
## Conversation Content
{conversation_json}
## Institution Being Analyzed
- Name: {institution_name}
- GHCID: {ghcid}
- Current provenance path: /conversations/{conversation_uuid}
## Instructions
1. Read through the entire conversation carefully
2. Identify every data source mentioned or used, including:
- **Web URLs**: Institution websites, registry portals, news articles
- **Wikidata**: Entity IDs (Q-numbers) referenced or queried
- **API Calls**: Any structured data fetches (SPARQL, REST APIs)
- **CSV/Registry References**: ISIL registries, national databases
- **Academic Sources**: Papers, reports, DOIs
- **Government Sources**: Official publications, gazettes
3. For each source, extract:
- Source type (web, wikidata, api, registry, academic, government)
- Source identifier (URL, Q-number, DOI, etc.)
- What data was extracted from it
- Approximate timestamp of access (if mentioned)
## Output Format
Return a JSON array of sources:
```json
{
"institution_name": "{institution_name}",
"ghcid": "{ghcid}",
"conversation_uuid": "{conversation_uuid}",
"identified_sources": [
{
"source_type": "web",
"source_url": "https://example.org/about",
"source_identifier": null,
"data_extracted": ["name", "address", "opening_hours"],
"access_timestamp": "2025-09-22T14:40:00Z",
"confidence": 0.95,
"evidence_quote": "Looking at their website at example.org..."
},
{
"source_type": "wikidata",
"source_url": "https://www.wikidata.org/wiki/Q12345",
"source_identifier": "Q12345",
"data_extracted": ["instance_of", "country", "coordinates"],
"access_timestamp": null,
"confidence": 0.98,
"evidence_quote": "According to Wikidata (Q12345)..."
}
],
"analysis_notes": "Any relevant observations about source quality or gaps"
}
```
## Important
- Only include sources that were ACTUALLY used to extract data
- Do not invent sources - if unsure, set confidence lower
- Include the exact quote from conversation that references each source
- If no sources can be identified, return empty array with explanation
```
### Prompt 2: Claim-Source Attribution
```markdown
# Task: Map Claims to Their Original Sources
You have identified the following sources used in a heritage custodian conversation:
## Identified Sources
{identified_sources_json}
## Entity Claims from Custodian File
{entity_claims_json}
## Institution
- Name: {institution_name}
- GHCID: {ghcid}
## Instructions
For each entity claim, determine which source(s) it was derived from.
1. Analyze each claim (full_name, institution_type, located_in_city, etc.)
2. Match it to the most likely source based on:
- What data each source provides
- The conversation context
- Claim confidence scores
3. Generate proper provenance for each claim
## Output Format
Return updated provenance for each claim:
```json
{
"claim_provenance_updates": [
{
"claim_type": "full_name",
"claim_value": "Example Museum",
"attributed_source": {
"source_type": "web",
"source_url": "https://example.org/about",
"source_archived_at": "2025-09-22T14:40:00Z",
"statement_created_at": "2025-12-06T21:13:31Z",
"agent": "opencode-claude-sonnet-4",
"attribution_confidence": 0.92
},
"attribution_rationale": "Name found on official website header"
},
{
"claim_type": "wikidata_id",
"claim_value": "Q12345",
"attributed_source": {
"source_type": "wikidata",
"source_url": "https://www.wikidata.org/wiki/Q12345",
"source_archived_at": "2025-09-22T14:45:00Z",
"statement_created_at": "2025-12-06T21:13:31Z",
"agent": "opencode-claude-sonnet-4",
"attribution_confidence": 1.0
},
"attribution_rationale": "Directly queried from Wikidata"
}
],
"unattributed_claims": [
{
"claim_type": "opening_hours",
"claim_value": "Mon-Fri 9-17",
"reason": "Source could not be determined from conversation"
}
]
}
```
## Rules
- If a claim cannot be attributed to any identified source, add to unattributed_claims
- For unattributed claims, the migration script will flag for manual review
- Use the conversation UUID as fallback source_archived_at if no timestamp available
- statement_created_at should use the annotation_date from CH-Annotator
```
### Prompt 3: Web Source Verification
```markdown
# Task: Verify Web Sources for Archival
Before we archive web sources with Playwright, verify they are valid and relevant.
## Web Sources to Verify
{web_sources_json}
## Institution
- Name: {institution_name}
- GHCID: {ghcid}
## Instructions
For each web source, determine:
1. **URL Validity**: Is the URL well-formed and likely still accessible?
2. **Relevance**: Does this URL relate to the institution?
3. **Archive Priority**: Should we archive this with Playwright?
4. **Expected Content**: What data should we extract with web-reader?
## Output Format
```json
{
"web_source_verification": [
{
"source_url": "https://example.org/about",
"url_valid": true,
"is_institution_website": true,
"archive_priority": "high",
"expected_claims": ["name", "address", "description", "contact"],
"web_reader_selectors": {
"name": "h1.institution-name",
"address": ".contact-info address",
"description": "main .about-text"
},
"notes": "Official institution website - primary source"
},
{
"source_url": "https://twitter.com/example",
"url_valid": true,
"is_institution_website": false,
"archive_priority": "low",
"expected_claims": ["social_media_handle"],
"web_reader_selectors": null,
"notes": "Social media - only need URL, not content"
}
],
"sources_to_archive": ["https://example.org/about"],
"sources_to_skip": ["https://twitter.com/example"]
}
```
## Priority Levels
- **high**: Institution's own website - archive immediately
- **medium**: Government registries, Wikipedia - archive if accessible
- **low**: Social media, aggregators - just store URL
- **skip**: Dead links, paywalled content, dynamic apps
```
## Implementation Script Outline
```python
#!/usr/bin/env python3
"""
Phase 2 Migration: Conversation Sources → Proper Provenance
Uses GLM4.7 to analyze conversation JSON files and identify
actual data sources for heritage custodian claims.
"""
import json
import os
from pathlib import Path
from datetime import datetime, timezone
import httpx
import yaml
# Z.AI GLM API configuration (per Rule 11)
ZAI_API_URL = "https://api.z.ai/api/coding/paas/v4/chat/completions"
ZAI_MODEL = "glm-4.5" # or glm-4.6 for higher quality
def get_zai_token() -> str:
"""Get Z.AI API token from environment."""
token = os.environ.get("ZAI_API_TOKEN")
if not token:
raise ValueError("ZAI_API_TOKEN environment variable not set")
return token
def call_glm4(prompt: str, system_prompt: str = None) -> str:
"""Call GLM4 API with prompt."""
headers = {
"Authorization": f"Bearer {get_zai_token()}",
"Content-Type": "application/json"
}
messages = []
if system_prompt:
messages.append({"role": "system", "content": system_prompt})
messages.append({"role": "user", "content": prompt})
payload = {
"model": ZAI_MODEL,
"messages": messages,
"temperature": 0.1, # Low temperature for consistent extraction
"max_tokens": 4096
}
response = httpx.post(ZAI_API_URL, json=payload, headers=headers, timeout=60)
response.raise_for_status()
return response.json()["choices"][0]["message"]["content"]
def load_conversation_json(uuid: str) -> dict:
"""Load conversation JSON from archive."""
# Conversation archives stored in data/conversations/
conv_path = Path(f"data/conversations/{uuid}.json")
if not conv_path.exists():
# Try alternative locations
alt_path = Path(f"~/Documents/claude/glam/{uuid}.json").expanduser()
if alt_path.exists():
conv_path = alt_path
else:
return None
with open(conv_path, 'r') as f:
return json.load(f)
def identify_sources_for_institution(custodian_file: Path) -> dict:
"""
Analyze conversation to identify sources for a custodian.
Returns dict with:
- identified_sources: list of sources found
- claim_attributions: mapping of claims to sources
- web_sources_to_archive: URLs needing Playwright archival
"""
# Load custodian YAML
with open(custodian_file, 'r') as f:
custodian = yaml.safe_load(f)
# Extract conversation UUID from provenance path
ch_annotator = custodian.get('ch_annotator', {})
extraction_prov = ch_annotator.get('extraction_provenance', {})
path = extraction_prov.get('path', '')
if not path.startswith('/conversations/'):
return {'error': 'Not a conversation source file'}
conv_uuid = path.replace('/conversations/', '')
# Load conversation JSON
conversation = load_conversation_json(conv_uuid)
if not conversation:
return {'error': f'Conversation not found: {conv_uuid}'}
# Extract relevant info
institution_name = custodian.get('custodian_name', {}).get('claim_value', 'Unknown')
ghcid = custodian.get('ghcid', {}).get('ghcid_current', 'Unknown')
entity_claims = ch_annotator.get('entity_claims', [])
# Step 1: Identify sources using GLM4
source_prompt = PROMPT_1_SOURCE_IDENTIFICATION.format(
conversation_json=json.dumps(conversation, indent=2)[:50000], # Truncate if needed
institution_name=institution_name,
ghcid=ghcid,
conversation_uuid=conv_uuid
)
sources_response = call_glm4(source_prompt)
identified_sources = json.loads(sources_response)
# Step 2: Attribute claims to sources
attribution_prompt = PROMPT_2_CLAIM_ATTRIBUTION.format(
identified_sources_json=json.dumps(identified_sources['identified_sources'], indent=2),
entity_claims_json=json.dumps(entity_claims, indent=2),
institution_name=institution_name,
ghcid=ghcid
)
attributions_response = call_glm4(attribution_prompt)
claim_attributions = json.loads(attributions_response)
# Step 3: Verify web sources
web_sources = [s for s in identified_sources['identified_sources'] if s['source_type'] == 'web']
if web_sources:
verification_prompt = PROMPT_3_WEB_VERIFICATION.format(
web_sources_json=json.dumps(web_sources, indent=2),
institution_name=institution_name,
ghcid=ghcid
)
verification_response = call_glm4(verification_prompt)
web_verification = json.loads(verification_response)
else:
web_verification = {'sources_to_archive': [], 'sources_to_skip': []}
return {
'custodian_file': str(custodian_file),
'conversation_uuid': conv_uuid,
'identified_sources': identified_sources,
'claim_attributions': claim_attributions,
'web_verification': web_verification
}
# Prompt templates (loaded from this file or external)
PROMPT_1_SOURCE_IDENTIFICATION = """...""" # From Prompt 1 above
PROMPT_2_CLAIM_ATTRIBUTION = """...""" # From Prompt 2 above
PROMPT_3_WEB_VERIFICATION = """...""" # From Prompt 3 above
```
## Conversation JSON Location
Conversation exports need to be located. Check these paths:
1. `~/Documents/claude/glam/*.json` - Original Claude exports
2. `data/conversations/*.json` - Project archive location
3. `data/instances/conversations/` - Alternative archive
If conversations are not archived, they may need to be re-exported from Claude.
## Integration with Phase 1
Phase 2 runs AFTER Phase 1 completes:
1. **Phase 1**: Migrates ~18,000 Category 1 files (ISIL/CSV sources)
2. **Phase 2**: Processes ~4,000 Category 2 files (conversation sources)
3. **Phase 3**: Archives web sources with Playwright for Category 3
## Cost Estimation
GLM4 API calls (per Rule 11: FREE via Z.AI Coding Plan):
- ~4,000 files × 3 prompts = ~12,000 API calls
- Cost: $0 (Z.AI Coding Plan)
- Time: ~2-4 hours (rate limited)
## Validation Criteria
After Phase 2 migration, every Category 2 file MUST pass:
1. ✅ `source_archived_at` is present (from identified source or conversation timestamp)
2. ✅ `statement_created_at` is present (from annotation_date)
3. ✅ `agent` is valid (opencode-claude-sonnet-4 or similar)
4. ✅ At least one source identified, OR flagged for manual review
5. ✅ Web sources queued for Playwright archival (Phase 3)
## References
- Rule 11: `.opencode/ZAI_GLM_API_RULES.md`
- Rule 35: `.opencode/PROVENANCE_TIMESTAMP_RULES.md`
- Migration Spec: `.opencode/CLAUDE_CONVERSATION_MIGRATION_SPEC.md`

View file

@ -0,0 +1,393 @@
# Provenance Timestamp Rules
**Created**: 2025-12-30
**Updated**: 2025-12-30
**Status**: Active Rule
**Related**: WEB_CLAIM_PROVENANCE_SCHEMA.md, YAML_PROVENANCE_SCHEMA.md, WEB_OBSERVATION_PROVENANCE_RULES.md
## Core Principle: Every Provenance Statement MUST Have At Least Two Timestamps
**All provenance statements in custodian data MUST include at minimum two timestamps:**
1. **`statement_created_at`** - When the provenance statement/claim was created (extraction/annotation time)
2. **`source_archived_at`** - When the source material was archived/captured
These two timestamps are MANDATORY. Additional temporal metadata is encouraged but optional.
---
## Mandatory Timestamps
### 1. Statement Created Timestamp (`statement_created_at`)
**Purpose**: Records when the claim/statement was extracted, annotated, or created by the agent.
**Format**: ISO 8601 with timezone (UTC preferred)
**Example**:
```yaml
statement_created_at: "2025-12-30T14:30:00Z"
```
**Source**: Generated by the extraction/annotation agent at processing time.
### 2. Source Archived Timestamp (`source_archived_at`)
**Purpose**: Records when the source material (webpage, document, API response) was archived/captured.
**Format**: ISO 8601 with timezone (UTC preferred)
**Example**:
```yaml
source_archived_at: "2025-12-29T10:15:00Z"
```
**Source**:
- For web sources: Playwright archival timestamp, Wayback Machine memento datetime
- For API sources: API response fetch timestamp
- For documents: Document capture/download timestamp
---
## Optional Timestamps (Encouraged)
### 3. Source Created Timestamp (`source_created_at`)
**Purpose**: When the original source content was created/published.
**Example**:
```yaml
source_created_at: "2022-07-15T14:15:00Z" # Article publish date
```
**Sources**:
- `article:published_time` meta tag
- `datePublished` in JSON-LD
- File creation date
- API response `created_at` field
### 4. Source Last Modified Timestamp (`source_last_modified_at`)
**Purpose**: When the source content was last updated.
**Example**:
```yaml
source_last_modified_at: "2023-01-10T09:00:00Z"
```
**Sources**:
- `article:modified_time` meta tag
- `dateModified` in JSON-LD
- HTTP `Last-Modified` header
- File modification date
### 5. Verification Timestamp (`last_verified_at`)
**Purpose**: When the claim was last re-verified against the source.
**Example**:
```yaml
last_verified_at: "2025-12-30T14:30:00Z"
```
### 6. Next Verification Due (`next_verification_due`)
**Purpose**: When the claim should be re-verified (for staleness tracking).
**Example**:
```yaml
next_verification_due: "2026-03-30T00:00:00Z" # 90 days from last verification
```
---
## Complete Provenance Timestamp Structure
### For Web Claims
```yaml
provenance:
# MANDATORY (both required)
statement_created_at: "2025-12-30T14:30:00Z" # When we extracted this
source_archived_at: "2025-12-29T10:15:00Z" # When we archived the webpage
# OPTIONAL (encouraged)
source_created_at: "2022-07-15T14:15:00Z" # When article was published
source_last_modified_at: "2023-01-10T09:00:00Z" # When article was updated
last_verified_at: "2025-12-30T14:30:00Z" # Last verification
next_verification_due: "2026-03-30T00:00:00Z" # Re-verify in 90 days
```
### For API-Sourced Data (Wikidata, Google Maps, etc.)
```yaml
_provenance:
# MANDATORY
statement_created_at: "2025-12-30T14:30:00Z" # When we processed API response
source_archived_at: "2025-12-30T14:29:55Z" # When API was queried (fetch_timestamp)
# OPTIONAL
source_last_modified_at: "2025-12-15T00:00:00Z" # Wikidata entity last modified
last_verified_at: "2025-12-30T14:30:00Z"
```
### For CH-Annotator Extracted Claims
```yaml
provenance:
namespace: glam
path: /conversations/edc75d66-ee42-4199-8e22-65b0d2347922
# MANDATORY
statement_created_at: "2025-12-06T21:13:56Z" # When CH-Annotator processed this
source_archived_at: "2025-11-06T08:02:44Z" # When conversation was exported
# Agent identification
agent: opencode-claude-sonnet-4
context_convention: ch_annotator-v1_7_0
```
---
## Invalid Provenance: `agent: claude-conversation`
**PROBLEM**: 24,328 custodian files currently contain provenance statements like:
```yaml
# INVALID - Missing timestamps and proper source identification
extraction_provenance:
namespace: glam
path: /conversations/edc75d66-ee42-4199-8e22-65b0d2347922
timestamp: '2025-11-06T08:02:44.240037+00:00' # Only ONE timestamp!
agent: claude-conversation # Vague agent identifier
context_convention: ch_annotator-v1_7_0
```
**ISSUES**:
1. `claude-conversation` is not a valid agent identifier (which Claude model? which session?)
2. Only one timestamp - doesn't distinguish statement creation from source archival
3. No UUID reference to the specific conversation
4. No archived source path
---
## Valid Provenance Structure (Migration Target)
```yaml
extraction_provenance:
namespace: glam
# Source identification
source_type: claude_conversation_export
source_path: /conversations/edc75d66-ee42-4199-8e22-65b0d2347922
conversation_uuid: edc75d66-ee42-4199-8e22-65b0d2347922
# MANDATORY timestamps
statement_created_at: "2025-12-06T21:13:56.173868+00:00" # Annotation time
source_archived_at: "2025-11-06T08:02:44.240037+00:00" # Conversation export time
# Agent identification (proper format)
agent:
name: opencode-claude-sonnet-4
model: claude-sonnet-4-20250514
session_type: opencode_conversation
# Context
context_convention: ch_annotator-v1_7_0
# Archive reference
archive:
format: claude_conversation_json
local_path: data/conversations/edc75d66-ee42-4199-8e22-65b0d2347922.json
```
---
## Timestamp Hierarchy and Derivation
When only one timestamp is available, derive the other:
| Available | Derive `statement_created_at` | Derive `source_archived_at` |
|-----------|------------------------------|----------------------------|
| Only `timestamp` | Use as `statement_created_at` | Set to same value (assume simultaneous) |
| Only `extraction_date` | Use as `statement_created_at` | Set to same value |
| Only `fetch_timestamp` | Set to same value | Use as `source_archived_at` |
| Only `annotation_date` | Use as `statement_created_at` | Look for `timestamp` in source |
**Migration rule**: If we cannot determine `source_archived_at`, use the earliest available timestamp from the source chain.
---
## Agent Identification Standards
### Invalid Agent Identifiers
```yaml
# INVALID - Too vague
agent: claude-conversation
agent: claude
agent: ai
agent: llm
agent: opencode
```
### Valid Agent Identifiers
```yaml
# Format: {tool}-{model}-{version}
agent: opencode-claude-sonnet-4
agent: opencode-claude-opus-4
agent: batch-script-python-3.11
agent: manual-human-curator
# Or structured format
agent:
name: opencode-claude-sonnet-4
model: claude-sonnet-4-20250514
tool: opencode
version: "1.0.0"
```
---
## PROV-O Alignment
These timestamps align with W3C PROV-O:
| Our Field | PROV-O Property | Description |
|-----------|-----------------|-------------|
| `statement_created_at` | `prov:generatedAtTime` | When entity was generated |
| `source_archived_at` | `prov:atTime` (on Activity) | When archival activity occurred |
| `source_created_at` | `dcterms:created` | Original creation date |
| `source_last_modified_at` | `dcterms:modified` | Last modification date |
```yaml
prov:
generatedAtTime: "2025-12-30T14:30:00Z" # = statement_created_at
wasGeneratedBy:
"@type": "prov:Activity"
name: "web_extraction"
atTime: "2025-12-29T10:15:00Z" # = source_archived_at
```
---
## Validation Rules
### Rule 1: Both Mandatory Timestamps Required
```python
def validate_provenance_timestamps(provenance: dict) -> list[str]:
errors = []
# Check for mandatory timestamps
if 'statement_created_at' not in provenance:
errors.append("Missing mandatory 'statement_created_at' timestamp")
if 'source_archived_at' not in provenance:
errors.append("Missing mandatory 'source_archived_at' timestamp")
return errors
```
### Rule 2: Timestamps Must Be Valid ISO 8601
```python
from datetime import datetime
def validate_timestamp_format(timestamp: str) -> bool:
try:
datetime.fromisoformat(timestamp.replace('Z', '+00:00'))
return True
except ValueError:
return False
```
### Rule 3: source_archived_at <= statement_created_at
The source must be archived BEFORE or AT the same time as the statement is created.
```python
def validate_timestamp_order(provenance: dict) -> bool:
archived = datetime.fromisoformat(provenance['source_archived_at'])
created = datetime.fromisoformat(provenance['statement_created_at'])
return archived <= created
```
---
## Migration Strategy for Existing Files
### Phase 1: Identify Files Needing Migration
```bash
# Count affected files
find data/custodian -name "*.yaml" -exec grep -l "agent: claude-conversation" {} \; | wc -l
# Result: 24,328 files
```
### Phase 2: Parse and Transform
For each file with `agent: claude-conversation`:
1. Extract existing `timestamp` field
2. Set `source_archived_at` = existing `timestamp`
3. Set `statement_created_at` = `annotation_date` if present, else use current time
4. Replace `agent: claude-conversation` with proper agent identifier
5. Add conversation UUID from path
### Phase 3: Validate and Write
```python
def migrate_provenance(data: dict) -> dict:
"""Migrate old claude-conversation provenance to new format."""
if 'ch_annotator' in data:
ch = data['ch_annotator']
if ch.get('extraction_provenance', {}).get('agent') == 'claude-conversation':
old_prov = ch['extraction_provenance']
# Extract conversation UUID from path
path = old_prov.get('path', '')
conv_uuid = path.split('/')[-1] if '/conversations/' in path else None
# Get timestamps
source_archived_at = old_prov.get('timestamp')
statement_created_at = ch.get('annotation_provenance', {}).get('annotation_date', source_archived_at)
# Build new provenance
ch['extraction_provenance'] = {
'namespace': old_prov.get('namespace', 'glam'),
'source_type': 'claude_conversation_export',
'source_path': old_prov.get('path'),
'conversation_uuid': conv_uuid,
'statement_created_at': statement_created_at,
'source_archived_at': source_archived_at,
'agent': 'opencode-claude-sonnet-4', # Default migration value
'context_convention': old_prov.get('context_convention'),
'migration_note': 'Migrated from agent:claude-conversation on 2025-12-30'
}
return data
```
---
## Implementation Checklist
- [ ] Add `statement_created_at` to all new provenance statements
- [ ] Add `source_archived_at` to all new provenance statements
- [ ] Replace `agent: claude-conversation` with proper agent identifiers
- [ ] Add conversation UUIDs where applicable
- [ ] Migrate existing 24,328 files with invalid provenance
- [ ] Update LinkML schema to require dual timestamps
- [ ] Add validation to data pipeline
---
## Related Documentation
- `.opencode/WEB_CLAIM_PROVENANCE_SCHEMA.md` - Web claim provenance structure
- `.opencode/YAML_PROVENANCE_SCHEMA.md` - YAML enrichment provenance
- `.opencode/WEB_OBSERVATION_PROVENANCE_RULES.md` - XPath provenance requirements
- `AGENTS.md` - Rule 35: Provenance Timestamps

View file

@ -0,0 +1,160 @@
original_entry:
name: BODHI PRAHA - Knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE360
- &id002
identifier_scheme: IČO
identifier_value: '49627414'
locations:
- street_address: Slezská 140
postal_code: 130 00
city: Praha 3
region: Hlavní město Praha
country: CZ
latitude: 50.076911
longitude: 14.464614
processing_timestamp: '2025-12-06T23:37:20.849119+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-BPK
ghcid_original: CZ-HL-PRA-H-BPK
ghcid_uuid: 7c1fd1a4-ee17-527e-9349-55fd16c01486
ghcid_uuid_sha256: 6367a388-cb73-8eee-53b7-8ea64c233ab4
ghcid_numeric: 7162873540280291054
record_id: 2a67ea10-8a77-44d3-bcaf-ae62cde0aa16
generation_timestamp: '2025-12-06T23:37:20.849119+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-BPK
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-H-BPK
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-H-BPK
ghcid_numeric: 7162873540280291054
valid_from: '2025-12-06T23:37:20.849119+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: BODHI PRAHA - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-BPK
- identifier_scheme: GHCID_UUID
identifier_value: 7c1fd1a4-ee17-527e-9349-55fd16c01486
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 6367a388-cb73-8eee-53b7-8ea64c233ab4
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7162873540280291054'
- identifier_scheme: RECORD_ID
identifier_value: 2a67ea10-8a77-44d3-bcaf-ae62cde0aa16
- *id001
- *id002
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.991619+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.991619+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: BODHI PRAHA - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.991619+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.991619+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 3
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.991619+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE360
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.991619+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:20.849119+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.076911
longitude: 14.464614
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 3
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 130 00
street_address: Slezská 140
normalization_timestamp: '2025-12-09T06:50:46.626432+00:00'

View file

@ -0,0 +1,292 @@
original_entry:
name: Církev československá husitská - Úřad Ústřední rady - Ústřední archiv a muzeum
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE460
- &id002
identifier_scheme: IČO
identifier_value: 00445185
- &id003
identifier_scheme: Website
identifier_value: https://www.ccsh.cz
identifier_url: https://www.ccsh.cz
- &id004
identifier_scheme: Wikidata
identifier_value: Q114619244
identifier_url: https://www.wikidata.org/wiki/Q114619244
locations:
- street_address: Wuchterlova 5
postal_code: 166 26
city: Praha 6
region: Hlavní město Praha
country: CZ
latitude: 50.097831
longitude: 14.397203
processing_timestamp: '2025-12-06T23:37:24.561843+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-CCHUURUAM
ghcid_original: CZ-HL-PRA-H-CČHÚÚRÚAM
ghcid_uuid: 7d10ac5c-5235-5ba1-ba30-2edfef4c0af9
ghcid_uuid_sha256: 6ff0d000-3401-8df7-a4c7-6eefa0376a81
ghcid_numeric: 8066175631911677431
record_id: c29783fc-ee79-410b-8d60-1acb510e3c72
generation_timestamp: '2025-12-06T23:37:24.561843+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-CCHUURUAM
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-CCHUURUAM
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-CCHUURUAM
ghcid_numeric: 8066175631911677431
valid_from: '2025-12-08T11:21:41.193514+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-CČHÚÚRÚAM)'
- ghcid: CZ-HL-PRA-H-CČHÚÚRÚAM
ghcid_numeric: 4847761274494306046
valid_from: '2025-12-06T23:37:24.561843+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:41.193514+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Církev československá husitská - Úřad Ústřední rady - Ústřední archiv a muzeum
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-CCHUURUAM
- identifier_scheme: GHCID_UUID
identifier_value: 7d10ac5c-5235-5ba1-ba30-2edfef4c0af9
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 6ff0d000-3401-8df7-a4c7-6eefa0376a81
- identifier_scheme: GHCID_NUMERIC
identifier_value: '8066175631911677431'
- identifier_scheme: RECORD_ID
identifier_value: c29783fc-ee79-410b-8d60-1acb510e3c72
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.039417+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-meF0a8cLF7Bs9iN06RJgME6ags8hQktQ+hi14hroIfU=
verified_at: '2025-12-28T19:58:22.443284+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619244
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.443305+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.039417+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Církev československá husitská - Úřad Ústřední rady - Ústřední archiv a muzeum
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.039417+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.039417+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.039417+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114619244
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114619244
timestamp: '2025-11-19T10:14:30.039417+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE460
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.039417+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:24.561843+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114619244
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:25:32.394804+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P749
- P361
wikidata_labels:
cs: Církev československá husitská, Úřad Ústřední rady, Ústřední archiv a muzeum (knihovna)
en: Czechoslovak Hussite Church, Central Council, Central Archiv and Museum
wikidata_label_en: Czechoslovak Hussite Church, Central Council, Central Archiv and Museum
wikidata_label_cs: Církev československá husitská, Úřad Ústřední rady, Ústřední archiv a muzeum (knihovna)
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- ÚAM ÚÚR CČSH
wikidata_classification:
instance_of: &id005
- id: Q13866185
label: museum library
description: library that is part of a museum
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2667290
label: Prague 6
description: municipal part of Prague
wikidata_country: *id006
wikidata_organization:
parent_organization: &id007
id: Q115654444
label: Ústřední archiv a muzeum Církve československé husitské
description: museum in Czechia
part_of: *id007
wikidata_contact:
email: mailto:archiv@ccsh.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-meF0a8cLF7Bs9iN06RJgME6ags8hQktQ+hi14hroIfU=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.443284+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619244
verification:
status: verified
last_verified: '2025-12-28T19:58:22.443294+00:00'
location:
latitude: 50.097831
longitude: 14.397203
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 6
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 166 26
street_address: Wuchterlova 5
normalization_timestamp: '2025-12-09T06:50:46.651423+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:55:27.491797+00:00'
source_url: https://www.ccsh.cz
extraction_method: crawl4ai
claims:
- claim_type: logo_url
claim_value: https://www.ccsh.cz/images/logoG.png
source_url: https://www.ccsh.cz
css_selector: '#t3-header > div.row > div.col-xs-12.col-sm-6 > div.logo-image > a > img.logo-img'
retrieved_on: '2025-12-23T21:55:27.491797+00:00'
extraction_method: crawl4ai_header_logo
detection_confidence: high
alt_text: CČSH
- claim_type: favicon_url
claim_value: https://www.ccsh.cz/safari-pinned-tab.svg
source_url: https://www.ccsh.cz
css_selector: '[document] > html.com_content.view-featured > head > link:nth-of-type(39)'
retrieved_on: '2025-12-23T21:55:27.491797+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
- claim_type: og_image_url
claim_value: https://nwork.vbox.cz/images/joomlart/cirkev/kdo_jsme.jpg
source_url: https://www.ccsh.cz
css_selector: '[document] > html.com_content.view-featured > head > meta:nth-of-type(4)'
retrieved_on: '2025-12-23T21:55:27.491797+00:00'
extraction_method: crawl4ai_meta_og
summary:
total_claims: 3
has_primary_logo: true
has_favicon: true
has_og_image: true
favicon_count: 5

View file

@ -0,0 +1,165 @@
original_entry:
name: Česká křesťanská akademie - Centrální katolická knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE454
locations:
- street_address: Thákurova 3
postal_code: 160 00
city: Praha 6
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.754984+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-CKACKK
ghcid_original: CZ-HL-PRA-H-ČKACKK
ghcid_uuid: 4ef4bd45-9642-5128-b3e9-d4d9edc50960
ghcid_uuid_sha256: 834f87f4-a28d-8c8e-8612-25a472753f8d
ghcid_numeric: 9461930826897628302
record_id: 70f6726f-eb75-4ded-baf4-cf10eca63649
generation_timestamp: '2025-12-06T23:37:25.754984+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-CKACKK
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-H-CKACKK
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-H-CKACKK
ghcid_numeric: 9461930826897628302
valid_from: '2025-12-08T11:21:38.069976+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ČKACKK)'
- ghcid: CZ-HL-PRA-H-ČKACKK
ghcid_numeric: 2876805312996636643
valid_from: '2025-12-06T23:37:25.754984+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:38.069976+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Česká křesťanská akademie - Centrální katolická knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-CKACKK
- identifier_scheme: GHCID_UUID
identifier_value: 4ef4bd45-9642-5128-b3e9-d4d9edc50960
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 834f87f4-a28d-8c8e-8612-25a472753f8d
- identifier_scheme: GHCID_NUMERIC
identifier_value: '9461930826897628302'
- identifier_scheme: RECORD_ID
identifier_value: 70f6726f-eb75-4ded-baf4-cf10eca63649
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.047751+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047751+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Česká křesťanská akademie - Centrální katolická knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047751+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047751+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047751+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE454
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047751+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.754984+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 160 00
street_address: Thákurova 3
normalization_timestamp: '2025-12-09T20:41:03.900039+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:03.900021+00:00'

View file

@ -0,0 +1,263 @@
original_entry:
name: Česká Provincie řádu sv. Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Augustiniánská knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE462
- &id002
identifier_scheme: Website
identifier_value: https://augustiniani.cz/
identifier_url: https://augustiniani.cz/
- &id003
identifier_scheme: Wikidata
identifier_value: Q114619631
identifier_url: https://www.wikidata.org/wiki/Q114619631
locations:
- street_address: Josefská 28/8
postal_code: 118 01
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.088826
longitude: 14.405173
processing_timestamp: '2025-12-06T23:37:27.387413+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
ghcid_original: CZ-HL-PRA-H-ČPŘSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
ghcid_uuid: 6450dbfb-6387-5dff-93ab-f3e0ff979ee7
ghcid_uuid_sha256: d52be405-b36d-8bb1-b10d-b15f1cee5323
ghcid_numeric: 15360621667213487025
record_id: 2955d359-aa74-4f10-b9cb-a9971074c851
generation_timestamp: '2025-12-06T23:37:27.387413+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
ghcid_numeric: 15360621667213487025
valid_from: '2025-12-08T11:21:29.229081+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ČPŘSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas)'
- ghcid: CZ-HL-PRA-H-ČPŘSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
ghcid_numeric: 10650113264983091993
valid_from: '2025-12-06T23:37:27.387413+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:29.229081+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Česká Provincie řádu sv. Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Augustiniánská knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-CPRSAFSTVP-ceska_provincie_radu_sv_augustina_farnost_sv_tomas
- identifier_scheme: GHCID_UUID
identifier_value: 6450dbfb-6387-5dff-93ab-f3e0ff979ee7
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: d52be405-b36d-8bb1-b10d-b15f1cee5323
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15360621667213487025'
- identifier_scheme: RECORD_ID
identifier_value: 2955d359-aa74-4f10-b9cb-a9971074c851
- *id001
- *id002
- *id003
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.062682+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-wFZjrBWmASC3ygUuMmJ9WvXmSxWeu0b3P2suN3Jaa48=
verified_at: '2025-12-28T19:58:22.476372+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619631
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.476396+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062682+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Česká Provincie řádu sv. Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Augustiniánská knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062682+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062682+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062682+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114619631
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114619631
timestamp: '2025-11-19T10:14:30.062682+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE462
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062682+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:27.387413+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114619631
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:25:40.452127+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Česká Provincie řádu sv. Augustina, Farnost sv. Tomáše v Praze na Malé Straně, Augustiniánská knihovna
en: The Order of St. Augustine, St. Thomas Church in Prague, Library
wikidata_label_en: The Order of St. Augustine, St. Thomas Church in Prague, Library
wikidata_label_cs: Česká Provincie řádu sv. Augustina, Farnost sv. Tomáše v Praze na Malé Straně, Augustiniánská knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id004
- id: Q50379845
label: religious library
description: type of library with collections that focus on religion and related subjects
wikidata_instance_of: *id004
wikidata_location:
country: &id005
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id005
wikidata_contact:
email: mailto:osaprag@augustiniani.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-wFZjrBWmASC3ygUuMmJ9WvXmSxWeu0b3P2suN3Jaa48=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.476372+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619631
verification:
status: verified
last_verified: '2025-12-28T19:58:22.476385+00:00'
location:
latitude: 50.088826
longitude: 14.405173
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 01
street_address: Josefská 28/8
normalization_timestamp: '2025-12-09T06:50:46.697835+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:55:34.949111+00:00'
source_url: https://augustiniani.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://augustiniani.cz/wp-content/uploads/2021/03/cropped-znak-osa-180x180.png
source_url: https://augustiniani.cz
css_selector: '[document] > html > head > link:nth-of-type(62)'
retrieved_on: '2025-12-23T21:55:34.949111+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 3

View file

@ -0,0 +1,270 @@
original_entry:
name: Česká provincie řádu svatého Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE458
- &id002
identifier_scheme: IČO
identifier_value: 00569631
- &id003
identifier_scheme: Website
identifier_value: https://augustiniani.cz/
identifier_url: https://augustiniani.cz/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114618792
identifier_url: https://www.wikidata.org/wiki/Q114618792
locations:
- street_address: Josefská 28/8
postal_code: 118 01
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.088825
longitude: 14.405172
processing_timestamp: '2025-12-06T23:37:21.578012+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-CPRSAFSTVP
ghcid_original: CZ-HL-PRA-H-ČPŘSAFSTVP
ghcid_uuid: 5f25d43b-863b-54b1-9d9b-cbfdfc103b98
ghcid_uuid_sha256: 0aecf24a-999e-8c9b-a6c0-02886452c5c0
ghcid_numeric: 787270437101800603
record_id: c6886339-40c0-4790-967c-f6eaab3f4b72
generation_timestamp: '2025-12-06T23:37:21.578012+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-CPRSAFSTVP
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-CPRSAFSTVP
ghcid_numeric: 787270437101800603
valid_from: '2025-12-08T11:21:35.155241+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ČPŘSAFSTVP)'
- ghcid: CZ-HL-PRA-H-ČPŘSAFSTVP
ghcid_numeric: 1916463988346664738
valid_from: '2025-12-06T23:37:21.578012+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:35.155241+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Česká provincie řádu svatého Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-CPRSAFSTVP
- identifier_scheme: GHCID_UUID
identifier_value: 5f25d43b-863b-54b1-9d9b-cbfdfc103b98
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 0aecf24a-999e-8c9b-a6c0-02886452c5c0
- identifier_scheme: GHCID_NUMERIC
identifier_value: '787270437101800603'
- identifier_scheme: RECORD_ID
identifier_value: c6886339-40c0-4790-967c-f6eaab3f4b72
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.009282+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-SljGXkAVhGLTN6EVn1SWAncLhXjJmVaAHZxw289RWxI=
verified_at: '2025-12-28T19:58:22.498279+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114618792
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.498298+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.009282+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Česká provincie řádu svatého Augustina - Farnost sv. Tomáše v Praze na Malé Straně - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.009282+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.009282+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.009282+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114618792
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114618792
timestamp: '2025-11-19T10:14:30.009282+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE458
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.009282+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:21.578012+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114618792
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:25:48.379615+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Česká provincie řádu svatého Augustina, Farnost sv. Tomáše v Praze na Malé Straně, knihovna
en: Bohemica Augustinianorum Pragae
wikidata_label_en: Bohemica Augustinianorum Pragae
wikidata_label_cs: Česká provincie řádu svatého Augustina, Farnost sv. Tomáše v Praze na Malé Straně, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- BAUGP
wikidata_classification:
instance_of: &id005
- id: Q50379845
label: religious library
description: type of library with collections that focus on religion and related subjects
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:osaprag@augustiniani.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-SljGXkAVhGLTN6EVn1SWAncLhXjJmVaAHZxw289RWxI=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.498279+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114618792
verification:
status: verified
last_verified: '2025-12-28T19:58:22.498289+00:00'
location:
latitude: 50.088825
longitude: 14.405172
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 01
street_address: Josefská 28/8
normalization_timestamp: '2025-12-09T06:50:46.723119+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:55:40.149799+00:00'
source_url: https://augustiniani.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://augustiniani.cz/wp-content/uploads/2021/03/cropped-znak-osa-180x180.png
source_url: https://augustiniani.cz
css_selector: '[document] > html > head > link:nth-of-type(62)'
retrieved_on: '2025-12-23T21:55:40.149799+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 3

View file

@ -0,0 +1,251 @@
original_entry:
name: Evangelikální teologický seminář - Vyšší odborná škola teologická a sociální - knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABG450
- &id002
identifier_scheme: IČO
identifier_value: '41690184'
- &id003
identifier_scheme: Website
identifier_value: https://ets.jabok.cuni.cz/eg/opac/home
identifier_url: https://ets.jabok.cuni.cz/eg/opac/home
- &id004
identifier_scheme: Wikidata
identifier_value: Q114619199
identifier_url: https://www.wikidata.org/wiki/Q114619199
locations:
- street_address: Stoliňská 2417/41a
postal_code: 193 00
city: Praha 9
region: Hlavní město Praha
country: CZ
latitude: 50.107464
longitude: 14.592644
processing_timestamp: '2025-12-06T23:37:24.239954+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-ETSVOSTSK
ghcid_original: CZ-HL-PRA-H-ETSVOŠTSK
ghcid_uuid: 3725cb10-ac06-5e92-8bae-37d6b7f6808d
ghcid_uuid_sha256: 43204928-ab70-888c-a318-a62b466affca
ghcid_numeric: 4836946438819739788
record_id: 941be124-274b-4f6c-bdc2-26136c7ab711
generation_timestamp: '2025-12-06T23:37:24.239954+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-ETSVOSTSK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-ETSVOSTSK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-ETSVOSTSK
ghcid_numeric: 4836946438819739788
valid_from: '2025-12-08T11:21:31.045391+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ETSVOŠTSK)'
- ghcid: CZ-HL-PRA-H-ETSVOŠTSK
ghcid_numeric: 12474982366854812758
valid_from: '2025-12-06T23:37:24.239954+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.045391+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Evangelikální teologický seminář - Vyšší odborná škola teologická a sociální - knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-ETSVOSTSK
- identifier_scheme: GHCID_UUID
identifier_value: 3725cb10-ac06-5e92-8bae-37d6b7f6808d
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 43204928-ab70-888c-a318-a62b466affca
- identifier_scheme: GHCID_NUMERIC
identifier_value: '4836946438819739788'
- identifier_scheme: RECORD_ID
identifier_value: 941be124-274b-4f6c-bdc2-26136c7ab711
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.035614+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-9q2YCHCyTeS8SeizRm2nZXth340jdonIZ71izeua7RU=
verified_at: '2025-12-28T19:58:22.518111+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619199
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.518129+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035614+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Evangelikální teologický seminář - Vyšší odborná škola teologická a sociální - knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035614+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035614+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035614+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114619199
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114619199
timestamp: '2025-11-19T10:14:30.035614+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABG450
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035614+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:24.239954+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114619199
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:25:56.334339+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Evangelikální teologický seminář, Vyšší odborná škola teologická a sociální, knihovna
en: Evangelical Teological Seminary, Higher Theological Vocational School
wikidata_label_en: Evangelical Teological Seminary, Higher Theological Vocational School
wikidata_label_cs: Evangelikální teologický seminář, Vyšší odborná škola teologická a sociální, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- ETS-VOŠTS
wikidata_classification:
instance_of: &id005
- id: Q50379845
label: religious library
description: type of library with collections that focus on religion and related subjects
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2750534
label: Prague 9
description: administrative district and municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:info@etspraha.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-9q2YCHCyTeS8SeizRm2nZXth340jdonIZ71izeua7RU=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.518111+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619199
verification:
status: verified
last_verified: '2025-12-28T19:58:22.518120+00:00'
location:
latitude: 50.107464
longitude: 14.592644
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 9
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 193 00
street_address: Stoliňská 2417/41a
normalization_timestamp: '2025-12-09T06:50:46.750937+00:00'

View file

@ -0,0 +1,185 @@
custodian_name:
claim_type: custodian_name
claim_value: Husův institut teologických studií
source_type: ch_annotator
original_entry:
name: Husův institut teologických studií
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE467
- &id002
identifier_scheme: IČO
identifier_value: '73632571'
- &id003
identifier_scheme: Website
identifier_value: https://www.hitspraha.cz/index.php/hits-top.html
identifier_url: https://www.hitspraha.cz/index.php/hits-top.html
locations:
- street_address: Roháčova 1305/66
postal_code: 130 00
city: Praha 3
region: Hlavní město Praha
country: CZ
latitude: 50.088143
longitude: 14.460887
processing_timestamp: '2025-12-06T23:37:43.603692+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-HITS
ghcid_original: CZ-HL-PRA-H-HITS
ghcid_uuid: 46f9b78d-b0a3-51d4-8a9f-13f51703a2c8
ghcid_uuid_sha256: 65f6b716-9bbc-852c-071f-e596068235c3
ghcid_numeric: 7347261149831529772
record_id: fa3e7a61-8914-4960-9e1a-736421221995
generation_timestamp: '2025-12-06T23:37:43.603692+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-HITS
valid_from: '2025-12-10T09:46:57Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-HITS
valid_from: null
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-HITS
ghcid_numeric: 7347261149831529772
valid_from: '2025-12-06T23:37:43.603692+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-HITS
- identifier_scheme: GHCID_UUID
identifier_value: 46f9b78d-b0a3-51d4-8a9f-13f51703a2c8
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 65f6b716-9bbc-852c-071f-e596068235c3
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7347261149831529772'
- identifier_scheme: RECORD_ID
identifier_value: fa3e7a61-8914-4960-9e1a-736421221995
- *id001
- *id002
- *id003
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.309554+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.309554+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: Husův institut teologických studií
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.309554+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.309554+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 3
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.309554+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE467
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.309554+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:43.603692+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.088143
longitude: 14.460887
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 3
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 130 00
street_address: Roháčova 1305/66
normalization_timestamp: '2025-12-09T06:50:46.768907+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:55:48.452790+00:00'
source_url: https://www.hitspraha.cz/index.php/hits-top.html
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.hitspraha.cz/templates/hits/favicon.ico
source_url: https://www.hitspraha.cz/index.php/hits-top.html
css_selector: '[document] > html > head > link:nth-of-type(3)'
retrieved_on: '2025-12-23T21:55:48.452790+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/vnd.microsoft.icon
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,213 @@
original_entry:
name: Katolický domov studujících - domov mládeže a školní jídelna - Knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE463
- &id002
identifier_scheme: IČO
identifier_value: '43001106'
- &id003
identifier_scheme: Website
identifier_value: https://kds.jabok.cuni.cz/eg/opac/home
identifier_url: https://kds.jabok.cuni.cz/eg/opac/home
locations:
- street_address: Černá 1610/14
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.079251
longitude: 14.418292
processing_timestamp: '2025-12-06T23:37:27.465222+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-KDSDMSJK
ghcid_original: CZ-HL-PRA-H-KDSDMŠJK
ghcid_uuid: 1b95dc6b-e729-52d7-89a6-169a765493fa
ghcid_uuid_sha256: b1506a18-e303-8d8f-bef1-d468e3f5cff9
ghcid_numeric: 12776828797970501007
record_id: 93f49c33-a042-4e90-bc3e-f621595371cb
generation_timestamp: '2025-12-06T23:37:27.465222+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-KDSDMSJK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-KDSDMSJK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-KDSDMSJK
ghcid_numeric: 12776828797970501007
valid_from: '2025-12-08T11:21:31.569552+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-KDSDMŠJK)'
- ghcid: CZ-HL-PRA-H-KDSDMŠJK
ghcid_numeric: 10088411494359326508
valid_from: '2025-12-06T23:37:27.465222+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.569552+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Katolický domov studujících - domov mládeže a školní jídelna - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-KDSDMSJK
- identifier_scheme: GHCID_UUID
identifier_value: 1b95dc6b-e729-52d7-89a6-169a765493fa
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b1506a18-e303-8d8f-bef1-d468e3f5cff9
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12776828797970501007'
- identifier_scheme: RECORD_ID
identifier_value: 93f49c33-a042-4e90-bc3e-f621595371cb
- *id001
- *id002
- *id003
- identifier_scheme: Wikidata
identifier_value: Q114619639
identifier_url: https://www.wikidata.org/wiki/Q114619639
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.063047+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-qvancJjwidfjm8sgTqwf/kcf9K/ABK4FBmwC5ASuUOw=
verified_at: '2025-12-28T19:58:22.545896+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619639
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.545911+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.063047+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Katolický domov studujících - domov mládeže a školní jídelna - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.063047+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.063047+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.063047+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE463
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.063047+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:27.465222+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.079251
longitude: 14.418292
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Černá 1610/14
normalization_timestamp: '2025-12-09T06:50:46.792111+00:00'
wikidata_enrichment:
wikidata_id: Q114619639
wikidata_url: https://www.wikidata.org/wiki/Q114619639
matched_by: sigla_identifier
matched_sigla: ABE463
wikidata_label: knihovna Katolického domova studujících v Praze
wikidata_description: knihovna v Praze
enrichment_date: '2025-12-19T10:12:58.406651+00:00'
enrichment_version: 2.1.0
instance_of:
- Q50379845
- Q7075
_provenance:
content_hash:
algorithm: sha256
value: sha256-qvancJjwidfjm8sgTqwf/kcf9K/ABK4FBmwC5ASuUOw=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.545896+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619639
generatedAtTime: '2025-12-19T10:12:58.406651+00:00'
verification:
status: verified
last_verified: '2025-12-28T19:58:22.545903+00:00'
wikidata_entity_id: Q114619639

View file

@ -0,0 +1,231 @@
original_entry:
name: Královská kanonie premonstrátů na Strahově - Strahovská knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE459
- &id002
identifier_scheme: IČO
identifier_value: 00415090
- &id003
identifier_scheme: Website
identifier_value: https://strahovskyklaster.tritius.cz/
identifier_url: https://strahovskyklaster.tritius.cz/
locations:
- street_address: Strahovské nádvoří 132
postal_code: 118 00
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.086114
longitude: 14.388775
processing_timestamp: '2025-12-06T23:37:20.874363+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-KKPSSK
ghcid_original: CZ-HL-PRA-H-KKPSSK
ghcid_uuid: b2c77662-30d1-5657-b6ec-508b07c76189
ghcid_uuid_sha256: d4c0208c-8472-8a7a-19ec-cf9964c4be3f
ghcid_numeric: 15330288919458781818
record_id: 364d0cfc-9693-4138-9891-541407733834
generation_timestamp: '2025-12-06T23:37:20.874363+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-KKPSSK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-KKPSSK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-KKPSSK
ghcid_numeric: 15330288919458781818
valid_from: '2025-12-06T23:37:20.874363+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Královská kanonie premonstrátů na Strahově - Strahovská knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-KKPSSK
- identifier_scheme: GHCID_UUID
identifier_value: b2c77662-30d1-5657-b6ec-508b07c76189
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: d4c0208c-8472-8a7a-19ec-cf9964c4be3f
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15330288919458781818'
- identifier_scheme: RECORD_ID
identifier_value: 364d0cfc-9693-4138-9891-541407733834
- *id001
- *id002
- *id003
- identifier_scheme: Wikidata
identifier_value: Q12056498
identifier_url: https://www.wikidata.org/wiki/Q12056498
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.992094+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-RhhGG/XuQP8MzMGh7TLIasW7BJlzmG09o/v3YuYczE4=
verified_at: '2025-12-28T19:58:22.563127+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q12056498
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.563143+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.992094+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Královská kanonie premonstrátů na Strahově - Strahovská knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.992094+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.992094+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.992094+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE459
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.992094+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:20.874363+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.086114
longitude: 14.388775
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 00
street_address: Strahovské nádvoří 132
normalization_timestamp: '2025-12-09T06:50:46.810708+00:00'
wikidata_enrichment:
wikidata_id: Q12056498
wikidata_url: https://www.wikidata.org/wiki/Q12056498
matched_by: sigla_identifier
matched_sigla: ABE459
wikidata_label: Strahovská knihovna
wikidata_description: klášterní knihovna
enrichment_date: '2025-12-19T10:12:59.125749+00:00'
enrichment_version: 2.1.0
wikidata_coordinates:
longitude: 14.3892417
latitude: 50.0858778
image: https://commons.wikimedia.org/wiki/Special:FilePath/Strahov_knihovna_2.jpg
instance_of:
- Q1776381
located_in: Q1085
_provenance:
content_hash:
algorithm: sha256
value: sha256-RhhGG/XuQP8MzMGh7TLIasW7BJlzmG09o/v3YuYczE4=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.563127+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q12056498
generatedAtTime: '2025-12-19T10:12:59.125749+00:00'
verification:
status: verified
last_verified: '2025-12-28T19:58:22.563134+00:00'
wikidata_entity_id: Q12056498
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:00.222655+00:00'
source_url: https://strahovskyklaster.tritius.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://strahovskyklaster.tritius.cz/apple-touch-icon-180x180.png
source_url: https://strahovskyklaster.tritius.cz
css_selector: '[document] > html > head > link:nth-of-type(14)'
retrieved_on: '2025-12-23T21:56:00.222655+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: 180x180
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 14

View file

@ -0,0 +1,164 @@
original_entry:
name: Mezinárodní baptistický teologický seminář - Knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE453
- &id002
identifier_scheme: IČO
identifier_value: '25741683'
- &id003
identifier_scheme: DIČ
identifier_value: CZ25741683
locations:
- street_address: Nad Habrovkou 3 - Jenerálka
postal_code: 164 00
city: Praha 6
region: Hlavní město Praha
country: CZ
latitude: 50.105856
longitude: 14.351694
processing_timestamp: '2025-12-06T23:37:15.760570+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-MBTSK
ghcid_original: CZ-HL-PRA-H-MBTSK
ghcid_uuid: 04b9e8ce-ced9-53f0-99b8-9c2e092da029
ghcid_uuid_sha256: 55dce9c3-069f-84b7-0b6a-0dd7326feead
ghcid_numeric: 6187077011939239095
record_id: cbc98ac0-743f-4787-8ce0-6e14e643b10b
generation_timestamp: '2025-12-06T23:37:15.760570+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-MBTSK
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-H-MBTSK
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-H-MBTSK
ghcid_numeric: 6187077011939239095
valid_from: '2025-12-06T23:37:15.760570+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Mezinárodní baptistický teologický seminář - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-MBTSK
- identifier_scheme: GHCID_UUID
identifier_value: 04b9e8ce-ced9-53f0-99b8-9c2e092da029
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 55dce9c3-069f-84b7-0b6a-0dd7326feead
- identifier_scheme: GHCID_NUMERIC
identifier_value: '6187077011939239095'
- identifier_scheme: RECORD_ID
identifier_value: cbc98ac0-743f-4787-8ce0-6e14e643b10b
- *id001
- *id002
- *id003
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.913964+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913964+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: Mezinárodní baptistický teologický seminář - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913964+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913964+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913964+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE453
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913964+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.760570+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.105856
longitude: 14.351694
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 6
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 164 00
street_address: Nad Habrovkou 3 - Jenerálka
normalization_timestamp: '2025-12-09T06:50:46.833765+00:00'

View file

@ -0,0 +1,280 @@
original_entry:
name: Provincie bratří františkánů - Knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE450
- &id002
identifier_scheme: IČO
identifier_value: 00169757
- &id003
identifier_scheme: Website
identifier_value: https://www.ofm.cz/
identifier_url: https://www.ofm.cz/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114617806
identifier_url: https://www.wikidata.org/wiki/Q114617806
locations:
- street_address: Františkánský klášter, Jungmannovo nám. 18
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.082922
longitude: 14.422764
processing_timestamp: '2025-12-06T23:37:15.748590+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-PBFK
ghcid_original: CZ-HL-PRA-H-PBFK
ghcid_uuid: 9f43a1a6-41ba-51f2-b7a4-ac6b39b4bbf8
ghcid_uuid_sha256: 03bdfd08-7db8-8530-d93c-26b054e94fcc
ghcid_numeric: 269649765622883632
record_id: d87ab3a6-1853-4ee2-932b-db037763e890
generation_timestamp: '2025-12-06T23:37:15.748590+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-PBFK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-PBFK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-PBFK
ghcid_numeric: 269649765622883632
valid_from: '2025-12-06T23:37:15.748590+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Provincie bratří františkánů - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-PBFK
- identifier_scheme: GHCID_UUID
identifier_value: 9f43a1a6-41ba-51f2-b7a4-ac6b39b4bbf8
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 03bdfd08-7db8-8530-d93c-26b054e94fcc
- identifier_scheme: GHCID_NUMERIC
identifier_value: '269649765622883632'
- identifier_scheme: RECORD_ID
identifier_value: d87ab3a6-1853-4ee2-932b-db037763e890
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.913803+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-U1Y4vNGXYIhTAtrF4kYrXTpmMiLky7takLaS+zkOj1k=
verified_at: '2025-12-28T19:58:22.593319+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617806
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.593344+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913803+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Provincie bratří františkánů - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913803+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913803+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913803+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617806
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617806
timestamp: '2025-11-19T10:14:29.913803+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE450
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913803+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.748590+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617806
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:04.369044+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P18
wikidata_labels:
cs: Provincie bratří františkánů, knihovna
en: Ordo Fratrum Minorum
wikidata_label_en: Ordo Fratrum Minorum
wikidata_label_cs: Provincie bratří františkánů, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- OFM
wikidata_sitelinks:
commonswiki: Category:Franciscan library (Monastery of Our Lady of the Snows)
cswiki: Františkánská knihovna kláštera Panny Marie Sněžné v Praze
wikidata_classification:
instance_of: &id005
- id: Q50379845
label: religious library
description: type of library with collections that focus on religion and related subjects
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id006
wikidata_media:
image: Frantiskanska Knihovna PMS B.jpg
wikidata_image: Frantiskanska Knihovna PMS B.jpg
wikidata_contact:
email: mailto:praha@ofm.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-U1Y4vNGXYIhTAtrF4kYrXTpmMiLky7takLaS+zkOj1k=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.593319+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617806
verification:
status: verified
last_verified: '2025-12-28T19:58:22.593334+00:00'
location:
latitude: 50.082922
longitude: 14.422764
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Františkánský klášter, Jungmannovo nám. 18
normalization_timestamp: '2025-12-09T06:50:46.856803+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:07.748377+00:00'
source_url: https://www.ofm.cz
extraction_method: crawl4ai
claims:
- claim_type: logo_url
claim_value: https://www.ofm.cz/wp-content/uploads/2020/03/Logo-mobil-s-tau.png
source_url: https://www.ofm.cz
css_selector: '#mobhead > div.bg-area.stickyable > div.logo-main-wrap.logo-mob-wrap > div.logo.logo-mobile > a > span.logo-img > img'
retrieved_on: '2025-12-23T21:56:07.748377+00:00'
extraction_method: crawl4ai_header_logo
detection_confidence: high
alt_text: ''
- claim_type: favicon_url
claim_value: https://www.ofm.cz/wp-content/uploads/2020/03/cropped-tau-180x180.png
source_url: https://www.ofm.cz
css_selector: '[document] > html > head > link:nth-of-type(21)'
retrieved_on: '2025-12-23T21:56:07.748377+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
summary:
total_claims: 2
has_primary_logo: true
has_favicon: true
has_og_image: false
favicon_count: 3

View file

@ -0,0 +1,238 @@
original_entry:
name: Židovská obec v Praze - Knihovna
institution_type: HOLY_SITES
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE461
- &id002
identifier_scheme: IČO
identifier_value: 00445258
- &id003
identifier_scheme: Website
identifier_value: https://www.kehilaprag.cz/cs
identifier_url: https://www.kehilaprag.cz/cs
locations:
- street_address: Jáchymova 3
postal_code: 110 01
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.088591
longitude: 14.419118
- street_address: Maiselova 18
postal_code: 110 01
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.089729
longitude: 14.418438
processing_timestamp: '2025-12-06T23:37:27.311926+00:00'
ghcid:
ghcid_current: CZ-10-PRA-H-ZOVPK
ghcid_original: CZ-HL-PRA-H-ŽOVPK
ghcid_uuid: df93d711-09ac-5a4e-8261-47033c74bd57
ghcid_uuid_sha256: ddbab9d1-455e-8454-8d31-c524072da3f0
ghcid_numeric: 15977286936513377364
record_id: f61482d0-de24-410c-a3d4-b29046ced869
generation_timestamp: '2025-12-06T23:37:27.311926+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-H-ZOVPK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-H-ZOVPK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-H-ZOVPK
ghcid_numeric: 15977286936513377364
valid_from: '2025-12-08T11:21:32.038215+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-H-ŽOVPK)'
- ghcid: CZ-HL-PRA-H-ŽOVPK
ghcid_numeric: 13994694256066163100
valid_from: '2025-12-06T23:37:27.311926+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:32.038215+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Židovská obec v Praze - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-H-ZOVPK
- identifier_scheme: GHCID_UUID
identifier_value: df93d711-09ac-5a4e-8261-47033c74bd57
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: ddbab9d1-455e-8454-8d31-c524072da3f0
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15977286936513377364'
- identifier_scheme: RECORD_ID
identifier_value: f61482d0-de24-410c-a3d4-b29046ced869
- *id001
- *id002
- *id003
- identifier_scheme: Wikidata
identifier_value: Q114619623
identifier_url: https://www.wikidata.org/wiki/Q114619623
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.062272+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-BwfWLwIgLXgtI0ZrvBCNnlwPntx8jJnJqHZl3vOwLKs=
verified_at: '2025-12-28T19:58:22.612588+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619623
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.612603+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.HOL
subtype_label: HOLY_SITES
ontology_class: schema:PlaceOfWorship
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Židovská obec v Praze - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: HOLY_SITES
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE461
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.062272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:27.311926+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.088591
longitude: 14.419118
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 01
street_address: Jáchymova 3
normalization_timestamp: '2025-12-09T06:50:46.879233+00:00'
wikidata_enrichment:
wikidata_id: Q114619623
wikidata_url: https://www.wikidata.org/wiki/Q114619623
matched_by: sigla_identifier
matched_sigla: ABE461
wikidata_label: Knihovna Židovské obce v Praze
wikidata_description: knihovna v Praze
enrichment_date: '2025-12-19T10:12:59.880497+00:00'
enrichment_version: 2.1.0
instance_of:
- Q50379845
_provenance:
content_hash:
algorithm: sha256
value: sha256-BwfWLwIgLXgtI0ZrvBCNnlwPntx8jJnJqHZl3vOwLKs=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.612588+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619623
generatedAtTime: '2025-12-19T10:12:59.880497+00:00'
verification:
status: verified
last_verified: '2025-12-28T19:58:22.612594+00:00'
wikidata_entity_id: Q114619623
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:15.191049+00:00'
source_url: https://www.kehilaprag.cz/cs
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.kehilaprag.cz/img/favicon.ico
source_url: https://www.kehilaprag.cz/cs
css_selector: '[document] > html.js.flexbox > head > link:nth-of-type(5)'
retrieved_on: '2025-12-23T21:56:15.191049+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/vnd.microsoft.icon
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,167 @@
original_entry:
name: ARITMA
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF002
locations:
- street_address: Lužná 591
postal_code: 160 05
city: Praha 6
country: CZ
processing_timestamp: '2025-12-06T23:37:23.806189+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-A-aritma
ghcid_original: CZ-XX-PRA-L-A-aritma
ghcid_uuid: 76eb7195-fef9-517a-b3c3-a9c661376eba
ghcid_uuid_sha256: bcf70c56-46dc-8470-481a-41cfc2a8a9d6
ghcid_numeric: 13616365563073590384
record_id: 118fa2dd-4ba6-484e-ad04-d3b1429e6973
generation_timestamp: '2025-12-06T23:37:23.806189+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:56.091145+00:00'
ghcid_history:
- ghcid: CZ-XX-PRA-L-A-aritma
ghcid_numeric: 13616365563073590384
valid_from: '2025-12-06T23:37:23.806189+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
- ghcid: CZ-10-PRA-L-A-aritma
valid_from: '2025-12-07T11:42:56.091156+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: ARITMA
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-XX-PRA-L-A-aritma
- identifier_scheme: GHCID_UUID
identifier_value: 76eb7195-fef9-517a-b3c3-a9c661376eba
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: bcf70c56-46dc-8470-481a-41cfc2a8a9d6
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13616365563073590384'
- identifier_scheme: RECORD_ID
identifier_value: 118fa2dd-4ba6-484e-ad04-d3b1429e6973
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.031409+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:48:54Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031409+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ARITMA
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031409+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031409+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031409+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF002
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031409+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:23.806189+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Aritma AS
rejection_reason: The candidate is rejected due to a location mismatch. The source institution is in the Czech Republic
(CZ), while the Google Maps candidate is in Norway.
timestamp: '2025-12-08T19:48:54.377041+00:00'
youtube_status: NOT_FOUND
youtube_search_query: ARITMA official
youtube_search_timestamp: '2025-12-08T19:48:54.716445+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 160 05
street_address: Lužná 591
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.384106+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.384087+00:00'

View file

@ -0,0 +1,380 @@
custodian_name:
claim_type: custodian_name
claim_value: ARTIA
source_type: ch_annotator
original_entry:
name: ARTIA
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE391
locations:
- street_address: Ve Smečkách 30
postal_code: 111 27
city: Praha 1
country: CZ
processing_timestamp: '2025-12-06T23:37:23.793671+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-A
ghcid_original: CZ-XX-PRA-L-A
ghcid_uuid: 29a89c74-39be-5a59-a59b-478c406d7f2e
ghcid_uuid_sha256: 50203024-eb0a-8b06-f19b-349f090f646f
ghcid_numeric: 5773667657409272582
record_id: e7c66a51-321a-4d0c-a201-37c2ad3e52a3
generation_timestamp: '2025-12-06T23:37:23.793671+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:54.550249+00:00'
ghcid_history:
- ghcid: CZ-XX-PRA-L-A
ghcid_numeric: 5773667657409272582
valid_from: '2025-12-06T23:37:23.793671+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
- ghcid: CZ-10-PRA-L-A
valid_from: '2025-12-07T11:42:54.550261+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-XX-PRA-L-A
- identifier_scheme: GHCID_UUID
identifier_value: 29a89c74-39be-5a59-a59b-478c406d7f2e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 50203024-eb0a-8b06-f19b-349f090f646f
- identifier_scheme: GHCID_NUMERIC
identifier_value: '5773667657409272582'
- identifier_scheme: RECORD_ID
identifier_value: e7c66a51-321a-4d0c-a201-37c2ad3e52a3
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.031357+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:48:58Z: Maps: Aukční galerie Artia (conf: 0.80); YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
schema_version: 2.0.0
enrichment_provenance:
google_maps_enrichment:
content_hash: sha256-XhqEoqSXvzYXCoUgohSnvcuCdFAeXpKseKvmWLEEJhI=
verified_at: '2025-12-28T19:58:22.648107+00:00'
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.648163+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031357+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ARTIA
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031357+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031357+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031357+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE391
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031357+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:23.793671+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_enrichment:
place_id: ChIJPakg9oSUC0cRo5rduYqOuGw
name: Aukční galerie Artia
fetch_timestamp: '2025-12-08T19:48:55.686290+00:00'
api_status: OK
coordinates:
latitude: 50.0832839
longitude: 14.4320287
formatted_address: Opletalova 1417/25, Nové Město, 110 00 Praha-Praha 1, Czechia
short_address: Opletalova 1417/25, Nové Město
phone_local: 222 220 298
phone_international: +420 222 220 298
website: http://www.aukcni-galerie.cz/
google_place_types:
- art_gallery
- finance
- home_goods_store
- point_of_interest
- store
- establishment
business_status: OPERATIONAL
rating: 2.9
total_ratings: 34
reviews:
- author_name: Paul Thomas
author_uri: https://www.google.com/maps/contrib/100778712720214769548/reviews
rating: 5
relative_time_description: 8 years ago
text: I was very satisfied with their service. I can recommend them.
publish_time: '2017-11-21T16:47:48.188Z'
- author_name: Marek Marek
author_uri: https://www.google.com/maps/contrib/109431573954514849120/reviews
rating: 1
relative_time_description: a year ago
text: 'I add my experience with the Olomouc branch:
First of all, I would like to say that I approached the gallery with confidence because I inherited a few paintings.
As an art connoisseur and a busy person, I asked the gallery for an evaluation, pricing and subsequent sale of the
paintings. And I would like to share the negative experience I had, which may help someone when choosing or not choosing
an auction gallery.
On September 7, 2023, I contacted the auction house via WA with a request to see if they would take the paintings
to the auction (8 pieces, photos of all paintings sent to WA), they replied that they would accept all the paintings.
On September 14, 2023, after a telephone appointment, I brought the paintings to the Olomouc branch (my fundamental
mistake was that I did not wait for the list of paintings), I was assured by the branch employee that the list (contract)
would be sent to my email, I agreed and left the branch.
On September 18, 2023, I received an email with a contract where two items were missing from the list and a whirlwind
of inconveniences began.
One painting (religious direction - Madonna) and also "view of the castle".
After a phone call, I was assured by an employee that everything would be fixed and that he would find the painting
and amend the contract...
During October 2023, there were a few phone calls when I was told that the painting was "lost", which the branch employee
admitted that he did not understand how this could have happened and how I would imagine a replacement (I, as a complete
layman in art, and with two photos of the paintings, am supposed to determine the amount of compensation).
And that for the Madonna painting I have to get a document approving the sale - that it was not stolen from the church.
At the beginning of December, I visited the branch in person and the employee still did not know how to explain /
resolve the situation.
That he would discuss the situation with his superior who was sick that week and that he would call back. I would
assume that a gallery that deals with art (I assume even higher values) will be equipped with at least a security
camera. When I asked an employee to check the camera footage, I was told that they don''t have cameras.... In some
discussions, the tone was that I didn''t bring the painting at all and that I was trying to damage the gallery/employee...
I must also mention that I regularly had to call the Olomouc branch on my own initiative and call for a resolution
to the situation.
After a few days, an employee of the Olomouc branch called me for the first time and said that the auction gallery
was offering compensation in the amount of 1000 CZK and if I didn''t agree, I should resolve the situation with the
management in Prague - politely said that they no longer care and let me resolve it with Prague myself.
On 10.1.2024, I picked up the paintings that fortunately were not lost and the compensation.
And let the lost painting itself, or the amount from the sale of the lost painting, make the person in question happy...'
publish_time: '2024-01-10T19:24:14.553436Z'
- author_name: Marcela R
author_uri: https://www.google.com/maps/contrib/107348664986788813882/reviews
rating: 1
relative_time_description: a month ago
text: This gallery clearly does not communicate with clients, I do not recommend anyone to leave their paintings with
them, according to reviews they either lose them or withhold payments.
publish_time: '2025-10-25T17:35:35.975313577Z'
- author_name: Jana Adámková
author_uri: https://www.google.com/maps/contrib/101648565896947406982/reviews
rating: 1
relative_time_description: a year ago
text: From the seller's perspective, my experience with the gallery is absolutely tragic. Mr. Stehlík has been completely
ignoring me for 3/4 years, he doesn't respond, he doesn't pay out money, he has a catastrophic mess in his sold and
unsold paintings. He has had my paintings with him for almost two years and he is unable to pay me for the auctioned
items and return the correct number of unsold ones. His actions are leading to a criminal complaint, which is the
only thing that will help. I DO NOT RECOMMEND this gallery at all!
publish_time: '2024-12-04T11:01:12.318077Z'
- author_name: Lubomír Frejka
author_uri: https://www.google.com/maps/contrib/106970323732632081169/reviews
rating: 1
relative_time_description: a year ago
text: 'It has now been about 2 months since I received the painting that I auctioned and paid for from the ARTIA GALLERY
IN PRAGUE. I was told by one of the employees that he was sorry that they could not find the painting. I was asked
if I wanted a refund or if I wanted to wait until they found it..(???). I said that I would wait until they found
it.
A similar situation happened to me here at the Artia Gallery in Prague for the second time, once before this incident,
they could not find the painting that I auctioned and paid for.
FROM THIS IT CAN BE LOGICALLY ASSUMED THAT THE AUCTIONED ITEMS ARE NOT PAID FOR WHEN THE AUCTIONEER GOES TO PICK UP
THE AUCTIONED ITEM, BUT THE ARTIA GALLERY EMPLOYEES CANNOT FIND THE ITEM(S) AND PROPERLY HAND IT OVER.
!!! AS A CONDITION FOR PAYMENT OF THE AUCTIONED ITEM, THE HEAD OF THE ARTIA GALLERY STATES THAT THE AUCTIONED ITEM
MUST BE PAID BY THE AUCTIONED ITEM AND TAKES THE ITEM BACK!!! ONLY AFTER THE GALLERY CAN SEND THE MONEY TO THE CLIENT.
So far, I have not been paid by the Artia gallery for a large amount of auctioned items that I put up for auction
there. Otherwise, the payment usually took a loooooong time.
My items appeared in auctions only occasionally - very rarely, I often warned them about it and most of the time they
did not appear in auctions again and again.
Yesterday I went to pick up the rest of the un-auctioned items, because I am ending the "relationship" with them,
I did not get everything back. And moreover, I wrote them a question or request in advance for "payment" of my auctioned
items. The boss didn''t comment on this, I asked him personally and when I was leaving, the boss told me that he didn''t
know, that he would look into it and find out. AND I HAD HIM IN MY BACK AS HE SHOWED ME OUT SO THAT I WOULDN''T BE
DELAYED ANYMORE. HE EVEN SENT HIS SUBORDINATE TO HELP ME - HE REALLY HELPED ME LOAD MY THINGS INTO THE CAR AND I THANK
THAT GUY FOR THAT.
!!! USING THE INTERMEDIATION OF THE SALE OF ITEMS THROUGH THE ARTIA GALLERY IN PRAGUE - NEVER AGAINYYYYYY !!!
Now I will continue to wait to see whether the boss of the ARTIA GALLERY IN PRAGUE will send the money or not, so
hopefully I will live to see it.'
publish_time: '2024-08-15T00:20:34.897538Z'
opening_hours:
open_now: false
weekday_text:
- 'Monday: Closed'
- 'Tuesday: 10:00AM5:00PM'
- 'Wednesday: 10:00AM5:00PM'
- 'Thursday: 10:00AM5:00PM'
- 'Friday: Closed'
- 'Saturday: Closed'
- 'Sunday: Closed'
photo_count: 10
photos_metadata:
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHrSgcHRYMycbAUb1ldutEmvNEi3aIlJBTSB4JnR4Z2-SCxIRpHAUoef8r8iarhFedqL_gT6xyVszDlY4A2c6UdVrapiQ3ROwYmuygjuLwuslcgnrV58WrxTgCTIkRtRg6jsbJcwbjromcqQJAv6UJF-E0vQLqRTE_bOhikQPYKJH4EDyKjpr4uy4n2a89c-tSPt8yCLGE3IQ2I1-pVyDP9DYuZm-Wlq14ccrGeF2ib8u2EPieiyldf0bRd52_jF_0Ok49XEaYjfuache0B1copgeWTyIroWs2-IaCJ4mCBJ2w
height: 1800
width: 1200
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHowbxeoJBVwxkiyEfS0S8M95rlkyrNCNl2lzxk7I3FVAhPmK_1YPTu8yBKV_OdZdPPX414GCFSud2BuNZOKLMTepZvolbEoK8uoV6rWj657u0q1tl4xPpc64Lfg8JOg0qD1akDRiRn6stdePKQ8YFIaMrpcJjQnAnPShnw0EyuxoccttiQW0FYNe3LdlbuH2x9RUKt946vC8sSRl1QGCf2un0wCR1XZr7RwXCLbhnP7GTNU6nbkl-52mSHrxKyqfLuOdCAsG0SuDFf9G3P41A49_30eXNvGrgO9EhzNbeUDJw
height: 1800
width: 1200
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHpnLoHJoQfI4tJHD6obZh8oXfGIYA9etUTQZKKfirSs8LyDYVT7UTnSEFhjAldTO5SrSMBWLaPNlKpbdyrqWi9yJ9qIKdOxVRKgrdTfSMqZzjkDy7x2JYbLX7DmHb8ZQXqME4S7IAqsIpqSyP2NRdOo4USnp05aWQn75XnODy_O_G1Bz9I6gPcBnDS985xyDoe04JpvphRiIjgl9VzXjTF3L0SH0-LPQLU8kz_cTCgb-hHCu0Wg7BrWyeXQQxmP8YXGqt4Wb_wyOxuxO9hwSRpr0_xCQxF3otLoFN_9tAvkjg
height: 2081
width: 2551
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHrA6dRqiCr6RAvngdJAuHwduYql3XuiRnEy8D1hvhdwRD61dDQIWpC1ocMsR2W9p9Io403G7gj88QGefMjAyc2qMPHl_h2mi208T1F7Q871nMcXVQZJlntEmH1pHdfSj2Rk7PFp0TKVJfP7SDjTZ4zQKDKKa0hzbpjD_PA6r3w6aeiLd_m55aTjPIK-WtPYaJlx5sl8sScDQYs4N8s4yUULmDJDyu7Xd-4S_t8eVSLJS_9ybMvfqasKP9GZPez3SNxFSOmab5ODdo3D02ry9IrIxkGtlE3ovlquOJBOh4mPyg
height: 800
width: 1200
- name: places/ChIJPakg9oSUC0cRo5rduYqOuGw/photos/AZLasHqq9korXxypDF6stf9Jm5YdSsOs3WGvQKHGN3GiNK8zaWoy5qi3bVDRFjwdLMtHvM3ft5aCTaGfREeGY0-et5siga7NHjnPUFfmvh-lseYEzIy683ejQt8VgYX22GTzLauRp-i_X3TmvMLfHX9lMeYdMjU_k2yDWL_Q3vv64vyO9fdPSP4gHUof0rM2WNMffyW87NpYWJ0cCnAe5lVEaaIWsfzbnRBDwAO2BXRyCLE1B-3vX03rMfspp5-Tj5oFcqHZFWTti7gW_3svOtzBqDeBM-GMsveoIiyLe2nm3zKLsg
height: 2233
width: 1680
google_maps_url: https://maps.google.com/?cid=7834168278285916835&g_mp=Cidnb29nbGUubWFwcy5wbGFjZXMudjEuUGxhY2VzLlNlYXJjaFRleHQQAhgEIAA
llm_verification:
is_match: true
confidence: 0.8
entity_type: GRP.HER
reasoning: 'NAME MATCH: The candidate name ''Aukční galerie Artia'' is a direct and specific match for the source institution
''ARTIA''. The added words ''Aukční galerie'' (Auction Gallery) specify the institution''s function. LOCATION MATCH:
Both the source (CZ) and candidate (Czechia) are in the same country. The source has no city specified, but a match
is possible. TYPE MATCH: The candidate has the Google Place type ''art_gallery'', which is an expected type for a GRP.HER
institution. ENTITY TYPE: The candidate is an auction gallery, which falls under the definition of an art gallery, a
type of heritage custodian.'
agent: glm-4.6
verified: true
ch_annotator_version: ch_annotator-v1_7_0
_provenance:
content_hash:
algorithm: sha256
value: sha256-XhqEoqSXvzYXCoUgohSnvcuCdFAeXpKseKvmWLEEJhI=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.648107+00:00'
prov:
wasDerivedFrom: https://maps.googleapis.com/maps/api/place/details/json?place_id=ChIJPakg9oSUC0cRo5rduYqOuGw
generatedAtTime: '2025-12-08T19:48:55.686290+00:00'
wasGeneratedBy:
'@type': prov:Activity
name: google_maps_api_fetch
used: https://maps.googleapis.com/maps/api/place
verification:
status: verified
last_verified: '2025-12-28T19:58:22.648119+00:00'
google_maps_status: SUCCESS
youtube_status: NOT_FOUND
youtube_search_query: ARTIA official
youtube_search_timestamp: '2025-12-08T19:48:58.971457+00:00'
location:
latitude: 50.0832839
longitude: 14.4320287
coordinate_provenance:
source_type: GOOGLE_MAPS
source_path: google_maps_enrichment.coordinates
original_timestamp: '2025-12-08T19:48:55.686290+00:00'
entity_id: ChIJPakg9oSUC0cRo5rduYqOuGw
city: Praha 1
region: CZ-10
region_code: '10'
country: CZ
postal_code: 111 27
street_address: Opletalova 1417/25, Nové Město
formatted_address: Opletalova 1417/25, Nové Město, 110 00 Praha-Praha 1, Czechia
geonames_id: 8629195
normalization_timestamp: '2025-12-09T06:49:29.190208+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:24.466143+00:00'
source_url: http://www.aukcni-galerie.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: http://www.aukcni-galerie.cz/www/images/favicons/safari-pinned-tab.svg
source_url: http://www.aukcni-galerie.cz
css_selector: '[document] > html > head > link:nth-of-type(7)'
retrieved_on: '2025-12-23T21:56:24.466143+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 5

View file

@ -0,0 +1,279 @@
original_entry:
name: Anglo-americká vysoká škola - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABD185
- &id002
identifier_scheme: IČO
identifier_value: '25940082'
- &id003
identifier_scheme: Website
identifier_value: https://www.aauni.edu/student-life/student-services-support/library/
identifier_url: https://www.aauni.edu/student-life/student-services-support/library/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114625129
identifier_url: https://www.wikidata.org/wiki/Q114625129
locations:
- street_address: Letenská 1
postal_code: 118 00
city: Praha 1 - Malá Strana
region: Hlavní město Praha
country: CZ
latitude: 50.089958
longitude: 14.409755
processing_timestamp: '2025-12-06T23:37:43.296691+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AAVSK
ghcid_original: CZ-HL-PRA-L-AAVŠK
ghcid_uuid: fa01e47a-1395-5d3b-b3f5-fd7a8a7e0203
ghcid_uuid_sha256: 0e5fbd97-5d0e-8894-a497-4881404388a0
ghcid_numeric: 1035754897117431956
record_id: fa23751d-4037-462c-8093-fe2a739fdcb5
generation_timestamp: '2025-12-06T23:37:43.296691+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AAVSK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AAVSK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AAVSK
ghcid_numeric: 1035754897117431956
valid_from: '2025-12-08T11:21:31.224194+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AAVŠK)'
- ghcid: CZ-HL-PRA-L-AAVŠK
ghcid_numeric: 3715561979721111793
valid_from: '2025-12-06T23:37:43.296691+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.224194+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Anglo-americká vysoká škola - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AAVSK
- identifier_scheme: GHCID_UUID
identifier_value: fa01e47a-1395-5d3b-b3f5-fd7a8a7e0203
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 0e5fbd97-5d0e-8894-a497-4881404388a0
- identifier_scheme: GHCID_NUMERIC
identifier_value: '1035754897117431956'
- identifier_scheme: RECORD_ID
identifier_value: fa23751d-4037-462c-8093-fe2a739fdcb5
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.303976+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-u9mzMLp1Si5DnHEknWbZcHYBr6yUDdVUyLMZKUJcdi4=
verified_at: '2025-12-28T19:58:22.674183+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625129
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.674202+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303976+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Anglo-americká vysoká škola - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303976+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303976+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1 - Malá Strana
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303976+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114625129
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114625129
timestamp: '2025-11-19T10:14:30.303976+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABD185
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303976+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:43.296691+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114625129
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:12.304688+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P463
wikidata_labels:
cs: Anglo-americká vysoká škola, knihovna
wikidata_label_cs: Anglo-americká vysoká škola, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id005
- id: Q1622062
label: university library
description: general library at an institution of higher education
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q753289
label: Malá Strana
description: neighborhood of Prague
wikidata_country: *id006
wikidata_organization:
member_of:
- id: Q4809890
label: Association of Library and Information Professionals of the Czech Republic
description: organization
wikidata_contact:
email: mailto:library@aauni.edu
_provenance:
content_hash:
algorithm: sha256
value: sha256-u9mzMLp1Si5DnHEknWbZcHYBr6yUDdVUyLMZKUJcdi4=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.674183+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625129
verification:
status: verified
last_verified: '2025-12-28T19:58:22.674193+00:00'
location:
latitude: 50.089958
longitude: 14.409755
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1 - Malá Strana
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 00
street_address: Letenská 1
normalization_timestamp: '2025-12-09T06:50:46.899191+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:29.425209+00:00'
source_url: https://www.aauni.edu/student-life/student-services-support/library
extraction_method: crawl4ai
claims:
- claim_type: logo_url
claim_value: https://res.cloudinary.com/aauni/w_352,h_72,c_fill,q_auto:best,f_auto/web/2020/04/aau-logo.png
source_url: https://www.aauni.edu/student-life/student-services-support/library
css_selector: '#document > body.wp-singular.page-template > div.container.container_website > header > div.container.container_border > div.row.row_justify-space > div.logo > a > img'
retrieved_on: '2025-12-23T21:56:29.425209+00:00'
extraction_method: crawl4ai_header_logo
detection_confidence: high
alt_text: aau-logo
- claim_type: favicon_url
claim_value: http://www.aauni.edu/wp-content/themes/ds-prime/safari-pinned-tab.svg
source_url: https://www.aauni.edu/student-life/student-services-support/library
css_selector: '#document > head > link:nth-of-type(4)'
retrieved_on: '2025-12-23T21:56:29.425209+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
summary:
total_claims: 2
has_primary_logo: true
has_favicon: true
has_og_image: false
favicon_count: 4

View file

@ -0,0 +1,161 @@
custodian_name:
claim_type: custodian_name
claim_value: ART CENTRUM
source_type: ch_annotator
original_entry:
name: ART CENTRUM
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE169
locations:
- street_address: Nerudova 16
postal_code: 118 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.342432+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AC
ghcid_original: CZ-HL-PRA-L-AC
ghcid_uuid: db7cbf05-3f46-5268-9150-644896014727
ghcid_uuid_sha256: aa99e5c4-da60-85de-c377-ca43c3d6deff
ghcid_numeric: 12293109291524617694
record_id: bcafb00d-8c2d-459b-8d01-c5725dbe58e9
generation_timestamp: '2025-12-06T23:37:15.342432+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AC
valid_from: '2025-12-10T09:46:57Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AC
valid_from: null
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AC
ghcid_numeric: 12293109291524617694
valid_from: '2025-12-06T23:37:15.342432+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AC
- identifier_scheme: GHCID_UUID
identifier_value: db7cbf05-3f46-5268-9150-644896014727
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: aa99e5c4-da60-85de-c377-ca43c3d6deff
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12293109291524617694'
- identifier_scheme: RECORD_ID
identifier_value: bcafb00d-8c2d-459b-8d01-c5725dbe58e9
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.908583+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908583+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ART CENTRUM
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908583+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908583+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908583+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE169
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908583+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.342432+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 00
street_address: Nerudova 16
normalization_timestamp: '2025-12-09T20:41:03.958695+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:03.958672+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: AstraZeneca Czech Republic, s.r.o. - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF315
- &id002
identifier_scheme: IČO
identifier_value: '63984482'
locations:
- street_address: Jinonice 921
postal_code: 158 00
city: Praha 5
region: Hlavní město Praha
country: CZ
latitude: 50.055986
longitude: 14.375324
processing_timestamp: '2025-12-06T23:37:16.225751+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ACRSRK
ghcid_original: CZ-HL-PRA-L-ACRSRK
ghcid_uuid: cc4caad2-e43e-5bbc-8cce-57d96fc83458
ghcid_uuid_sha256: 2d85a754-0282-8671-b9cd-b9ef6a47f693
ghcid_numeric: 3280211882870462065
record_id: 1a9051b6-2094-4c18-b1c7-d6b4f561c74e
generation_timestamp: '2025-12-06T23:37:16.225751+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ACRSRK
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-ACRSRK
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-ACRSRK
ghcid_numeric: 3280211882870462065
valid_from: '2025-12-06T23:37:16.225751+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AstraZeneca Czech Republic, s.r.o. - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ACRSRK
- identifier_scheme: GHCID_UUID
identifier_value: cc4caad2-e43e-5bbc-8cce-57d96fc83458
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 2d85a754-0282-8671-b9cd-b9ef6a47f693
- identifier_scheme: GHCID_NUMERIC
identifier_value: '3280211882870462065'
- identifier_scheme: RECORD_ID
identifier_value: 1a9051b6-2094-4c18-b1c7-d6b4f561c74e
- *id001
- *id002
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.917728+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917728+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: AstraZeneca Czech Republic, s.r.o. - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917728+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917728+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 5
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917728+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF315
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917728+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.225751+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.055986
longitude: 14.375324
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 5
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 158 00
street_address: Jinonice 921
normalization_timestamp: '2025-12-09T06:50:46.935738+00:00'

View file

@ -0,0 +1,169 @@
original_entry:
name: AKCENT College s.r.o. - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABG320
- &id002
identifier_scheme: IČO
identifier_value: '27259439'
- &id003
identifier_scheme: DIČ
identifier_value: CZ27259439
- &id004
identifier_scheme: Website
identifier_value: http://akcentcollege.cz/
identifier_url: http://akcentcollege.cz/
locations:
- street_address: Bítovská 5
postal_code: 140 00
city: Praha 4
region: Hlavní město Praha
country: CZ
latitude: 50.046178
longitude: 14.454242
processing_timestamp: '2025-12-06T23:37:37.561076+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ACSRK
ghcid_original: CZ-HL-PRA-L-ACSRK
ghcid_uuid: 025e0773-85f7-581d-a32c-9d41a749ba6c
ghcid_uuid_sha256: ecc50b69-0996-88e3-5be9-a546c5b76de8
ghcid_numeric: 17061055309123332323
record_id: 704ef8ec-3d1c-4561-a641-671398d77657
generation_timestamp: '2025-12-06T23:37:37.561076+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ACSRK
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-ACSRK
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-ACSRK
ghcid_numeric: 17061055309123332323
valid_from: '2025-12-06T23:37:37.561076+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AKCENT College s.r.o. - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ACSRK
- identifier_scheme: GHCID_UUID
identifier_value: 025e0773-85f7-581d-a32c-9d41a749ba6c
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: ecc50b69-0996-88e3-5be9-a546c5b76de8
- identifier_scheme: GHCID_NUMERIC
identifier_value: '17061055309123332323'
- identifier_scheme: RECORD_ID
identifier_value: 704ef8ec-3d1c-4561-a641-671398d77657
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.209358+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.209358+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: AKCENT College s.r.o. - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.209358+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.209358+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.209358+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABG320
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.209358+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:37.561076+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.046178
longitude: 14.454242
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 4
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 140 00
street_address: Bítovská 5
normalization_timestamp: '2025-12-09T06:50:46.958551+00:00'

View file

@ -0,0 +1,228 @@
custodian_name:
claim_type: custodian_name
claim_value: Americké centrum - Velvyslanectví USA - knihovna
source_type: ch_annotator
original_entry:
name: Americké centrum - Velvyslanectví USA - knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE193
- &id002
identifier_scheme: Website
identifier_value: https://merkur.tritius.cz/library/americkecentrum/
identifier_url: https://merkur.tritius.cz/library/americkecentrum/
locations:
- street_address: Tržiště 13
postal_code: 118 01
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.087198
longitude: 14.402043
processing_timestamp: '2025-12-06T23:37:27.015821+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ACVUK
ghcid_original: CZ-HL-PRA-L-ACVUK
ghcid_uuid: 4e7ba70c-00a8-5e05-93c3-b52fa276cc35
ghcid_uuid_sha256: 5768f4cb-1461-8e1e-273d-20cdae6537db
ghcid_numeric: 6298553231935086110
record_id: 44876a62-12bc-4625-853f-5637562ba19f
generation_timestamp: '2025-12-06T23:37:27.015821+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ACVUK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-ACVUK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-ACVUK
ghcid_numeric: 6298553231935086110
valid_from: '2025-12-06T23:37:27.015821+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ACVUK
- identifier_scheme: GHCID_UUID
identifier_value: 4e7ba70c-00a8-5e05-93c3-b52fa276cc35
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 5768f4cb-1461-8e1e-273d-20cdae6537db
- identifier_scheme: GHCID_NUMERIC
identifier_value: '6298553231935086110'
- identifier_scheme: RECORD_ID
identifier_value: 44876a62-12bc-4625-853f-5637562ba19f
- *id001
- *id002
- identifier_scheme: Wikidata
identifier_value: Q10817014
identifier_url: https://www.wikidata.org/wiki/Q10817014
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.059345+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-gIZuWVWgdD1I4IEkEbjObCa+m84Z/CPDXPrUsfGYlc8=
verified_at: '2025-12-28T19:58:22.721704+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q10817014
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.721722+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.059345+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Americké centrum - Velvyslanectví USA - knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.059345+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.059345+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.059345+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE193
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.059345+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:27.015821+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.087198
longitude: 14.402043
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 01
street_address: Tržiště 13
normalization_timestamp: '2025-12-09T06:50:46.978265+00:00'
wikidata_enrichment:
wikidata_id: Q10817014
wikidata_url: https://www.wikidata.org/wiki/Q10817014
matched_by: sigla_identifier
matched_sigla: ABE193
wikidata_label: Americké centrum
wikidata_description: kulturní a informační centrum v Praze
enrichment_date: '2025-12-19T10:13:00.700773+00:00'
enrichment_version: 2.1.0
wikidata_coordinates:
longitude: 14.4020222
latitude: 50.0871972
official_website: http://www.americkecentrum.cz/
image: https://commons.wikimedia.org/wiki/Special:FilePath/Knihovna_Amerického_centra_v_Praze,_Vratislavský_palác,_Tržiště_13,_Malá_Strana.jpg
instance_of:
- Q7075
located_in: Q1085
_provenance:
content_hash:
algorithm: sha256
value: sha256-gIZuWVWgdD1I4IEkEbjObCa+m84Z/CPDXPrUsfGYlc8=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.721704+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q10817014
generatedAtTime: '2025-12-19T10:13:00.700773+00:00'
verification:
status: verified
last_verified: '2025-12-28T19:58:22.721712+00:00'
wikidata_entity_id: Q10817014
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:46.760476+00:00'
source_url: https://merkur.tritius.cz/library/americkecentrum
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://merkur.tritius.cz/apple-touch-icon-180x180.png
source_url: https://merkur.tritius.cz/library/americkecentrum
css_selector: '[document] > html > head > link:nth-of-type(14)'
retrieved_on: '2025-12-23T21:56:46.760476+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: 180x180
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 14

View file

@ -0,0 +1,160 @@
original_entry:
name: AERO HOLDING, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF184
locations:
- street_address: Beranových 130
postal_code: 199 04
city: Praha 9 - Letňany
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.121078+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AHS
ghcid_original: CZ-HL-PRA-L-AHS
ghcid_uuid: df88c0a2-a575-52d1-b7ff-863c7db10bd0
ghcid_uuid_sha256: f7dc7b4f-cb80-8f38-1590-178d46991ada
ghcid_numeric: 17860285804891377464
record_id: 2f7af94f-56d1-4625-b35f-7b827ef121b0
generation_timestamp: '2025-12-06T23:37:16.121078+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AHS
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AHS
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AHS
ghcid_numeric: 17860285804891377464
valid_from: '2025-12-06T23:37:16.121078+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AERO HOLDING, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AHS
- identifier_scheme: GHCID_UUID
identifier_value: df88c0a2-a575-52d1-b7ff-863c7db10bd0
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: f7dc7b4f-cb80-8f38-1590-178d46991ada
- identifier_scheme: GHCID_NUMERIC
identifier_value: '17860285804891377464'
- identifier_scheme: RECORD_ID
identifier_value: 2f7af94f-56d1-4625-b35f-7b827ef121b0
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.916759+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:46Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916759+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AERO HOLDING, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916759+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916759+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9 - Letňany
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916759+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF184
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916759+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.121078+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 199 04
street_address: Beranových 130
normalization_timestamp: '2025-12-09T20:41:04.012779+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.012755+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: Asociace hudebních umělců a vědců
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE414
locations:
- street_address: Radlická 99
postal_code: 150 00
city: Praha 5
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.711374+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AHUV
ghcid_original: CZ-HL-PRA-L-AHUV
ghcid_uuid: 17d2ba54-4381-58e4-b630-c602beb6e1c7
ghcid_uuid_sha256: 12e7c450-6a56-84cb-c4ea-873bd4746685
ghcid_numeric: 1362273261963334859
record_id: e1401c07-fa12-4691-9fd3-f54dad479bf1
generation_timestamp: '2025-12-06T23:37:15.711374+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AHUV
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AHUV
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AHUV
ghcid_numeric: 1362273261963334859
valid_from: '2025-12-06T23:37:15.711374+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Asociace hudebních umělců a vědců
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AHUV
- identifier_scheme: GHCID_UUID
identifier_value: 17d2ba54-4381-58e4-b630-c602beb6e1c7
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 12e7c450-6a56-84cb-c4ea-873bd4746685
- identifier_scheme: GHCID_NUMERIC
identifier_value: '1362273261963334859'
- identifier_scheme: RECORD_ID
identifier_value: e1401c07-fa12-4691-9fd3-f54dad479bf1
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.913416+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913416+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Asociace hudebních umělců a vědců
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913416+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913416+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 5
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913416+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE414
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913416+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.711374+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 150 00
street_address: Radlická 99
normalization_timestamp: '2025-12-09T20:41:04.060480+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.060454+00:00'

View file

@ -0,0 +1,272 @@
original_entry:
name: Albatros Media, a.s. - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE395
- &id002
identifier_scheme: IČO
identifier_value: '28377141'
- &id003
identifier_scheme: DIČ
identifier_value: CZ28377141
- &id004
identifier_scheme: Website
identifier_value: https://www.albatros.cz/
identifier_url: https://www.albatros.cz/
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617801
identifier_url: https://www.wikidata.org/wiki/Q114617801
locations:
- street_address: 5. května 1746/22
postal_code: 140 00
city: Praha 4
region: Hlavní město Praha
country: CZ
latitude: 50.055332
longitude: 14.438384
processing_timestamp: '2025-12-06T23:37:15.679722+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AMSK
ghcid_original: CZ-HL-PRA-L-AMSK
ghcid_uuid: c1e86a97-1cbf-512b-a55d-f2bef63e9ec9
ghcid_uuid_sha256: 385251a3-c9cc-824f-4807-3e6f793ecbe5
ghcid_numeric: 4058395978121359951
record_id: 0c98496e-9330-4f00-8187-75bd0a8db3d6
generation_timestamp: '2025-12-06T23:37:15.679722+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AMSK
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AMSK
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AMSK
ghcid_numeric: 4058395978121359951
valid_from: '2025-12-06T23:37:15.679722+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Albatros Media, a.s. - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AMSK
- identifier_scheme: GHCID_UUID
identifier_value: c1e86a97-1cbf-512b-a55d-f2bef63e9ec9
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 385251a3-c9cc-824f-4807-3e6f793ecbe5
- identifier_scheme: GHCID_NUMERIC
identifier_value: '4058395978121359951'
- identifier_scheme: RECORD_ID
identifier_value: 0c98496e-9330-4f00-8187-75bd0a8db3d6
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.913103+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-9QX7o3HSaCfPf0zp6mHwoxtSUV64HvydJehpPRTPjTE=
verified_at: '2025-12-28T19:58:22.760818+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617801
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.760835+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913103+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Albatros Media, a.s. - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913103+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913103+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913103+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617801
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617801
timestamp: '2025-11-19T10:14:29.913103+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE395
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913103+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.679722+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617801
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:20.334630+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Albatros Media, a.s., knihovna
en: Albatros Media, Ltd., Library
wikidata_label_en: Albatros Media, Ltd., Library
wikidata_label_cs: Albatros Media, a.s., knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id006
- id: Q385994
label: special library
description: specialized library providing resources on a particular topic or discipline
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2686587
label: Prague 4
description: municipal part of Prague
wikidata_country: *id007
wikidata_contact:
email: mailto:albatros@albatrosmedia.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-9QX7o3HSaCfPf0zp6mHwoxtSUV64HvydJehpPRTPjTE=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.760818+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617801
verification:
status: verified
last_verified: '2025-12-28T19:58:22.760826+00:00'
location:
latitude: 50.055332
longitude: 14.438384
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 4
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 140 00
street_address: 5. května 1746/22
normalization_timestamp: '2025-12-09T06:50:47.041381+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:56:57.568457+00:00'
source_url: https://www.albatros.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.albatros.cz/Content/images/favicons/albatros/safari-pinned-tab.svg
source_url: https://www.albatros.cz
css_selector: '[document] > html.js > head > link:nth-of-type(7)'
retrieved_on: '2025-12-23T21:56:57.568457+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
- claim_type: og_image_url
claim_value: https://www.albatros.cz/Content/images/albatrosmedia.png
source_url: https://www.albatros.cz
css_selector: '[document] > html.js > head > meta:nth-of-type(12)'
retrieved_on: '2025-12-23T21:56:57.568457+00:00'
extraction_method: crawl4ai_meta_og
summary:
total_claims: 2
has_primary_logo: false
has_favicon: true
has_og_image: true
favicon_count: 4

View file

@ -0,0 +1,275 @@
original_entry:
name: Akademie múzických umění v Praze - Knihovny AMU
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABD020
- &id002
identifier_scheme: IČO
identifier_value: '61384984'
- &id003
identifier_scheme: DIČ
identifier_value: CZ61384984
- &id004
identifier_scheme: Website
identifier_value: https://tritius.amu.cz/library/damu
identifier_url: https://tritius.amu.cz/library/damu
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617654
identifier_url: https://www.wikidata.org/wiki/Q114617654
locations:
- street_address: Malostranské nám. 13
postal_code: 118 00
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.088175
longitude: 14.402125
processing_timestamp: '2025-12-06T23:37:14.752021+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AMUVPKA
ghcid_original: CZ-HL-PRA-L-AMUVPKA
ghcid_uuid: b3555de4-5475-59d4-86d8-b7fa1f70ef1f
ghcid_uuid_sha256: d89fa37d-eda3-8c48-3afa-91cf9efb0c6d
ghcid_numeric: 15609374594742574152
record_id: 838af26a-375a-44d3-906b-bf4e512bbd58
generation_timestamp: '2025-12-06T23:37:14.752021+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AMUVPKA
valid_from: '2025-12-10T09:46:57Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AMUVPKA
valid_from:
valid_to: '2025-12-10T09:46:57Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AMUVPKA
ghcid_numeric: 15609374594742574152
valid_from: '2025-12-06T23:37:14.752021+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Akademie múzických umění v Praze - Knihovny AMU
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AMUVPKA
- identifier_scheme: GHCID_UUID
identifier_value: b3555de4-5475-59d4-86d8-b7fa1f70ef1f
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: d89fa37d-eda3-8c48-3afa-91cf9efb0c6d
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15609374594742574152'
- identifier_scheme: RECORD_ID
identifier_value: 838af26a-375a-44d3-906b-bf4e512bbd58
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.805576+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-+35CKkOOVUe+x+M9wiRhyAWq3YDjjK3Lq69YsuZ8PvE=
verified_at: '2025-12-28T19:58:22.781623+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617654
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.781639+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805576+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie múzických umění v Praze - Knihovny AMU
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805576+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805576+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805576+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617654
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617654
timestamp: '2025-11-19T10:14:29.805576+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABD020
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805576+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.752021+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617654
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:28.375300+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P463
wikidata_labels:
cs: Akademie múzických umění v Praze, Knihovny AMU
en: Academy of Performing Arts in Prague, Libraries
wikidata_label_en: Academy of Performing Arts in Prague, Libraries
wikidata_label_cs: Akademie múzických umění v Praze, Knihovny AMU
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- AMU
wikidata_classification:
instance_of: &id006
- id: Q1622062
label: university library
description: general library at an institution of higher education
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id007
wikidata_organization:
member_of:
- id: Q4809890
label: Association of Library and Information Professionals of the Czech Republic
description: organization
wikidata_contact:
email: mailto:knihovna.damu@damu.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-+35CKkOOVUe+x+M9wiRhyAWq3YDjjK3Lq69YsuZ8PvE=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.781623+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617654
verification:
status: verified
last_verified: '2025-12-28T19:58:22.781631+00:00'
location:
latitude: 50.088175
longitude: 14.402125
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 00
street_address: Malostranské nám. 13
normalization_timestamp: '2025-12-09T06:50:47.066498+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:57:03.111057+00:00'
source_url: https://tritius.amu.cz/library/damu
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://tritius.amu.cz/apple-touch-icon-180x180.png
source_url: https://tritius.amu.cz/library/damu
css_selector: '[document] > html > head > link:nth-of-type(14)'
retrieved_on: '2025-12-23T21:57:03.111057+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: 180x180
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 14

View file

@ -0,0 +1,165 @@
original_entry:
name: ACADEMIA, nakladatelství AV ČR
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB067
locations:
- street_address: Legerova 61
postal_code: 120 00
city: Praha 2
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.357057+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ANAC-academia_nakladatelstvi_av_cr
ghcid_original: CZ-HL-PRA-L-ANAČ-academia_nakladatelstvi_av_cr
ghcid_uuid: 6eefcdea-8b2b-53b0-8440-999db38fbe76
ghcid_uuid_sha256: 7059f920-6ced-8800-b2b6-2dac6ecae7eb
ghcid_numeric: 8095775722836920320
record_id: 5855f68a-ce13-47d7-a9a4-4267d226f758
generation_timestamp: '2025-12-06T23:37:14.357057+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ANAC-academia_nakladatelstvi_av_cr
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-ANAC-academia_nakladatelstvi_av_cr
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-ANAC-academia_nakladatelstvi_av_cr
ghcid_numeric: 8095775722836920320
valid_from: '2025-12-08T11:21:22.227365+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ANAČ-academia_nakladatelstvi_av_cr)'
- ghcid: CZ-HL-PRA-L-ANAČ-academia_nakladatelstvi_av_cr
ghcid_numeric: 583388196963575266
valid_from: '2025-12-06T23:37:14.357057+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:22.227365+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ACADEMIA, nakladatelství AV ČR
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ANAC-academia_nakladatelstvi_av_cr
- identifier_scheme: GHCID_UUID
identifier_value: 6eefcdea-8b2b-53b0-8440-999db38fbe76
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 7059f920-6ced-8800-b2b6-2dac6ecae7eb
- identifier_scheme: GHCID_NUMERIC
identifier_value: '8095775722836920320'
- identifier_scheme: RECORD_ID
identifier_value: 5855f68a-ce13-47d7-a9a4-4267d226f758
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.799649+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:11Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ACADEMIA, nakladatelství AV ČR
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB067
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.357057+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 120 00
street_address: Legerova 61
normalization_timestamp: '2025-12-09T20:41:04.123334+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.123312+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: ACADEMIA, nakladatelství AV ČR
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB067
locations:
- street_address: Legerova 61
postal_code: 120 00
city: Praha 2
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:35:06.403081+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ANAC
ghcid_original: CZ-HL-PRA-L-ANAČ
ghcid_uuid: a8e8fa95-0774-5fc3-afff-70a2447670b9
ghcid_uuid_sha256: f12b095c-1ccc-8763-9626-4e69a3314c32
ghcid_numeric: 17377993878363981667
record_id: 5f2390fe-8032-4741-b321-f806a60356df
generation_timestamp: '2025-12-06T23:35:06.403081+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ANAC
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-ANAC
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-ANAC
ghcid_numeric: 17377993878363981667
valid_from: '2025-12-08T11:21:22.663796+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ANAČ)'
- ghcid: CZ-HL-PRA-L-ANAČ
ghcid_numeric: 15633231074025097142
valid_from: '2025-12-06T23:35:06.403081+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:22.663796+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ACADEMIA, nakladatelství AV ČR
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ANAC
- identifier_scheme: GHCID_UUID
identifier_value: a8e8fa95-0774-5fc3-afff-70a2447670b9
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: f12b095c-1ccc-8763-9626-4e69a3314c32
- identifier_scheme: GHCID_NUMERIC
identifier_value: '17377993878363981667'
- identifier_scheme: RECORD_ID
identifier_value: 5f2390fe-8032-4741-b321-f806a60356df
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.799649+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ACADEMIA, nakladatelství AV ČR
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB067
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799649+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.403081+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 120 00
street_address: Legerova 61
normalization_timestamp: '2025-12-09T20:41:04.179472+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.179459+00:00'

View file

@ -0,0 +1,167 @@
original_entry:
name: Automobilový průmysl
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF015
locations:
- street_address: Rašínovo nábř. 42
postal_code: 120 00
city: Praha 2
country: CZ
processing_timestamp: '2025-12-06T23:37:23.820271+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AP
ghcid_original: CZ-XX-PRA-L-AP
ghcid_uuid: 490a6b0c-0f6f-5df9-bd2c-854da65d18f3
ghcid_uuid_sha256: fb11807f-dfaf-898d-1f4a-ee52df3fd0b3
ghcid_numeric: 18091382464826001805
record_id: 98145ed4-5147-40d4-afe7-0a5b0ffb9e30
generation_timestamp: '2025-12-06T23:37:23.820271+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:55.003894+00:00'
ghcid_history:
- ghcid: CZ-XX-PRA-L-AP
ghcid_numeric: 18091382464826001805
valid_from: '2025-12-06T23:37:23.820271+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
- ghcid: CZ-10-PRA-L-AP
valid_from: '2025-12-07T11:42:55.003929+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Automobilový průmysl
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-XX-PRA-L-AP
- identifier_scheme: GHCID_UUID
identifier_value: 490a6b0c-0f6f-5df9-bd2c-854da65d18f3
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: fb11807f-dfaf-898d-1f4a-ee52df3fd0b3
- identifier_scheme: GHCID_NUMERIC
identifier_value: '18091382464826001805'
- identifier_scheme: RECORD_ID
identifier_value: 98145ed4-5147-40d4-afe7-0a5b0ffb9e30
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.031447+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:16Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031447+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Automobilový průmysl
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031447+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031447+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031447+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF015
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.031447+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:23.820271+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Automotive Industry Association
rejection_reason: The candidate is an industry association, not a heritage institution (museum, library, archive, or gallery).
The Google Place types ('point_of_interest', 'establishment') do not match the expected heritage institution types.
timestamp: '2025-12-08T19:49:15.880112+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Automobilový průmysl official
youtube_search_timestamp: '2025-12-08T19:49:16.250587+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 120 00
street_address: Rašínovo nábř. 42
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.427319+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.427306+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: AROMA Praha, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF317
locations:
- street_address: Novovysočanská 46
postal_code: 190 00
city: Praha 9
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:24.083522+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-APS
ghcid_original: CZ-HL-PRA-L-APS
ghcid_uuid: 6fcce8bd-0a1d-525d-a914-b5357e0cbfa2
ghcid_uuid_sha256: 659f4781-5cc9-8faa-3a35-2a0d9bcd8b7c
ghcid_numeric: 7322650140060753834
record_id: a84b633c-0086-4a3a-904f-463117833f08
generation_timestamp: '2025-12-06T23:37:24.083522+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-APS
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-APS
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-APS
ghcid_numeric: 7322650140060753834
valid_from: '2025-12-06T23:37:24.083522+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AROMA Praha, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-APS
- identifier_scheme: GHCID_UUID
identifier_value: 6fcce8bd-0a1d-525d-a914-b5357e0cbfa2
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 659f4781-5cc9-8faa-3a35-2a0d9bcd8b7c
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7322650140060753834'
- identifier_scheme: RECORD_ID
identifier_value: a84b633c-0086-4a3a-904f-463117833f08
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.033411+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.033411+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AROMA Praha, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.033411+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.033411+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.033411+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF317
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.033411+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:24.083522+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 190 00
street_address: Novovysočanská 46
normalization_timestamp: '2025-12-09T20:41:04.227218+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.227198+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: AGROPROJEKT Praha, s.r.o.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF415
locations:
- street_address: Ve Smečkách 33
postal_code: 111 21
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.285031+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-APSR
ghcid_original: CZ-HL-PRA-L-APSR
ghcid_uuid: 8f29ce36-e21b-5f51-b040-04e873534248
ghcid_uuid_sha256: 392e3f84-e307-84d6-3512-a6e0bf51a65b
ghcid_numeric: 4120300549067703510
record_id: d0de9f8b-915c-4405-927c-13cf8f6d2d15
generation_timestamp: '2025-12-06T23:37:16.285031+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-APSR
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-APSR
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-APSR
ghcid_numeric: 4120300549067703510
valid_from: '2025-12-06T23:37:16.285031+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AGROPROJEKT Praha, s.r.o.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-APSR
- identifier_scheme: GHCID_UUID
identifier_value: 8f29ce36-e21b-5f51-b040-04e873534248
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 392e3f84-e307-84d6-3512-a6e0bf51a65b
- identifier_scheme: GHCID_NUMERIC
identifier_value: '4120300549067703510'
- identifier_scheme: RECORD_ID
identifier_value: d0de9f8b-915c-4405-927c-13cf8f6d2d15
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.918112+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918112+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AGROPROJEKT Praha, s.r.o.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918112+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918112+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918112+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF415
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918112+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.285031+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 111 21
street_address: Ve Smečkách 33
normalization_timestamp: '2025-12-09T20:41:04.277933+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.277918+00:00'

View file

@ -0,0 +1,164 @@
original_entry:
name: AŽD Praha, s.r.o. - Systémy pro kolejovou dopravu - Technická knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF005
- &id002
identifier_scheme: IČO
identifier_value: '48029483'
locations:
- street_address: Žirovnická 2/3146
postal_code: 106 17
city: Praha 10
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.777262+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-APSRSPKDTK
ghcid_original: CZ-HL-PRA-L-APSRSPKDTK
ghcid_uuid: 73b6f6a0-6aa1-525e-9695-ad0aaf495b5e
ghcid_uuid_sha256: 25d36358-6701-8854-8694-d3ca1d69880b
ghcid_numeric: 2725631430825711700
record_id: cf6b4e23-e9fa-4dc4-801e-1edbf526f27c
generation_timestamp: '2025-12-06T23:37:15.777262+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-APSRSPKDTK
valid_from: "2025-12-10T09:46:57Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-APSRSPKDTK
valid_from: null
valid_to: "2025-12-10T09:46:57Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-APSRSPKDTK
ghcid_numeric: 2725631430825711700
valid_from: '2025-12-06T23:37:15.777262+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AŽD Praha, s.r.o. - Systémy pro kolejovou dopravu - Technická knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-APSRSPKDTK
- identifier_scheme: GHCID_UUID
identifier_value: 73b6f6a0-6aa1-525e-9695-ad0aaf495b5e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 25d36358-6701-8854-8694-d3ca1d69880b
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2725631430825711700'
- identifier_scheme: RECORD_ID
identifier_value: cf6b4e23-e9fa-4dc4-801e-1edbf526f27c
- *id001
- *id002
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914081+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914081+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AŽD Praha, s.r.o. - Systémy pro kolejovou dopravu - Technická knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914081+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914081+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 10
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914081+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF005
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914081+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.777262+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 106 17
street_address: Žirovnická 2/3146
normalization_timestamp: '2025-12-09T20:41:04.323764+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.323750+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: ARMABETON, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF162
locations:
- street_address: Antala Staška 30
postal_code: 147 00
city: Praha 4
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.098082+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AS
ghcid_original: CZ-HL-PRA-L-AS
ghcid_uuid: dd6aeb75-9246-5dec-ab3d-02b949e8a665
ghcid_uuid_sha256: d68cdb6a-1317-8375-e0f0-11abb10bf0a4
ghcid_numeric: 15459972869489394549
record_id: 96bd21dd-759b-4b88-9a85-50f0ad328846
generation_timestamp: '2025-12-06T23:37:16.098082+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AS
ghcid_numeric: 15459972869489394549
valid_from: '2025-12-06T23:37:16.098082+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: ARMABETON, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AS
- identifier_scheme: GHCID_UUID
identifier_value: dd6aeb75-9246-5dec-ab3d-02b949e8a665
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: d68cdb6a-1317-8375-e0f0-11abb10bf0a4
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15459972869489394549'
- identifier_scheme: RECORD_ID
identifier_value: 96bd21dd-759b-4b88-9a85-50f0ad328846
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.916496+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916496+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ARMABETON, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916496+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916496+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916496+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF162
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.916496+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.098082+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 147 00
street_address: Antala Staška 30
normalization_timestamp: '2025-12-09T20:41:04.374645+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.374619+00:00'

View file

@ -0,0 +1,260 @@
original_entry:
name: Arcibiskupský seminář Praha - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABD092
- &id002
identifier_scheme: IČO
identifier_value: 06522971
- &id003
identifier_scheme: Website
identifier_value: https://knihovna.seminar-praha.cz/
identifier_url: https://knihovna.seminar-praha.cz/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114625124
identifier_url: https://www.wikidata.org/wiki/Q114625124
locations:
- street_address: Thákurova 676/3
postal_code: 160 00
city: Praha 6
region: Hlavní město Praha
country: CZ
latitude: 50.102269
longitude: 14.387193
processing_timestamp: '2025-12-06T23:37:43.284354+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ASPK
ghcid_original: CZ-HL-PRA-L-ASPK
ghcid_uuid: eda9c255-cddd-538b-a2f5-2a412cfafba0
ghcid_uuid_sha256: b918706f-3041-82c9-f75d-478ef8db9ad6
ghcid_numeric: 13337533919310975689
record_id: 47bd51e5-a309-453b-ad45-227a6ca0a9a0
generation_timestamp: '2025-12-06T23:37:43.284354+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ASPK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-ASPK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-ASPK
ghcid_numeric: 13337533919310975689
valid_from: '2025-12-06T23:37:43.284354+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Arcibiskupský seminář Praha - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ASPK
- identifier_scheme: GHCID_UUID
identifier_value: eda9c255-cddd-538b-a2f5-2a412cfafba0
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b918706f-3041-82c9-f75d-478ef8db9ad6
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13337533919310975689'
- identifier_scheme: RECORD_ID
identifier_value: 47bd51e5-a309-453b-ad45-227a6ca0a9a0
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.303750+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-ntHR4RBP4Mj51klRdUy7DKK0ApDvoeGLimWqigTPhaA=
verified_at: '2025-12-28T19:58:22.869748+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625124
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.869781+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303750+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Arcibiskupský seminář Praha - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303750+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303750+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303750+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114625124
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114625124
timestamp: '2025-11-19T10:14:30.303750+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABD092
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.303750+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:43.284354+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114625124
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:36.445708+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Arcibiskupský seminář Praha, knihovna
wikidata_label_cs: Arcibiskupský seminář Praha, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id005
- id: Q50379845
label: religious library
description: type of library with collections that focus on religion and related subjects
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2667290
label: Prague 6
description: municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:knihovna@seminar-praha.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-ntHR4RBP4Mj51klRdUy7DKK0ApDvoeGLimWqigTPhaA=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.869748+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114625124
verification:
status: verified
last_verified: '2025-12-28T19:58:22.869770+00:00'
location:
latitude: 50.102269
longitude: 14.387193
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 6
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 160 00
street_address: Thákurova 676/3
normalization_timestamp: '2025-12-09T06:50:47.206103+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:57:23.368831+00:00'
source_url: https://knihovna.seminar-praha.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://knihovna.seminar-praha.cz/opac-tmpl/bootstrap/images/favicon.ico
source_url: https://knihovna.seminar-praha.cz
css_selector: '[document] > html.js.fonts-loaded > head > link:nth-of-type(3)'
retrieved_on: '2025-12-23T21:57:23.368831+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/x-icon
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,161 @@
custodian_name:
claim_type: custodian_name
claim_value: ACIDOTECHNA, s.r.o.
source_type: ch_annotator
original_entry:
name: ACIDOTECHNA, s.r.o.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF096
locations:
- street_address: Michelská 12a
postal_code: 145 01
city: Praha 4
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.953529+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ASR-acidotechna_sro
ghcid_original: CZ-HL-PRA-L-ASR-acidotechna_sro
ghcid_uuid: 9c2c2209-9d56-5375-bb53-09c75ba423ef
ghcid_uuid_sha256: bce8eb61-ded2-8b28-874e-645c068f85be
ghcid_numeric: 13612388679310035752
record_id: 63f54d2a-db70-4c4c-b0ba-6fb873826750
generation_timestamp: '2025-12-06T23:37:15.953529+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ASR-acidotechna_sro
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-ASR-acidotechna_sro
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-ASR-acidotechna_sro
ghcid_numeric: 13612388679310035752
valid_from: '2025-12-06T23:37:15.953529+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ASR-acidotechna_sro
- identifier_scheme: GHCID_UUID
identifier_value: 9c2c2209-9d56-5375-bb53-09c75ba423ef
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: bce8eb61-ded2-8b28-874e-645c068f85be
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13612388679310035752'
- identifier_scheme: RECORD_ID
identifier_value: 63f54d2a-db70-4c4c-b0ba-6fb873826750
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.915592+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.915592+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ACIDOTECHNA, s.r.o.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.915592+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.915592+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.915592+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF096
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.915592+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.953529+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 145 01
street_address: Michelská 12a
normalization_timestamp: '2025-12-09T20:41:04.420625+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.420612+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: AGMECO, s.r.o.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC092
locations:
- street_address: Turkova 828
postal_code: 149 00
city: Praha 4
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.565312+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ASR
ghcid_original: CZ-HL-PRA-L-ASR
ghcid_uuid: cd691ef7-1bfd-52ef-a488-72f631ad3a7e
ghcid_uuid_sha256: 26b6bda7-4fa1-86bc-9c4c-4a233085c9ea
ghcid_numeric: 2789625545495766716
record_id: 6559bed9-6e1f-4aeb-b6b1-ac1f44c6317e
generation_timestamp: '2025-12-06T23:37:14.565312+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ASR
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-ASR
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-ASR
ghcid_numeric: 2789625545495766716
valid_from: '2025-12-06T23:37:14.565312+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AGMECO, s.r.o.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ASR
- identifier_scheme: GHCID_UUID
identifier_value: cd691ef7-1bfd-52ef-a488-72f631ad3a7e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 26b6bda7-4fa1-86bc-9c4c-4a233085c9ea
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2789625545495766716'
- identifier_scheme: RECORD_ID
identifier_value: 6559bed9-6e1f-4aeb-b6b1-ac1f44c6317e
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.803393+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803393+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AGMECO, s.r.o.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803393+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803393+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803393+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABC092
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803393+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.565312+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 149 00
street_address: Turkova 828
normalization_timestamp: '2025-12-09T20:41:04.467290+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.467210+00:00'

View file

@ -0,0 +1,273 @@
custodian_name:
claim_type: custodian_name
claim_value: ATS-TELCOM PRAHA, a.s. - Knihovna
source_type: ch_annotator
original_entry:
name: ATS-TELCOM PRAHA, a.s. - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF153
- &id002
identifier_scheme: IČO
identifier_value: '61860409'
- &id003
identifier_scheme: DIČ
identifier_value: CZ61860409
- &id004
identifier_scheme: Website
identifier_value: https://www.atstelcom.cz/
identifier_url: https://www.atstelcom.cz/
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617620
identifier_url: https://www.wikidata.org/wiki/Q114617620
locations:
- street_address: Milíčova 553/14
postal_code: 130 00
city: Praha 3
region: Hlavní město Praha
country: CZ
latitude: 50.085483
longitude: 14.451664
- street_address: Nad elektrárnou 1526/45
postal_code: 106 00
city: Praha 10
region: Hlavní město Praha
country: CZ
latitude: 50.059649
longitude: 14.482379
processing_timestamp: '2025-12-06T23:37:14.579079+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-ATPSK
ghcid_original: CZ-HL-PRA-L-ATPSK
ghcid_uuid: 5e551f13-d489-53fa-af0d-b6e82fa47d51
ghcid_uuid_sha256: 7d900938-62fa-869d-3b96-9cb930692af6
ghcid_numeric: 9047741789170710173
record_id: 4645f24c-0e18-4d6f-91de-8f8c6d333731
generation_timestamp: '2025-12-06T23:37:14.579079+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-ATPSK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-ATPSK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-ATPSK
ghcid_numeric: 9047741789170710173
valid_from: '2025-12-06T23:37:14.579079+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-ATPSK
- identifier_scheme: GHCID_UUID
identifier_value: 5e551f13-d489-53fa-af0d-b6e82fa47d51
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 7d900938-62fa-869d-3b96-9cb930692af6
- identifier_scheme: GHCID_NUMERIC
identifier_value: '9047741789170710173'
- identifier_scheme: RECORD_ID
identifier_value: 4645f24c-0e18-4d6f-91de-8f8c6d333731
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.803514+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-Ai9x7D73cX3tVNjU3LVqIKD/eoQYNuqfN5tBF2LNVn0=
verified_at: '2025-12-28T19:58:22.910555+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617620
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.910577+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803514+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ATS-TELCOM PRAHA, a.s. - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803514+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803514+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 3
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803514+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617620
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617620
timestamp: '2025-11-19T10:14:29.803514+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABF153
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803514+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.579079+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617620
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:44.488623+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: ATS-TELCOM PRAHA, a.s., knihovna
en: ATS-TELCOM PRAHA, JSC, Library
wikidata_label_en: ATS-TELCOM PRAHA, JSC, Library
wikidata_label_cs: ATS-TELCOM PRAHA, a.s., knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id006
- id: Q385994
label: special library
description: specialized library providing resources on a particular topic or discipline
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2444921
label: Prague 10
description: municipal part and administrative district of Prague
wikidata_country: *id007
wikidata_contact:
email: mailto:info@atstelcom.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-Ai9x7D73cX3tVNjU3LVqIKD/eoQYNuqfN5tBF2LNVn0=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.910555+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617620
verification:
status: verified
last_verified: '2025-12-28T19:58:22.910567+00:00'
location:
latitude: 50.085483
longitude: 14.451664
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 3
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 130 00
street_address: Milíčova 553/14
normalization_timestamp: '2025-12-09T06:50:47.255819+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:57:36.669627+00:00'
source_url: https://www.atstelcom.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.atstelcom.cz/wp-content/themes/atstelcom/images/favicon.png
source_url: https://www.atstelcom.cz
css_selector: '[document] > html > head > link:nth-of-type(24)'
retrieved_on: '2025-12-23T21:57:36.669627+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/png
favicon_sizes: 64x64
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,268 @@
original_entry:
name: Archeologický ústav AV ČR, Praha, v. v. i. - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB001
- &id002
identifier_scheme: IČO
identifier_value: '67985912'
- &id003
identifier_scheme: DIČ
identifier_value: CZ67985912
- &id004
identifier_scheme: Website
identifier_value: https://aleph.lib.cas.cz/F?func=file&file_name=find-b&local_base=ARU
identifier_url: https://aleph.lib.cas.cz/F?func=file&file_name=find-b&local_base=ARU
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617535
identifier_url: https://www.wikidata.org/wiki/Q114617535
locations:
- street_address: Letenská 4
postal_code: 118 01
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.090292
longitude: 14.409381
processing_timestamp: '2025-12-06T23:35:06.261010+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AUACPVVK
ghcid_original: CZ-HL-PRA-L-AÚAČPVVK
ghcid_uuid: 924b81b6-c531-53fb-bbaa-afd62e44a6cc
ghcid_uuid_sha256: bd78b1c5-240f-874c-8893-89b8d58c855f
ghcid_numeric: 13652857730645309260
record_id: 9b0a2dd2-fb5b-4cba-87e5-bcb1cdbd0c78
generation_timestamp: '2025-12-06T23:35:06.261010+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AUACPVVK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AUACPVVK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AUACPVVK
ghcid_numeric: 13652857730645309260
valid_from: '2025-12-08T11:21:41.171139+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AÚAČPVVK)'
- ghcid: CZ-HL-PRA-L-AÚAČPVVK
ghcid_numeric: 17938697631577694371
valid_from: '2025-12-06T23:35:06.261010+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:41.171139+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Archeologický ústav AV ČR, Praha, v. v. i. - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AUACPVVK
- identifier_scheme: GHCID_UUID
identifier_value: 924b81b6-c531-53fb-bbaa-afd62e44a6cc
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: bd78b1c5-240f-874c-8893-89b8d58c855f
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13652857730645309260'
- identifier_scheme: RECORD_ID
identifier_value: 9b0a2dd2-fb5b-4cba-87e5-bcb1cdbd0c78
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.796766+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-E7SnXARgb3OG9F55KshgrRA+rk285ZpEO2RGIY1V9ys=
verified_at: '2025-12-28T19:58:22.931365+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617535
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:22.931385+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796766+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Archeologický ústav AV ČR, Praha, v. v. i. - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796766+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796766+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796766+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617535
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617535
timestamp: '2025-11-19T10:14:29.796766+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABB001
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796766+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.261010+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617535
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:26:52.524632+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P18
- P463
wikidata_labels:
cs: Archeologický ústav AV ČR, Praha, v. v. i., knihovna
en: Institute of Archaeology CAS, Prague, p. r. i., Library
wikidata_label_en: Institute of Archaeology CAS, Prague, p. r. i., Library
wikidata_label_cs: Archeologický ústav AV ČR, Praha, v. v. i., knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- ARÚ AV ČR
wikidata_classification:
instance_of: &id006
- id: Q1438040
label: research library
description: library that collects materials on one or more subjects to support scholarly or scientific research
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id007
wikidata_organization:
member_of:
- id: Q106192307
label: Czech Republic Libraries Association
description: organization in Czechia
- id: Q4809890
label: Association of Library and Information Professionals of the Czech Republic
description: organization
wikidata_media:
image: Praha Mala Strana Letenska 4.JPG
wikidata_image: Praha Mala Strana Letenska 4.JPG
wikidata_contact:
email: mailto:mvs-edd@arup.cas.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-E7SnXARgb3OG9F55KshgrRA+rk285ZpEO2RGIY1V9ys=
scope: enrichment_section
computed_at: '2025-12-28T19:58:22.931365+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617535
verification:
status: verified
last_verified: '2025-12-28T19:58:22.931375+00:00'
location:
latitude: 50.090292
longitude: 14.409381
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 01
street_address: Letenská 4
normalization_timestamp: '2025-12-09T06:50:47.272728+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Astronomický ústav - pobočka Praha
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB003
locations:
- street_address: Boční II/1401a
postal_code: 141 31
city: Praha 4 - Spořilov
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.336723+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
ghcid_original: CZ-HL-PRA-L-AVČAÚPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
ghcid_uuid: 58f34098-2260-5a52-a55a-bd35658b9c4c
ghcid_uuid_sha256: b3b20835-7309-8201-99b1-e47c28429836
ghcid_numeric: 12948420904299880961
record_id: bcb4ac09-49d9-4ea3-93dc-a45ac8db7b22
generation_timestamp: '2025-12-06T23:37:14.336723+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
ghcid_numeric: 12948420904299880961
valid_from: '2025-12-08T11:21:37.421254+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČAÚPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha)'
- ghcid: CZ-HL-PRA-L-AVČAÚPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
ghcid_numeric: 7223755232174600416
valid_from: '2025-12-06T23:37:14.336723+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:37.421254+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCAUPP-akademie_ved_cr_astronomicky_ustav_pobocka_praha
- identifier_scheme: GHCID_UUID
identifier_value: 58f34098-2260-5a52-a55a-bd35658b9c4c
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b3b20835-7309-8201-99b1-e47c28429836
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12948420904299880961'
- identifier_scheme: RECORD_ID
identifier_value: bcb4ac09-49d9-4ea3-93dc-a45ac8db7b22
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.796909+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4 - Spořilov
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB003
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.336723+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 141 31
street_address: Boční II/1401a
normalization_timestamp: '2025-12-09T20:41:04.527872+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.527796+00:00'

View file

@ -0,0 +1,166 @@
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
source_type: ch_annotator
original_entry:
name: Akademie věd ČR - Astronomický ústav - pobočka Praha
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB003
locations:
- street_address: Boční II/1401a
postal_code: 141 31
city: Praha 4 - Spořilov
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:35:06.267449+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCAUPP
ghcid_original: CZ-HL-PRA-L-AVČAÚPP
ghcid_uuid: 18967c5e-b5e8-590f-a534-aee5991f94d2
ghcid_uuid_sha256: 9b3c35db-cf09-8a4e-bf7a-4869fbae9378
ghcid_numeric: 11185874792669071950
record_id: 25dfb5ba-2772-4315-97f5-9e6bff301b3e
generation_timestamp: '2025-12-06T23:35:06.267449+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCAUPP
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AVCAUPP
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AVCAUPP
ghcid_numeric: 11185874792669071950
valid_from: '2025-12-08T11:21:31.097745+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČAÚPP)'
- ghcid: CZ-HL-PRA-L-AVČAÚPP
ghcid_numeric: 8955671449216497236
valid_from: '2025-12-06T23:35:06.267449+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.097745+00:00'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCAUPP
- identifier_scheme: GHCID_UUID
identifier_value: 18967c5e-b5e8-590f-a534-aee5991f94d2
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 9b3c35db-cf09-8a4e-bf7a-4869fbae9378
- identifier_scheme: GHCID_NUMERIC
identifier_value: '11185874792669071950'
- identifier_scheme: RECORD_ID
identifier_value: 25dfb5ba-2772-4315-97f5-9e6bff301b3e
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.796909+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Astronomický ústav - pobočka Praha
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4 - Spořilov
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB003
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796909+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.267449+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 141 31
street_address: Boční II/1401a
normalization_timestamp: '2025-12-09T20:41:04.585077+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.585057+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Encyklopedický institut
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB007
locations:
- street_address: Nerudova 21
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.509175+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCEI
ghcid_original: CZ-HL-PRA-L-AVČEI
ghcid_uuid: 0db42277-531e-574c-b4d8-06d65682aec8
ghcid_uuid_sha256: abe02825-c33c-8ab4-8761-1ed37589c252
ghcid_numeric: 12384943117923306164
record_id: 29def572-f83a-4df6-a71d-d7a953ebbaf3
generation_timestamp: '2025-12-06T23:37:25.509175+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCEI
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCEI
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCEI
ghcid_numeric: 12384943117923306164
valid_from: '2025-12-08T11:21:22.756802+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČEI)'
- ghcid: CZ-HL-PRA-L-AVČEI
ghcid_numeric: 1744362685845442236
valid_from: '2025-12-06T23:37:25.509175+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:22.756802+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Encyklopedický institut
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCEI
- identifier_scheme: GHCID_UUID
identifier_value: 0db42277-531e-574c-b4d8-06d65682aec8
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: abe02825-c33c-8ab4-8761-1ed37589c252
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12384943117923306164'
- identifier_scheme: RECORD_ID
identifier_value: 29def572-f83a-4df6-a71d-d7a953ebbaf3
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046021+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046021+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Encyklopedický institut
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046021+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046021+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046021+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB007
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046021+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.509175+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Nerudova 21
normalization_timestamp: '2025-12-09T20:41:04.641407+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.641362+00:00'

View file

@ -0,0 +1,174 @@
original_entry:
name: Akademie věd ČR - Farmakologický ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB009
locations:
- street_address: Vídeňská 1083
postal_code: 142 20
city: Praha 4 - Krč
country: CZ
processing_timestamp: '2025-12-06T23:37:14.339235+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCFU-akademie_ved_cr_farmakologicky_ustav
ghcid_original: CZ-XX-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav
ghcid_uuid: f96e7f32-5991-57f8-ac2b-03a29e61f6b4
ghcid_uuid_sha256: 672371a3-e0d0-8931-b13f-7df5a5e66712
ghcid_numeric: 7431908758756849969
record_id: 0e02392d-7875-4d5d-8654-519be6319019
generation_timestamp: '2025-12-06T23:37:14.339235+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:53.985901+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCFU-akademie_ved_cr_farmakologicky_ustav
ghcid_numeric: 7431908758756849969
valid_from: '2025-12-08T11:21:21.962848+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav)'
- ghcid: CZ-XX-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav
ghcid_numeric: 1014122794529823696
valid_from: '2025-12-06T23:37:14.339235+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.962848+00:00'
- ghcid: CZ-10-PRA-L-AVČFÚ-akademie_ved_cr_farmakologicky_ustav
valid_from: '2025-12-07T11:42:53.985912+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Farmakologický ústav
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCFU-akademie_ved_cr_farmakologicky_ustav
- identifier_scheme: GHCID_UUID
identifier_value: f96e7f32-5991-57f8-ac2b-03a29e61f6b4
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 672371a3-e0d0-8931-b13f-7df5a5e66712
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7431908758756849969'
- identifier_scheme: RECORD_ID
identifier_value: 0e02392d-7875-4d5d-8654-519be6319019
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.797040+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:19Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Farmakologický ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4 - Krč
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB009
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.339235+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: The candidate 'Czech Academy of Sciences' is the overarching parent institution for the source 'Akademie
věd ČR - Farmakologický ústav' (Institute of Pharmacology). They are not the same entity. Furthermore, the candidate's
generic Google Place types ('point_of_interest', 'establishment') do not match the expected heritage custodian types (museum,
library, archive, gallery). The entity type is not GRP.HER.
timestamp: '2025-12-08T19:49:19.452977+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Farmakologický ústav official
youtube_search_timestamp: '2025-12-08T19:49:19.794617+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 142 20
street_address: Vídeňská 1083
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.470017+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.469902+00:00'

View file

@ -0,0 +1,175 @@
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Farmakologický ústav
source_type: ch_annotator
original_entry:
name: Akademie věd ČR - Farmakologický ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB009
locations:
- street_address: Vídeňská 1083
postal_code: 142 20
city: Praha 4 - Krč
country: CZ
processing_timestamp: '2025-12-06T23:35:06.272726+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCFU
ghcid_original: CZ-XX-PRA-L-AVČFÚ
ghcid_uuid: 19446a92-0047-50e3-a29e-2b6727963ddd
ghcid_uuid_sha256: 509c980d-7747-8252-ac32-02074845989a
ghcid_numeric: 5808684803004277330
record_id: 7f650e4c-32cf-48c2-96c7-c2e8a97b93a0
generation_timestamp: '2025-12-06T23:35:06.272726+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:56.327508+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCFU
ghcid_numeric: 5808684803004277330
valid_from: '2025-12-08T11:21:22.541518+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČFÚ)'
- ghcid: CZ-XX-PRA-L-AVČFÚ
ghcid_numeric: 10681739419161361008
valid_from: '2025-12-06T23:35:06.272726+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:22.541518+00:00'
- ghcid: CZ-10-PRA-L-AVČFÚ
valid_from: '2025-12-07T11:42:56.327520+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCFU
- identifier_scheme: GHCID_UUID
identifier_value: 19446a92-0047-50e3-a29e-2b6727963ddd
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 509c980d-7747-8252-ac32-02074845989a
- identifier_scheme: GHCID_NUMERIC
identifier_value: '5808684803004277330'
- identifier_scheme: RECORD_ID
identifier_value: 7f650e4c-32cf-48c2-96c7-c2e8a97b93a0
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.797040+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:23Z: Maps: rejected by LLM; YouTube:
not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Farmakologický ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4 - Krč
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB009
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797040+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.272726+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: The Google Place types 'point_of_interest' and 'establishment'
are too generic and do not match the expected types for a Heritage Institution
(GRP.HER). The source institution is a research institute, which does not align
with the definition of a heritage custodian (museums, archives, libraries, galleries).
timestamp: '2025-12-08T19:49:22.801497+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Farmakologický ústav official
youtube_search_timestamp: '2025-12-08T19:49:23.146454+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 142 20
street_address: Vídeňská 1083
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.518802+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.518791+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Laboratoř evoluční biologie
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB113
locations:
- street_address: Na Folimance 11
postal_code: 120 00
city: Praha 2
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.551825+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCLEB
ghcid_original: CZ-HL-PRA-L-AVČLEB
ghcid_uuid: e0d81fc2-7fba-594a-9c91-9346aecc6151
ghcid_uuid_sha256: 07e2e7ff-3bd7-8da4-9c02-bdf92fed8815
ghcid_numeric: 568271586408783268
record_id: 0ebd10c7-c2bf-47f2-9f6c-a6ab26dacb10
generation_timestamp: '2025-12-06T23:37:25.551825+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCLEB
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCLEB
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCLEB
ghcid_numeric: 568271586408783268
valid_from: '2025-12-08T11:21:21.322426+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČLEB)'
- ghcid: CZ-HL-PRA-L-AVČLEB
ghcid_numeric: 6089402491441890915
valid_from: '2025-12-06T23:37:25.551825+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.322426+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Laboratoř evoluční biologie
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCLEB
- identifier_scheme: GHCID_UUID
identifier_value: e0d81fc2-7fba-594a-9c91-9346aecc6151
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 07e2e7ff-3bd7-8da4-9c02-bdf92fed8815
- identifier_scheme: GHCID_NUMERIC
identifier_value: '568271586408783268'
- identifier_scheme: RECORD_ID
identifier_value: 0ebd10c7-c2bf-47f2-9f6c-a6ab26dacb10
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046359+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Laboratoř evoluční biologie
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB113
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.551825+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 120 00
street_address: Na Folimance 11
normalization_timestamp: '2025-12-09T20:41:04.685021+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.684993+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Mikrobiologický ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB070
locations:
- street_address: Vídeňská 1083
postal_code: 142 20
city: Praha 4 - Krč
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.360633+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
ghcid_original: CZ-HL-PRA-L-AVČMÚ-akademie_ved_cr_mikrobiologicky_ustav
ghcid_uuid: a981b3fa-1bce-51ef-8dcb-102dd837a91d
ghcid_uuid_sha256: 1df13b28-a632-8490-bb35-783b25a0b097
ghcid_numeric: 2157570742260221072
record_id: 7dd73bdf-0d97-4f80-8391-979cb3bf4e74
generation_timestamp: '2025-12-06T23:37:14.360633+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
ghcid_numeric: 2157570742260221072
valid_from: '2025-12-08T11:21:40.263531+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČMÚ-akademie_ved_cr_mikrobiologicky_ustav)'
- ghcid: CZ-HL-PRA-L-AVČMÚ-akademie_ved_cr_mikrobiologicky_ustav
ghcid_numeric: 11561309061492259634
valid_from: '2025-12-06T23:37:14.360633+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:40.263531+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Mikrobiologický ústav
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCMU-akademie_ved_cr_mikrobiologicky_ustav
- identifier_scheme: GHCID_UUID
identifier_value: a981b3fa-1bce-51ef-8dcb-102dd837a91d
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 1df13b28-a632-8490-bb35-783b25a0b097
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2157570742260221072'
- identifier_scheme: RECORD_ID
identifier_value: 7dd73bdf-0d97-4f80-8391-979cb3bf4e74
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.799712+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Mikrobiologický ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4 - Krč
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB070
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.360633+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 142 20
street_address: Vídeňská 1083
normalization_timestamp: '2025-12-09T20:41:04.735969+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.735953+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Mikrobiologický ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB070
locations:
- street_address: Vídeňská 1083
postal_code: 142 20
city: Praha 4 - Krč
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:35:06.408370+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCMU
ghcid_original: CZ-HL-PRA-L-AVČMÚ
ghcid_uuid: 67eb014a-1a9a-5c33-b7b0-ba6334bbd2f4
ghcid_uuid_sha256: dee515cd-cd8e-8bb7-bb0f-5779401058ef
ghcid_numeric: 16061267619747896247
record_id: e556633f-9974-4eff-8494-d78123ceb59e
generation_timestamp: '2025-12-06T23:35:06.408370+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCMU
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCMU
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCMU
ghcid_numeric: 16061267619747896247
valid_from: '2025-12-08T11:21:21.267751+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČMÚ)'
- ghcid: CZ-HL-PRA-L-AVČMÚ
ghcid_numeric: 17106076247160859197
valid_from: '2025-12-06T23:35:06.408370+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.267751+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Mikrobiologický ústav
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCMU
- identifier_scheme: GHCID_UUID
identifier_value: 67eb014a-1a9a-5c33-b7b0-ba6334bbd2f4
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: dee515cd-cd8e-8bb7-bb0f-5779401058ef
- identifier_scheme: GHCID_NUMERIC
identifier_value: '16061267619747896247'
- identifier_scheme: RECORD_ID
identifier_value: e556633f-9974-4eff-8494-d78123ceb59e
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.799712+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Mikrobiologický ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4 - Krč
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB070
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799712+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.408370+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 142 20
street_address: Vídeňská 1083
normalization_timestamp: '2025-12-09T20:41:04.783450+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.783435+00:00'

View file

@ -0,0 +1,175 @@
original_entry:
name: Akademie věd ČR - Patentové a licenční služby
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB034
locations:
- street_address: Českomalínská 25
postal_code: 160 70
city: Praha 6
country: CZ
processing_timestamp: '2025-12-06T23:37:14.342675+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCPLS-akademie_ved_cr_patentove_a_licencni_sluzby
ghcid_original: CZ-XX-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby
ghcid_uuid: 77fc1dea-cb44-5f17-b700-0fa091c6d43e
ghcid_uuid_sha256: cf64099a-0441-8f94-9b74-a037a0a9bce2
ghcid_numeric: 14944080020623171476
record_id: 20afb6b3-2f0b-40bc-aa0f-ff7a2c1a78a7
generation_timestamp: '2025-12-06T23:37:14.342675+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:54.008704+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCPLS-akademie_ved_cr_patentove_a_licencni_sluzby
ghcid_numeric: 14944080020623171476
valid_from: '2025-12-08T11:21:38.962454+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby)'
- ghcid: CZ-XX-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby
ghcid_numeric: 2477309062137844976
valid_from: '2025-12-06T23:37:14.342675+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:38.962454+00:00'
- ghcid: CZ-10-PRA-L-AVČPLS-akademie_ved_cr_patentove_a_licencni_sluzby
valid_from: '2025-12-07T11:42:54.008741+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Patentové a licenční služby
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCPLS-akademie_ved_cr_patentove_a_licencni_sluzby
- identifier_scheme: GHCID_UUID
identifier_value: 77fc1dea-cb44-5f17-b700-0fa091c6d43e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: cf64099a-0441-8f94-9b74-a037a0a9bce2
- identifier_scheme: GHCID_NUMERIC
identifier_value: '14944080020623171476'
- identifier_scheme: RECORD_ID
identifier_value: 20afb6b3-2f0b-40bc-aa0f-ff7a2c1a78a7
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.797854+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:26Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Patentové a licenční služby
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB034
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.342675+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: 'REJECT: The source name indicates a specific department (Patent and Licensing Services) of the Czech
Academy of Sciences (Akademie věd ČR). The Google Maps result is for the parent institution, the Czech Academy of Sciences
itself, which is a broad scientific research organization, not a specific heritage custodian unit. Critically, the Google
Place types (''point_of_interest'', ''establishment'') do not match any of the expected heritage custodian types (e.g.,
library, museum, archive).'
timestamp: '2025-12-08T19:49:26.082220+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Patentové a licenční služby official
youtube_search_timestamp: '2025-12-08T19:49:26.425308+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 160 70
street_address: Českomalínská 25
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.576931+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.576911+00:00'

View file

@ -0,0 +1,173 @@
original_entry:
name: Akademie věd ČR - Patentové a licenční služby
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB034
locations:
- street_address: Českomalínská 25
postal_code: 160 70
city: Praha 6
country: CZ
processing_timestamp: '2025-12-06T23:35:06.317887+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCPLS
ghcid_original: CZ-XX-PRA-L-AVČPLS
ghcid_uuid: 736468df-5da2-5944-9de8-403b76877f1c
ghcid_uuid_sha256: eb24cd69-68ef-8a64-b846-ef07973b1e82
ghcid_numeric: 16943893550690450020
record_id: 3b727510-8b1e-4cab-899d-ab014a488899
generation_timestamp: '2025-12-06T23:35:06.317887+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:54.307666+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCPLS
ghcid_numeric: 16943893550690450020
valid_from: '2025-12-08T11:21:31.031504+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPLS)'
- ghcid: CZ-XX-PRA-L-AVČPLS
ghcid_numeric: 14931995940768973646
valid_from: '2025-12-06T23:35:06.317887+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.031504+00:00'
- ghcid: CZ-10-PRA-L-AVČPLS
valid_from: '2025-12-07T11:42:54.307676+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Patentové a licenční služby
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCPLS
- identifier_scheme: GHCID_UUID
identifier_value: 736468df-5da2-5944-9de8-403b76877f1c
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: eb24cd69-68ef-8a64-b846-ef07973b1e82
- identifier_scheme: GHCID_NUMERIC
identifier_value: '16943893550690450020'
- identifier_scheme: RECORD_ID
identifier_value: 3b727510-8b1e-4cab-899d-ab014a488899
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.797854+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:33Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Patentové a licenční služby
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB034
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.797854+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.317887+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: 'REJECT: The candidate matches by name and location, but the Google Place Types (''point_of_interest'',
''establishment'') do not match the expected heritage types (museum, library, archive, gallery). The institution is a
national academy of sciences, not a heritage custodian like a museum or library.'
timestamp: '2025-12-08T19:49:33.088455+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Patentové a licenční služby official
youtube_search_timestamp: '2025-12-08T19:49:33.450514+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 160 70
street_address: Českomalínská 25
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.629000+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.628980+00:00'

View file

@ -0,0 +1,175 @@
original_entry:
name: Akademie věd ČR - Prognostický ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB112
locations:
- street_address: Opletalova 19
postal_code: 110 00
city: Praha 1
country: CZ
processing_timestamp: '2025-12-06T23:37:23.658421+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCPU
ghcid_original: CZ-XX-PRA-L-AVČPÚ
ghcid_uuid: 8bb4e75c-a21b-5838-acb6-5ff445aed08a
ghcid_uuid_sha256: 5de04df0-b47f-8a74-a7f8-a9bb971668b9
ghcid_numeric: 6764492336526219892
record_id: 018d704c-40a7-4314-802e-d1657a6ef48d
generation_timestamp: '2025-12-06T23:37:23.658421+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:55.506888+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCPU
ghcid_numeric: 6764492336526219892
valid_from: '2025-12-08T11:21:21.976876+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPÚ)'
- ghcid: CZ-XX-PRA-L-AVČPÚ
ghcid_numeric: 4769545273226280928
valid_from: '2025-12-06T23:37:23.658421+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.976876+00:00'
- ghcid: CZ-10-PRA-L-AVČPÚ
valid_from: '2025-12-07T11:42:55.506915+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Prognostický ústav
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCPU
- identifier_scheme: GHCID_UUID
identifier_value: 8bb4e75c-a21b-5838-acb6-5ff445aed08a
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 5de04df0-b47f-8a74-a7f8-a9bb971668b9
- identifier_scheme: GHCID_NUMERIC
identifier_value: '6764492336526219892'
- identifier_scheme: RECORD_ID
identifier_value: 018d704c-40a7-4314-802e-d1657a6ef48d
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.030652+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:37Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030652+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Prognostický ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030652+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030652+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030652+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB112
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030652+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:23.658421+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: 'NAME MATCH: The source ''Akademie věd ČR - Prognostický ústav'' (Czech Academy of Sciences - Prognostic
Institute) is a specific research institute. The candidate ''Czech Academy of Sciences'' is the parent umbrella organization.
They are not the same institution. TYPE MATCH: The Google Place types ''point_of_interest'' and ''establishment'' do not
match any of the expected types for a heritage institution (e.g., museum, library, gallery). ENTITY TYPE: The candidate
is an umbrella organization for a national academy of sciences and does not function as a heritage institution.'
timestamp: '2025-12-08T19:49:36.877469+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Prognostický ústav official
youtube_search_timestamp: '2025-12-08T19:49:37.207233+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 110 00
street_address: Opletalova 19
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.679222+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.679209+00:00'

View file

@ -0,0 +1,175 @@
original_entry:
name: Akademie věd ČR - Pedagogický ústav J.A.Komenského
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB011
locations:
- street_address: Máchova 7
postal_code: 120 00
city: Praha 2
country: CZ
processing_timestamp: '2025-12-06T23:37:23.639904+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCPUJK
ghcid_original: CZ-XX-PRA-L-AVČPÚJK
ghcid_uuid: 6c0c0023-d809-5af7-9792-3a88280b05cc
ghcid_uuid_sha256: c7d810dd-72ec-871c-87f0-147e22c4f1b9
ghcid_numeric: 14400278351819056924
record_id: b51bbf12-d5c6-492c-9a1a-f19d1d6b61e7
generation_timestamp: '2025-12-06T23:37:23.639904+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:54.224540+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCPUJK
ghcid_numeric: 14400278351819056924
valid_from: '2025-12-08T11:21:29.200637+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČPÚJK)'
- ghcid: CZ-XX-PRA-L-AVČPÚJK
ghcid_numeric: 11911444671767460800
valid_from: '2025-12-06T23:37:23.639904+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:29.200637+00:00'
- ghcid: CZ-10-PRA-L-AVČPÚJK
valid_from: '2025-12-07T11:42:54.224551+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Pedagogický ústav J.A.Komenského
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCPUJK
- identifier_scheme: GHCID_UUID
identifier_value: 6c0c0023-d809-5af7-9792-3a88280b05cc
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: c7d810dd-72ec-871c-87f0-147e22c4f1b9
- identifier_scheme: GHCID_NUMERIC
identifier_value: '14400278351819056924'
- identifier_scheme: RECORD_ID
identifier_value: b51bbf12-d5c6-492c-9a1a-f19d1d6b61e7
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.030561+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:41Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030561+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Pedagogický ústav J.A.Komenského
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030561+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030561+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030561+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB011
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030561+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:23.639904+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: This is a mismatch. The source is a specific research institute within the Czech Academy of Sciences.
The Google Maps candidate represents the overarching parent organization, the Czech Academy of Sciences, located at its
headquarters. The candidate's Google Place types ('point_of_interest', 'establishment') are generic and do not match the
expected types for a heritage institution. Furthermore, the website links to the main academy, not the specific Pedagogical
Institute.
timestamp: '2025-12-08T19:49:41.002827+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Pedagogický ústav J.A.Komenského official
youtube_search_timestamp: '2025-12-08T19:49:41.371035+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 120 00
street_address: Máchova 7
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.718568+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.718556+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Ústav anorganické chemie
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB035
locations:
- street_address: Pelléova 24
postal_code: 160 00
city: Praha 6
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.544411+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUAC
ghcid_original: CZ-HL-PRA-L-AVČÚAC
ghcid_uuid: c224a937-aaa3-5581-ae12-af86dda848f4
ghcid_uuid_sha256: 4644be0d-f1e9-85fe-a554-49213c174ac9
ghcid_numeric: 5063380848173729278
record_id: 76769554-a057-44b4-b2bf-00c3126e879d
generation_timestamp: '2025-12-06T23:37:25.544411+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUAC
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCUAC
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCUAC
ghcid_numeric: 5063380848173729278
valid_from: '2025-12-08T11:21:27.871544+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚAC)'
- ghcid: CZ-HL-PRA-L-AVČÚAC
ghcid_numeric: 9877480588037009223
valid_from: '2025-12-06T23:37:25.544411+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:27.871544+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav anorganické chemie
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUAC
- identifier_scheme: GHCID_UUID
identifier_value: c224a937-aaa3-5581-ae12-af86dda848f4
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 4644be0d-f1e9-85fe-a554-49213c174ac9
- identifier_scheme: GHCID_NUMERIC
identifier_value: '5063380848173729278'
- identifier_scheme: RECORD_ID
identifier_value: 76769554-a057-44b4-b2bf-00c3126e879d
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046296+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046296+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav anorganické chemie
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046296+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046296+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046296+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB035
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046296+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.544411+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 160 00
street_address: Pelléova 24
normalization_timestamp: '2025-12-09T20:41:04.836786+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.836764+00:00'

View file

@ -0,0 +1,172 @@
original_entry:
name: Akademie věd ČR - Ústav fyzikální metalurgie
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB057
locations:
- street_address: Boční II/1401
postal_code: 141 31
city: Praha 4
country: CZ
processing_timestamp: '2025-12-06T23:37:23.652645+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUFM
ghcid_original: CZ-XX-PRA-L-AVČÚFM
ghcid_uuid: e3553f52-4b71-579e-8949-7ed9a14d7989
ghcid_uuid_sha256: 56d2fc1d-5840-8332-b8c5-11f406f5728b
ghcid_numeric: 6256340035335910194
record_id: fa035dcd-5e79-4d93-8774-4d980c99cdf7
generation_timestamp: '2025-12-06T23:37:23.652645+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:54.761825+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUFM
ghcid_numeric: 6256340035335910194
valid_from: '2025-12-08T11:21:29.451072+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČÚFM)'
- ghcid: CZ-XX-PRA-L-AVČÚFM
ghcid_numeric: 14246368355740162447
valid_from: '2025-12-06T23:37:23.652645+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:29.451072+00:00'
- ghcid: CZ-10-PRA-L-AVČÚFM
valid_from: '2025-12-07T11:42:54.761835+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav fyzikální metalurgie
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUFM
- identifier_scheme: GHCID_UUID
identifier_value: e3553f52-4b71-579e-8949-7ed9a14d7989
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 56d2fc1d-5840-8332-b8c5-11f406f5728b
- identifier_scheme: GHCID_NUMERIC
identifier_value: '6256340035335910194'
- identifier_scheme: RECORD_ID
identifier_value: fa035dcd-5e79-4d93-8774-4d980c99cdf7
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.030634+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:44Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030634+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav fyzikální metalurgie
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030634+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030634+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030634+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB057
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.030634+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:23.652645+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Institute of Physics of Materials ASCR
rejection_reason: 'REJECT: The candidate is a research institute. Its Google Place types (''point_of_interest'', ''establishment'')
do not match the expected heritage institution types (museum, library, gallery).'
timestamp: '2025-12-08T19:49:44.300650+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Ústav fyzikální metalurgie official
youtube_search_timestamp: '2025-12-08T19:49:44.639812+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 141 31
street_address: Boční II/1401
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.765484+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.765473+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Ústav fyziologických regulací
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB025
locations:
- street_address: Na Truhlářce 2
postal_code: 180 00
city: Praha 8 - Libeň
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.531267+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUFR
ghcid_original: CZ-HL-PRA-L-AVČÚFR
ghcid_uuid: 795dc63d-7b17-5de0-80be-3a319ae1298d
ghcid_uuid_sha256: 86714f47-d21f-8eea-a4be-06373e19a4df
ghcid_numeric: 9687611443337182954
record_id: 8f9b21ff-fd8d-4247-8b70-05a3b5ee62b8
generation_timestamp: '2025-12-06T23:37:25.531267+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUFR
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCUFR
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCUFR
ghcid_numeric: 9687611443337182954
valid_from: '2025-12-08T11:21:31.235957+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚFR)'
- ghcid: CZ-HL-PRA-L-AVČÚFR
ghcid_numeric: 18029943413411395802
valid_from: '2025-12-06T23:37:25.531267+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.235957+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav fyziologických regulací
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUFR
- identifier_scheme: GHCID_UUID
identifier_value: 795dc63d-7b17-5de0-80be-3a319ae1298d
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 86714f47-d21f-8eea-a4be-06373e19a4df
- identifier_scheme: GHCID_NUMERIC
identifier_value: '9687611443337182954'
- identifier_scheme: RECORD_ID
identifier_value: 8f9b21ff-fd8d-4247-8b70-05a3b5ee62b8
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046197+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav fyziologických regulací
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 8 - Libeň
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB025
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.531267+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 180 00
street_address: Na Truhlářce 2
normalization_timestamp: '2025-12-09T20:41:04.897937+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.897911+00:00'

View file

@ -0,0 +1,166 @@
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav molekulární genetiky
source_type: ch_annotator
original_entry:
name: Akademie věd ČR - Ústav molekulární genetiky
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB110
locations:
- street_address: Flemingovo nám. 2
postal_code: 166 37
city: Praha 6
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.369691+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
ghcid_original: CZ-HL-PRA-L-AVČÚMG-akademie_ved_cr_ustav_molekularni_genetiky
ghcid_uuid: 7f2b5df4-8c0a-5167-ba84-91432ff7ac7a
ghcid_uuid_sha256: 5af43a46-4a31-8013-b858-d13011f05f40
ghcid_numeric: 6553927431297822739
record_id: 016cc72d-03a0-4072-9ae0-f0134a15c21c
generation_timestamp: '2025-12-06T23:37:14.369691+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
ghcid_numeric: 6553927431297822739
valid_from: '2025-12-08T11:21:31.754415+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚMG-akademie_ved_cr_ustav_molekularni_genetiky)'
- ghcid: CZ-HL-PRA-L-AVČÚMG-akademie_ved_cr_ustav_molekularni_genetiky
ghcid_numeric: 160046891344919514
valid_from: '2025-12-06T23:37:14.369691+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.754415+00:00'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUMG-akademie_ved_cr_ustav_molekularni_genetiky
- identifier_scheme: GHCID_UUID
identifier_value: 7f2b5df4-8c0a-5167-ba84-91432ff7ac7a
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 5af43a46-4a31-8013-b858-d13011f05f40
- identifier_scheme: GHCID_NUMERIC
identifier_value: '6553927431297822739'
- identifier_scheme: RECORD_ID
identifier_value: 016cc72d-03a0-4072-9ae0-f0134a15c21c
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.800225+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav molekulární genetiky
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB110
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.369691+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 166 37
street_address: Flemingovo nám. 2
normalization_timestamp: '2025-12-09T20:41:04.955642+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:04.955627+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Ústav molekulární genetiky
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB110
locations:
- street_address: Flemingovo nám. 2
postal_code: 166 37
city: Praha 6
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:35:06.436939+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUMG
ghcid_original: CZ-HL-PRA-L-AVČÚMG
ghcid_uuid: 695bf0ce-d479-5c58-a6a8-1b631a99e4be
ghcid_uuid_sha256: 27fc2512-1330-8d25-90f5-fcd2998c2f3a
ghcid_numeric: 2881218621171850533
record_id: c6c9b1f9-edfd-4e00-b83d-282e4bb3cd36
generation_timestamp: '2025-12-06T23:35:06.436939+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUMG
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCUMG
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCUMG
ghcid_numeric: 2881218621171850533
valid_from: '2025-12-08T11:21:21.745302+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚMG)'
- ghcid: CZ-HL-PRA-L-AVČÚMG
ghcid_numeric: 3450507008958569248
valid_from: '2025-12-06T23:35:06.436939+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.745302+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav molekulární genetiky
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUMG
- identifier_scheme: GHCID_UUID
identifier_value: 695bf0ce-d479-5c58-a6a8-1b631a99e4be
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 27fc2512-1330-8d25-90f5-fcd2998c2f3a
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2881218621171850533'
- identifier_scheme: RECORD_ID
identifier_value: c6c9b1f9-edfd-4e00-b83d-282e4bb3cd36
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.800225+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav molekulární genetiky
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB110
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800225+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.436939+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 166 37
street_address: Flemingovo nám. 2
normalization_timestamp: '2025-12-09T20:41:05.002294+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.002273+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Ústav pro českou a světovou literaturu - odd. neslovanských literatur
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB039
locations:
- street_address: nám. Republiky 1
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.547650+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUPCSLON
ghcid_original: CZ-HL-PRA-L-AVČÚPČSLON
ghcid_uuid: 612e600a-6c13-59da-bff4-07b2a007e568
ghcid_uuid_sha256: 7537c6f3-aa62-86aa-8cbc-a34535e1b4bd
ghcid_numeric: 8446438375994599082
record_id: f2eec84d-8531-4e23-857e-6c002da3fec9
generation_timestamp: '2025-12-06T23:37:25.547650+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUPCSLON
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCUPCSLON
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCUPCSLON
ghcid_numeric: 8446438375994599082
valid_from: '2025-12-08T11:21:22.066165+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚPČSLON)'
- ghcid: CZ-HL-PRA-L-AVČÚPČSLON
ghcid_numeric: 6149932590907325350
valid_from: '2025-12-06T23:37:25.547650+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:22.066165+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav pro českou a světovou literaturu - odd. neslovanských literatur
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUPCSLON
- identifier_scheme: GHCID_UUID
identifier_value: 612e600a-6c13-59da-bff4-07b2a007e568
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 7537c6f3-aa62-86aa-8cbc-a34535e1b4bd
- identifier_scheme: GHCID_NUMERIC
identifier_value: '8446438375994599082'
- identifier_scheme: RECORD_ID
identifier_value: f2eec84d-8531-4e23-857e-6c002da3fec9
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046318+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046318+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav pro českou a světovou literaturu - odd. neslovanských literatur
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046318+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046318+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046318+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB039
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046318+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.547650+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: nám. Republiky 1
normalization_timestamp: '2025-12-09T20:41:05.061486+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.061461+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB042
locations:
- street_address: Dolejškova 5
postal_code: 182 02
city: Praha 8
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.348347+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
ghcid_original: CZ-HL-PRA-L-AVČÚPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
ghcid_uuid: f377a24a-a2c3-5754-90e7-22295a9a5022
ghcid_uuid_sha256: 2915a1fc-c781-8a6f-beb9-0629b4602eb7
ghcid_numeric: 2960450437116938863
record_id: fb760fd0-96d2-48eb-b2cb-e7974123d634
generation_timestamp: '2025-12-06T23:37:14.348347+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
ghcid_numeric: 2960450437116938863
valid_from: '2025-12-08T11:21:33.284297+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna)'
- ghcid: CZ-HL-PRA-L-AVČÚPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
ghcid_numeric: 16206133009206881437
valid_from: '2025-12-06T23:37:14.348347+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:33.284297+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUPEK-akademie_ved_cr_ustav_pro_elektrotechniku_knihovna
- identifier_scheme: GHCID_UUID
identifier_value: f377a24a-a2c3-5754-90e7-22295a9a5022
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 2915a1fc-c781-8a6f-beb9-0629b4602eb7
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2960450437116938863'
- identifier_scheme: RECORD_ID
identifier_value: fb760fd0-96d2-48eb-b2cb-e7974123d634
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.798272+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 8
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB042
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.348347+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 182 02
street_address: Dolejškova 5
normalization_timestamp: '2025-12-09T20:41:05.107965+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.107942+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB042
locations:
- street_address: Dolejškova 5
postal_code: 182 02
city: Praha 8
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:35:06.339629+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCUPEK
ghcid_original: CZ-HL-PRA-L-AVČÚPEK
ghcid_uuid: 77caf563-ecf0-5f59-9224-55acb0d0ff4e
ghcid_uuid_sha256: e7120276-cc52-8cfb-9757-fb99a4b53d77
ghcid_numeric: 16650373481599474939
record_id: 9d388e7a-9745-439a-a98a-886da098a45a
generation_timestamp: '2025-12-06T23:35:06.339629+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCUPEK
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AVCUPEK
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AVCUPEK
ghcid_numeric: 16650373481599474939
valid_from: '2025-12-08T11:21:40.439155+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVČÚPEK)'
- ghcid: CZ-HL-PRA-L-AVČÚPEK
ghcid_numeric: 134491386058511181
valid_from: '2025-12-06T23:35:06.339629+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:40.439155+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCUPEK
- identifier_scheme: GHCID_UUID
identifier_value: 77caf563-ecf0-5f59-9224-55acb0d0ff4e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: e7120276-cc52-8cfb-9757-fb99a4b53d77
- identifier_scheme: GHCID_NUMERIC
identifier_value: '16650373481599474939'
- identifier_scheme: RECORD_ID
identifier_value: 9d388e7a-9745-439a-a98a-886da098a45a
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.798272+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Ústav pro elektrotechniku - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 8
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB042
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.798272+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.339629+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 182 02
street_address: Dolejškova 5
normalization_timestamp: '2025-12-09T20:41:05.152929+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.152904+00:00'

View file

@ -0,0 +1,174 @@
original_entry:
name: Akademie věd ČR - Vývojové dílny
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB058
locations:
- street_address: Pod vodárenskou věží 4
postal_code: 180 00
city: Praha 8
country: CZ
processing_timestamp: '2025-12-06T23:37:14.351182+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCVD-akademie_ved_cr_vyvojove_dilny
ghcid_original: CZ-XX-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny
ghcid_uuid: ccd8c551-111d-5004-8893-af60cb91a370
ghcid_uuid_sha256: c93d3ed6-02d1-864b-b032-a65e83112669
ghcid_numeric: 14500815464094078539
record_id: 168e29e2-f2d5-4d1b-8e85-f0f07f31731c
generation_timestamp: '2025-12-06T23:37:14.351182+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:54.977370+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCVD-akademie_ved_cr_vyvojove_dilny
ghcid_numeric: 14500815464094078539
valid_from: '2025-12-08T11:21:41.358002+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny)'
- ghcid: CZ-XX-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny
ghcid_numeric: 2362682881202093387
valid_from: '2025-12-06T23:37:14.351182+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:41.358002+00:00'
- ghcid: CZ-10-PRA-L-AVČVD-akademie_ved_cr_vyvojove_dilny
valid_from: '2025-12-07T11:42:54.977381+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Vývojové dílny
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCVD-akademie_ved_cr_vyvojove_dilny
- identifier_scheme: GHCID_UUID
identifier_value: ccd8c551-111d-5004-8893-af60cb91a370
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: c93d3ed6-02d1-864b-b032-a65e83112669
- identifier_scheme: GHCID_NUMERIC
identifier_value: '14500815464094078539'
- identifier_scheme: RECORD_ID
identifier_value: 168e29e2-f2d5-4d1b-8e85-f0f07f31731c
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.799317+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:48Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Vývojové dílny
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 8
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB058
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.351182+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: 'TYPE MISMATCH: The Google Place candidate is the Czech Academy of Sciences, a large research institution.
Its Google Place Types are generic (''point_of_interest'', ''establishment'') and do not match any of the required heritage
institution types (museum, library, gallery). Although there is a strong name and location match with the source, it does
not fit the GLAM (Heritage Custodian) definition.'
timestamp: '2025-12-08T19:49:48.074229+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Vývojové dílny official
youtube_search_timestamp: '2025-12-08T19:49:48.416534+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 180 00
street_address: Pod vodárenskou věží 4
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.807961+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.807938+00:00'

View file

@ -0,0 +1,175 @@
original_entry:
name: Akademie věd ČR - Vývojové dílny
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB058
locations:
- street_address: Pod vodárenskou věží 4
postal_code: 180 00
city: Praha 8
country: CZ
processing_timestamp: '2025-12-06T23:35:06.382577+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVCVD
ghcid_original: CZ-XX-PRA-L-AVČVD
ghcid_uuid: ba3d163b-c510-5878-8885-7fedeec11661
ghcid_uuid_sha256: bfd1764c-38e3-82d4-aac0-9cf26c2dfdd2
ghcid_numeric: 13821958801120805588
record_id: d803d048-5d6a-41f2-ae30-b243e288d372
generation_timestamp: '2025-12-06T23:35:06.382577+00:00'
location_resolution:
country_code: CZ
region_code: '10'
city_code: PRA
method: GEONAMES_CITY_LOOKUP
region_name: CZ-10
geonames_id: 8629195
resolution_timestamp: '2025-12-07T11:42:55.652198+00:00'
ghcid_history:
- ghcid: CZ-10-PRA-L-AVCVD
ghcid_numeric: 13821958801120805588
valid_from: '2025-12-08T11:21:37.438639+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-AVČVD)'
- ghcid: CZ-XX-PRA-L-AVČVD
ghcid_numeric: 215626475134910548
valid_from: '2025-12-06T23:35:06.382577+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:37.438639+00:00'
- ghcid: CZ-10-PRA-L-AVČVD
valid_from: '2025-12-07T11:42:55.652217+00:00'
reason: 'Region resolved via GeoNames city lookup: XX->10 (Praha Klanovice)'
custodian_name:
claim_type: custodian_name
claim_value: Akademie věd ČR - Vývojové dílny
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVCVD
- identifier_scheme: GHCID_UUID
identifier_value: ba3d163b-c510-5878-8885-7fedeec11661
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: bfd1764c-38e3-82d4-aac0-9cf26c2dfdd2
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13821958801120805588'
- identifier_scheme: RECORD_ID
identifier_value: d803d048-5d6a-41f2-ae30-b243e288d372
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.799317+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:52Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie věd ČR - Vývojové dílny
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 8
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB058
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.799317+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.382577+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Czech Academy of Sciences
rejection_reason: 'TYPE MISMATCH: The Google Maps candidate ''Czech Academy of Sciences'' is a national research institution,
not a specific museum, library, or archive. Its Google types (''point_of_interest'', ''establishment'') are too general
and do not match the expected heritage types (e.g., museum, library). The source name ''Akademie věd ČR - Vývojové dílny''
(Academy of Sciences of the Czech Republic - Development Workshops) also suggests a research or technical department,
further confirming it is not a public-facing heritage institution.'
timestamp: '2025-12-08T19:49:52.339515+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Akademie věd ČR - Vývojové dílny official
youtube_search_timestamp: '2025-12-08T19:49:52.671667+00:00'
location:
city: Prague
region: CZ-10
region_code: '10'
country: CZ
postal_code: 180 00
street_address: Pod vodárenskou věží 4
geonames_id: 3067696
normalization_timestamp: '2025-12-09T20:40:58.845752+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.845740+00:00'

View file

@ -0,0 +1,280 @@
original_entry:
name: AMBIS vysoká škola, a.s - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABD030
- &id002
identifier_scheme: IČO
identifier_value: '61858307'
- &id003
identifier_scheme: DIČ
identifier_value: CZ61858307
- &id004
identifier_scheme: Website
identifier_value: https://www.ambis.cz
identifier_url: https://www.ambis.cz
- &id005
identifier_scheme: Wikidata
identifier_value: Q114619496
identifier_url: https://www.wikidata.org/wiki/Q114619496
locations:
- street_address: Lindnerova 575/1
postal_code: 180 00
city: Praha 8 - Libeň
region: Hlavní město Praha
country: CZ
latitude: 50.111828
longitude: 14.472999
processing_timestamp: '2025-12-06T23:37:26.591017+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVSSK
ghcid_original: CZ-HL-PRA-L-AVŠSK
ghcid_uuid: a2942158-3abe-500e-a42e-47e3d802cffb
ghcid_uuid_sha256: dbf6e478-f670-8c95-b621-5141084dc9f8
ghcid_numeric: 15850107146758818965
record_id: b7bdc0a9-70d9-484f-bff6-eef614485765
generation_timestamp: '2025-12-06T23:37:26.591017+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVSSK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AVSSK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AVSSK
ghcid_numeric: 15850107146758818965
valid_from: '2025-12-08T11:21:36.402400+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-AVŠSK)'
- ghcid: CZ-HL-PRA-L-AVŠSK
ghcid_numeric: 11709857600197536560
valid_from: '2025-12-06T23:37:26.591017+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:36.402400+00:00'
custodian_name:
claim_type: custodian_name
claim_value: AMBIS vysoká škola, a.s - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVSSK
- identifier_scheme: GHCID_UUID
identifier_value: a2942158-3abe-500e-a42e-47e3d802cffb
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: dbf6e478-f670-8c95-b621-5141084dc9f8
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15850107146758818965'
- identifier_scheme: RECORD_ID
identifier_value: b7bdc0a9-70d9-484f-bff6-eef614485765
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.055665+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-tDge/qPcehBzQ4ZL0RmrfaaoiFExgARmcRFJPWdLs/g=
verified_at: '2025-12-28T19:58:23.178566+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619496
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.178587+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.055665+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AMBIS vysoká škola, a.s - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.055665+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.055665+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 8 - Libeň
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.055665+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114619496
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114619496
timestamp: '2025-11-19T10:14:30.055665+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABD030
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.055665+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:26.591017+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114619496
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:00.684272+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P463
wikidata_labels:
cs: AMBIS vysoká škola, a.s, knihovna
en: AMBIS,JSC, Scholarly Library
wikidata_label_en: AMBIS,JSC, Scholarly Library
wikidata_label_cs: AMBIS vysoká škola, a.s, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- Knihovna AMBIS
wikidata_classification:
instance_of: &id006
- id: Q1622062
label: university library
description: general library at an institution of higher education
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2003486
label: Libeň
description: cadastral area and district of Prague
wikidata_country: *id007
wikidata_organization:
member_of:
- id: Q4809890
label: Association of Library and Information Professionals of the Czech Republic
description: organization
wikidata_contact:
email: mailto:knihovna@ambis.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-tDge/qPcehBzQ4ZL0RmrfaaoiFExgARmcRFJPWdLs/g=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.178566+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619496
verification:
status: verified
last_verified: '2025-12-28T19:58:23.178577+00:00'
location:
latitude: 50.111828
longitude: 14.472999
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 8 - Libeň
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 180 00
street_address: Lindnerova 575/1
normalization_timestamp: '2025-12-09T06:50:47.561464+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:58:33.227071+00:00'
source_url: https://www.ambis.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.ambis.cz/files/resize/16/16/ambis-favicon.png
source_url: https://www.ambis.cz
css_selector: '[document] > html.js.fonts-loaded > head > link:nth-of-type(3)'
retrieved_on: '2025-12-23T21:58:33.227071+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/png
favicon_sizes: 16x16
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 2

View file

@ -0,0 +1,277 @@
original_entry:
name: Akademie výtvarných umění v Praze - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABD021
- &id002
identifier_scheme: IČO
identifier_value: '60461446'
- &id003
identifier_scheme: DIČ
identifier_value: CZ60461446
- &id004
identifier_scheme: Website
identifier_value: http://opac.avu.cz/
identifier_url: http://opac.avu.cz/
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617655
identifier_url: https://www.wikidata.org/wiki/Q114617655
locations:
- street_address: U Akademie 172/4
postal_code: 172 00
city: Praha 7
region: Hlavní město Praha
country: CZ
latitude: 50.102708
longitude: 14.424
processing_timestamp: '2025-12-06T23:37:14.758955+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AVUVPK
ghcid_original: CZ-HL-PRA-L-AVUVPK
ghcid_uuid: 52d6ae9a-c788-5ca6-9ac6-cdfcccee0d3a
ghcid_uuid_sha256: 29049dce-349a-89bc-0e62-9fdf688e56ed
ghcid_numeric: 2955660764433230268
record_id: d7ec6f35-674c-4715-b658-c2e0d50ee1b8
generation_timestamp: '2025-12-06T23:37:14.758955+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AVUVPK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-AVUVPK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-AVUVPK
ghcid_numeric: 2955660764433230268
valid_from: '2025-12-06T23:37:14.758955+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Akademie výtvarných umění v Praze - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AVUVPK
- identifier_scheme: GHCID_UUID
identifier_value: 52d6ae9a-c788-5ca6-9ac6-cdfcccee0d3a
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 29049dce-349a-89bc-0e62-9fdf688e56ed
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2955660764433230268'
- identifier_scheme: RECORD_ID
identifier_value: d7ec6f35-674c-4715-b658-c2e0d50ee1b8
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.805625+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-rbDX60Po9LRK18K3O5FR9eSZgfb5ph0+zBwk4or6wUQ=
verified_at: '2025-12-28T19:58:23.199863+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617655
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.199880+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805625+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Akademie výtvarných umění v Praze - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805625+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805625+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 7
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805625+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617655
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617655
timestamp: '2025-11-19T10:14:29.805625+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABD021
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.805625+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.758955+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617655
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:08.718822+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Akademie výtvarných umění v Praze, knihovna
en: Academy of Fine Arts in Prague, Library
wikidata_label_en: Academy of Fine Arts in Prague, Library
wikidata_label_cs: Akademie výtvarných umění v Praze, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- AVU
wikidata_classification:
instance_of: &id006
- id: Q1622062
label: university library
description: general library at an institution of higher education
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q1490045
label: Prague 7
description: municipal part of Prague
wikidata_country: *id007
wikidata_contact:
email: mailto:tomas.pergler@avu.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-rbDX60Po9LRK18K3O5FR9eSZgfb5ph0+zBwk4or6wUQ=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.199863+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617655
verification:
status: verified
last_verified: '2025-12-28T19:58:23.199872+00:00'
location:
latitude: 50.102708
longitude: 14.424
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 7
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 172 00
street_address: U Akademie 172/4
normalization_timestamp: '2025-12-09T06:50:47.580972+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:58:40.363421+00:00'
source_url: http://opac.avu.cz
extraction_method: crawl4ai
claims:
- claim_type: logo_url
claim_value: http://opac.avu.cz/custom/design/logoavu2.png
source_url: http://opac.avu.cz
css_selector: '#portaro-classic-layout > kp-svelte-component-wrapper.kp-header-component-wrapper.ng-isolate-scope > header.kp-header.logo-stripe > div.logo-content-container.container > div.logo-search-row.row > div.customLogoArea.custom-logo-area > h1.unset-style > img'
retrieved_on: '2025-12-23T21:58:40.363421+00:00'
extraction_method: crawl4ai_header_logo
detection_confidence: high
alt_text: Akademie výtvarných umění
- claim_type: favicon_url
claim_value: http://opac.avu.cz/favicon.png?v=2.3.0-32021
source_url: http://opac.avu.cz
css_selector: '#ng-app > head > link:nth-of-type(9)'
retrieved_on: '2025-12-23T21:58:40.363421+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/png
favicon_sizes: 256x256
summary:
total_claims: 2
has_primary_logo: true
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,160 @@
original_entry:
name: AVICENUM, zdravotnické nakladatelství
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE322
locations:
- street_address: Malostranské nám. 26
postal_code: 118 02
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.610784+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-AZN
ghcid_original: CZ-HL-PRA-L-AZN
ghcid_uuid: 00c6dd57-92d9-54f7-9b20-9f93f29de9c7
ghcid_uuid_sha256: fbb3f4e7-7051-8310-35c9-e0cefa4d0d26
ghcid_numeric: 18137109399210111760
record_id: 3a8d5f57-f602-4f19-abcc-a496565c9120
generation_timestamp: '2025-12-06T23:37:15.610784+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-AZN
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-AZN
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-AZN
ghcid_numeric: 18137109399210111760
valid_from: '2025-12-06T23:37:15.610784+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: AVICENUM, zdravotnické nakladatelství
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-AZN
- identifier_scheme: GHCID_UUID
identifier_value: 00c6dd57-92d9-54f7-9b20-9f93f29de9c7
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: fbb3f4e7-7051-8310-35c9-e0cefa4d0d26
- identifier_scheme: GHCID_NUMERIC
identifier_value: '18137109399210111760'
- identifier_scheme: RECORD_ID
identifier_value: 3a8d5f57-f602-4f19-abcc-a496565c9120
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.912197+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.912197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: AVICENUM, zdravotnické nakladatelství
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.912197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.912197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.912197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE322
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.912197+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.610784+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 02
street_address: Malostranské nám. 26
normalization_timestamp: '2025-12-09T20:41:05.207401+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.207376+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: BENZINA
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF008
locations:
- street_address: Spálená 5
postal_code: 111 28
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.782203+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-B
ghcid_original: CZ-HL-PRA-L-B
ghcid_uuid: b53b98b7-2cbf-5d8d-b793-f0d601d14c77
ghcid_uuid_sha256: 44a7a076-2f51-8988-83ea-7ba7c46b360e
ghcid_numeric: 4947099145150265736
record_id: 168aece8-08ed-4da9-adbc-8150b306ed07
generation_timestamp: '2025-12-06T23:37:15.782203+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-B
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-B
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-B
ghcid_numeric: 4947099145150265736
valid_from: '2025-12-06T23:37:15.782203+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: BENZINA
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-B
- identifier_scheme: GHCID_UUID
identifier_value: b53b98b7-2cbf-5d8d-b793-f0d601d14c77
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 44a7a076-2f51-8988-83ea-7ba7c46b360e
- identifier_scheme: GHCID_NUMERIC
identifier_value: '4947099145150265736'
- identifier_scheme: RECORD_ID
identifier_value: 168aece8-08ed-4da9-adbc-8150b306ed07
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914126+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914126+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: BENZINA
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914126+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914126+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914126+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF008
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914126+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.782203+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 111 28
street_address: Spálená 5
normalization_timestamp: '2025-12-09T20:41:05.253191+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.253168+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: Barvy a laky
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC057
locations:
- street_address: Českomoravská 29
postal_code: 190 00
city: Praha 9
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.488435+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BL
ghcid_original: CZ-HL-PRA-L-BL
ghcid_uuid: 49bed947-bbe7-5811-9b7c-001675e2d6d6
ghcid_uuid_sha256: 297bde07-c161-877f-a82a-5dd56cffbaba
ghcid_numeric: 2989226902580967295
record_id: 002a2a73-db8c-4ed7-aa4a-3228a1c76a97
generation_timestamp: '2025-12-06T23:37:14.488435+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BL
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BL
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BL
ghcid_numeric: 2989226902580967295
valid_from: '2025-12-06T23:37:14.488435+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Barvy a laky
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BL
- identifier_scheme: GHCID_UUID
identifier_value: 49bed947-bbe7-5811-9b7c-001675e2d6d6
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 297bde07-c161-877f-a82a-5dd56cffbaba
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2989226902580967295'
- identifier_scheme: RECORD_ID
identifier_value: 002a2a73-db8c-4ed7-aa4a-3228a1c76a97
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.802491+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.802491+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Barvy a laky
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.802491+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.802491+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.802491+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABC057
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.802491+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.488435+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 190 00
street_address: Českomoravská 29
normalization_timestamp: '2025-12-09T20:41:05.295510+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.295483+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: Barvy a laky - závod 5
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF511
locations:
- street_address: Přátelství 550
postal_code: 104 00
city: Praha 10 - Uhříněves
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.373440+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BLZ
ghcid_original: CZ-HL-PRA-L-BLZ
ghcid_uuid: 2c47abc8-7588-5b11-8963-71c9377d507e
ghcid_uuid_sha256: 834799aa-8861-8c7d-6ce3-6d36f8dce2f0
ghcid_numeric: 9459698500026625149
record_id: 1e659114-a761-4f09-a898-91c128325318
generation_timestamp: '2025-12-06T23:37:16.373440+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BLZ
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BLZ
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BLZ
ghcid_numeric: 9459698500026625149
valid_from: '2025-12-06T23:37:16.373440+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Barvy a laky - závod 5
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BLZ
- identifier_scheme: GHCID_UUID
identifier_value: 2c47abc8-7588-5b11-8963-71c9377d507e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 834799aa-8861-8c7d-6ce3-6d36f8dce2f0
- identifier_scheme: GHCID_NUMERIC
identifier_value: '9459698500026625149'
- identifier_scheme: RECORD_ID
identifier_value: 1e659114-a761-4f09-a898-91c128325318
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.918845+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918845+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Barvy a laky - závod 5
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918845+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918845+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 10 - Uhříněves
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918845+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF511
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918845+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.373440+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 104 00
street_address: Přátelství 550
normalization_timestamp: '2025-12-09T20:41:05.343604+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.343576+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: Bristol-Myers Squibb, s.r.o.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF018
locations:
- street_address: Olivova 4/2096
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:24.186843+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BMSSR
ghcid_original: CZ-HL-PRA-L-BMSSR
ghcid_uuid: ae0d311b-9a87-57bb-a729-38ee3806551b
ghcid_uuid_sha256: 732ffc3e-197f-8142-163d-2d91ea6f5289
ghcid_numeric: 8300129981913076034
record_id: 21e138b9-56a7-41e4-bd57-9f8da97e0d56
generation_timestamp: '2025-12-06T23:37:24.186843+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BMSSR
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BMSSR
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BMSSR
ghcid_numeric: 8300129981913076034
valid_from: '2025-12-06T23:37:24.186843+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Bristol-Myers Squibb, s.r.o.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BMSSR
- identifier_scheme: GHCID_UUID
identifier_value: ae0d311b-9a87-57bb-a729-38ee3806551b
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 732ffc3e-197f-8142-163d-2d91ea6f5289
- identifier_scheme: GHCID_NUMERIC
identifier_value: '8300129981913076034'
- identifier_scheme: RECORD_ID
identifier_value: 21e138b9-56a7-41e4-bd57-9f8da97e0d56
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.034986+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.034986+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Bristol-Myers Squibb, s.r.o.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.034986+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.034986+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.034986+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF018
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.034986+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:24.186843+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Olivova 4/2096
normalization_timestamp: '2025-12-09T20:41:05.399591+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.399563+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: Balírny obchodu Praha, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF237
locations:
- street_address: Karlovo nám. 18
postal_code: 120 00
city: Praha 2
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.187235+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BOPS
ghcid_original: CZ-HL-PRA-L-BOPS
ghcid_uuid: 7beaa98c-c3fb-574c-95cf-1df1778178ba
ghcid_uuid_sha256: 6ab92454-360f-865a-41a9-a05c376f827f
ghcid_numeric: 7690217782814639706
record_id: dc435520-e235-436b-9023-4f66b4208c10
generation_timestamp: '2025-12-06T23:37:16.187235+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BOPS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BOPS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BOPS
ghcid_numeric: 7690217782814639706
valid_from: '2025-12-06T23:37:16.187235+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Balírny obchodu Praha, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BOPS
- identifier_scheme: GHCID_UUID
identifier_value: 7beaa98c-c3fb-574c-95cf-1df1778178ba
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 6ab92454-360f-865a-41a9-a05c376f827f
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7690217782814639706'
- identifier_scheme: RECORD_ID
identifier_value: dc435520-e235-436b-9023-4f66b4208c10
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.917462+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917462+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Balírny obchodu Praha, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917462+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917462+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917462+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF237
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917462+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.187235+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 120 00
street_address: Karlovo nám. 18
normalization_timestamp: '2025-12-09T20:41:05.444789+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.444776+00:00'

View file

@ -0,0 +1,272 @@
original_entry:
name: Bärenreiter Praha s.r.o. - Půjčovna notových materiálů
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF314
- &id002
identifier_scheme: IČO
identifier_value: '40527352'
- &id003
identifier_scheme: DIČ
identifier_value: CZ40527352
- &id004
identifier_scheme: Website
identifier_value: https://www.baerenreiter.cz/
identifier_url: https://www.baerenreiter.cz/
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617814
identifier_url: https://www.wikidata.org/wiki/Q114617814
locations:
- street_address: Perunova 1412/10
postal_code: 130 00
city: Praha 3
region: Hlavní město Praha
country: CZ
latitude: 50.07673
longitude: 14.454035
processing_timestamp: '2025-12-06T23:37:16.222539+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BPSRPNM
ghcid_original: CZ-HL-PRA-L-BPSRPNM
ghcid_uuid: f517ef34-3199-5fa8-a64e-a3d6c6bc8b79
ghcid_uuid_sha256: f1b94f67-64ea-8685-9497-237fda7ff180
ghcid_numeric: 17418040339325408901
record_id: b4884eee-e6aa-48b6-b13c-12711d7cd24b
generation_timestamp: '2025-12-06T23:37:16.222539+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BPSRPNM
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-BPSRPNM
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-BPSRPNM
ghcid_numeric: 17418040339325408901
valid_from: '2025-12-06T23:37:16.222539+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Bärenreiter Praha s.r.o. - Půjčovna notových materiálů
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BPSRPNM
- identifier_scheme: GHCID_UUID
identifier_value: f517ef34-3199-5fa8-a64e-a3d6c6bc8b79
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: f1b94f67-64ea-8685-9497-237fda7ff180
- identifier_scheme: GHCID_NUMERIC
identifier_value: '17418040339325408901'
- identifier_scheme: RECORD_ID
identifier_value: b4884eee-e6aa-48b6-b13c-12711d7cd24b
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.917698+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-adAcvng7eL1uAzyjqBlS4toCs3qHqquBYX6PXZUcX/s=
verified_at: '2025-12-28T19:58:23.277377+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617814
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.277402+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917698+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Bärenreiter Praha s.r.o. - Půjčovna notových materiálů
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917698+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917698+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 3
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917698+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617814
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617814
timestamp: '2025-11-19T10:14:29.917698+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABF314
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917698+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.222539+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617814
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:16.758067+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Bärenreiter Praha s.r.o., Půjčovna notových materiálů (knihovna)
en: Bärenreiter Praha s.r.o., Hire Library
wikidata_label_en: Bärenreiter Praha s.r.o., Hire Library
wikidata_label_cs: Bärenreiter Praha s.r.o., Půjčovna notových materiálů (knihovna)
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id006
- id: Q385994
label: special library
description: specialized library providing resources on a particular topic or discipline
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2598899
label: Prague 3
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id007
wikidata_contact:
email: mailto:hire@baerenreiter.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-adAcvng7eL1uAzyjqBlS4toCs3qHqquBYX6PXZUcX/s=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.277377+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617814
verification:
status: verified
last_verified: '2025-12-28T19:58:23.277390+00:00'
location:
latitude: 50.07673
longitude: 14.454035
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 3
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 130 00
street_address: Perunova 1412/10
normalization_timestamp: '2025-12-09T06:50:47.749663+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:58:58.246784+00:00'
source_url: https://www.baerenreiter.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.baerenreiter.cz/sites/default/files/favicon.ico
source_url: https://www.baerenreiter.cz
css_selector: '[document] > html.js > head > link'
retrieved_on: '2025-12-23T21:58:58.246784+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/vnd.microsoft.icon
favicon_sizes: ''
- claim_type: og_image_url
claim_value: https://www.baerenreiter.cz/sites/default/files/fb.png
source_url: https://www.baerenreiter.cz
css_selector: '[document] > html.js > head > meta:nth-of-type(10)'
retrieved_on: '2025-12-23T21:58:58.246784+00:00'
extraction_method: crawl4ai_meta_og
summary:
total_claims: 2
has_primary_logo: false
has_favicon: true
has_og_image: true
favicon_count: 1

View file

@ -0,0 +1,160 @@
original_entry:
name: BALNEX, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE209
locations:
- street_address: Křižovnická 3
postal_code: 110 01
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.404725+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BS
ghcid_original: CZ-HL-PRA-L-BS
ghcid_uuid: a6c010de-713b-583c-911c-7360c964f0c8
ghcid_uuid_sha256: d5f79d41-d62a-81e5-fa61-a849a722c094
ghcid_numeric: 15417964755417694693
record_id: 5bab80da-b02b-44fb-90a4-4be4ae66abc6
generation_timestamp: '2025-12-06T23:37:15.404725+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BS
ghcid_numeric: 15417964755417694693
valid_from: '2025-12-06T23:37:15.404725+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: BALNEX, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BS
- identifier_scheme: GHCID_UUID
identifier_value: a6c010de-713b-583c-911c-7360c964f0c8
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: d5f79d41-d62a-81e5-fa61-a849a722c094
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15417964755417694693'
- identifier_scheme: RECORD_ID
identifier_value: 5bab80da-b02b-44fb-90a4-4be4ae66abc6
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.909359+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.909359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: BALNEX, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.909359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.909359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.909359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE209
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.909359+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.404725+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 01
street_address: Křižovnická 3
normalization_timestamp: '2025-12-09T20:41:05.487903+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.487890+00:00'

View file

@ -0,0 +1,161 @@
custodian_name:
claim_type: custodian_name
claim_value: BIJO TC, a.s.
source_type: ch_annotator
original_entry:
name: BIJO TC, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC125
locations:
- street_address: Tiskařská 10/257
postal_code: 108 28
city: Praha 10 - Malešice
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.605316+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BTS
ghcid_original: CZ-HL-PRA-L-BTS
ghcid_uuid: 46fb3260-4257-5d4f-859e-32420cdf87b5
ghcid_uuid_sha256: ca43ed37-dcf6-8c51-0af3-e33d300f2c68
ghcid_numeric: 14574753643287153745
record_id: 8748db8f-27f7-42d7-9ea4-9a141e04d05d
generation_timestamp: '2025-12-06T23:37:14.605316+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BTS
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-BTS
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-BTS
ghcid_numeric: 14574753643287153745
valid_from: '2025-12-06T23:37:14.605316+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BTS
- identifier_scheme: GHCID_UUID
identifier_value: 46fb3260-4257-5d4f-859e-32420cdf87b5
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: ca43ed37-dcf6-8c51-0af3-e33d300f2c68
- identifier_scheme: GHCID_NUMERIC
identifier_value: '14574753643287153745'
- identifier_scheme: RECORD_ID
identifier_value: 8748db8f-27f7-42d7-9ea4-9a141e04d05d
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.803741+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:46Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803741+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: BIJO TC, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803741+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803741+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 10 - Malešice
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803741+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABC125
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803741+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.605316+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 108 28
street_address: Tiskařská 10/257
normalization_timestamp: '2025-12-09T20:41:05.534948+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.534935+00:00'

View file

@ -0,0 +1,160 @@
original_entry:
name: BUREAU VERITAS CZECH REPUBLIC, s.r.o.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF042
locations:
- street_address: Olbrachtova 1
postal_code: 140 00
city: Praha 4
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.840934+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BVCRSR
ghcid_original: CZ-HL-PRA-L-BVCRSR
ghcid_uuid: b4d15574-7d65-5c95-b400-0cd921f8038f
ghcid_uuid_sha256: adba6580-9cb8-8cba-3b07-d6bc76134ec4
ghcid_numeric: 12518429717289270458
record_id: 2cdae77e-3374-4bf4-9254-ea086fa284d1
generation_timestamp: '2025-12-06T23:37:15.840934+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BVCRSR
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BVCRSR
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BVCRSR
ghcid_numeric: 12518429717289270458
valid_from: '2025-12-06T23:37:15.840934+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: BUREAU VERITAS CZECH REPUBLIC, s.r.o.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BVCRSR
- identifier_scheme: GHCID_UUID
identifier_value: b4d15574-7d65-5c95-b400-0cd921f8038f
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: adba6580-9cb8-8cba-3b07-d6bc76134ec4
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12518429717289270458'
- identifier_scheme: RECORD_ID
identifier_value: 2cdae77e-3374-4bf4-9254-ea086fa284d1
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914590+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914590+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: BUREAU VERITAS CZECH REPUBLIC, s.r.o.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914590+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914590+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914590+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF042
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914590+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.840934+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 140 00
street_address: Olbrachtova 1
normalization_timestamp: '2025-12-09T20:41:05.573445+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.573432+00:00'

View file

@ -0,0 +1,277 @@
original_entry:
name: Botanická zahrada hl. m. Prahy - knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE404
- &id002
identifier_scheme: IČO
identifier_value: '00064572'
- &id003
identifier_scheme: DIČ
identifier_value: CZ00064572
- &id004
identifier_scheme: Website
identifier_value: https://knihovna.botanicka.cz/
identifier_url: https://knihovna.botanicka.cz/
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617803
identifier_url: https://www.wikidata.org/wiki/Q114617803
locations:
- street_address: Trojská 800/196
postal_code: 171 00
city: Praha 7 - Troja
region: Hlavní město Praha
country: CZ
latitude: 50.121983
longitude: 14.412453
processing_timestamp: '2025-12-06T23:37:15.700458+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BZHMPK
ghcid_original: CZ-HL-PRA-L-BZHMPK
ghcid_uuid: ddb28542-c287-5302-9f45-281bda8f18d1
ghcid_uuid_sha256: 6de8d9c8-9d4d-8589-b9b1-7776aad03b6a
ghcid_numeric: 7919819400386827657
record_id: 965cbe79-ccdd-4561-8acc-7ae286cc3ef2
generation_timestamp: '2025-12-06T23:37:15.700458+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BZHMPK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-BZHMPK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-BZHMPK
ghcid_numeric: 7919819400386827657
valid_from: '2025-12-06T23:37:15.700458+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: Botanická zahrada hl. m. Prahy - knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BZHMPK
- identifier_scheme: GHCID_UUID
identifier_value: ddb28542-c287-5302-9f45-281bda8f18d1
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 6de8d9c8-9d4d-8589-b9b1-7776aad03b6a
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7919819400386827657'
- identifier_scheme: RECORD_ID
identifier_value: 965cbe79-ccdd-4561-8acc-7ae286cc3ef2
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.913331+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-fnZfc5LlrZv1vMwhc7UMIawFYMK4eplZVEeWgDEpN0Y=
verified_at: '2025-12-28T19:58:23.327120+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617803
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.327139+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913331+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Botanická zahrada hl. m. Prahy - knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913331+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913331+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 7 - Troja
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913331+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617803
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617803
timestamp: '2025-11-19T10:14:29.913331+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE404
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.913331+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.700458+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617803
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:25.230947+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Botanická zahrada hl. m. Prahy, knihovna
en: Prague Botanical Garden
wikidata_label_en: Prague Botanical Garden
wikidata_label_cs: Botanická zahrada hl. m. Prahy, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- BZ Praha
wikidata_classification:
instance_of: &id006
- id: Q385994
label: special library
description: specialized library providing resources on a particular topic or discipline
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q84682501
label: Troja
description: cadastral area in Prague
wikidata_country: *id007
wikidata_contact:
email: mailto:info@botanicka.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-fnZfc5LlrZv1vMwhc7UMIawFYMK4eplZVEeWgDEpN0Y=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.327120+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617803
verification:
status: verified
last_verified: '2025-12-28T19:58:23.327130+00:00'
location:
latitude: 50.121983
longitude: 14.412453
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 7 - Troja
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 171 00
street_address: Trojská 800/196
normalization_timestamp: '2025-12-09T06:50:47.837815+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:59:10.228060+00:00'
source_url: https://knihovna.botanicka.cz
extraction_method: crawl4ai
claims:
- claim_type: logo_url
claim_value: https://knihovna.botanicka.cz/custom/design/logo.png
source_url: https://knihovna.botanicka.cz
css_selector: '#portaro-classic-layout > kp-svelte-component-wrapper.kp-header-component-wrapper.ng-isolate-scope > header.kp-header.logo-stripe > div.logo-content-container.container > div.logo-search-row.row > div.customLogoArea.custom-logo-area > div > a > img'
retrieved_on: '2025-12-23T21:59:10.228060+00:00'
extraction_method: crawl4ai_header_logo
detection_confidence: high
alt_text: Botanická zahrada Praha
- claim_type: favicon_url
claim_value: https://knihovna.botanicka.cz/favicon.png?v=2.3.0-32021
source_url: https://knihovna.botanicka.cz
css_selector: '#ng-app > head > link:nth-of-type(9)'
retrieved_on: '2025-12-23T21:59:10.228060+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/png
favicon_sizes: 256x256
summary:
total_claims: 2
has_primary_logo: true
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,160 @@
original_entry:
name: BARUM Zlín - výrobní podnik MITAS
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF219
locations:
- street_address: Komarovova 1900
postal_code: 106 24
city: Praha 10
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.166411+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-BZVPM
ghcid_original: CZ-HL-PRA-L-BZVPM
ghcid_uuid: 2bc7f6fb-4623-5d84-bc09-5af68d2b30d5
ghcid_uuid_sha256: 7ee23211-b1ce-87fe-b290-acc47c3d2166
ghcid_numeric: 9142925245094483966
record_id: 8ce281e5-83f2-413e-83e0-286609a9437c
generation_timestamp: '2025-12-06T23:37:16.166411+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-BZVPM
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-BZVPM
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-BZVPM
ghcid_numeric: 9142925245094483966
valid_from: '2025-12-06T23:37:16.166411+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: BARUM Zlín - výrobní podnik MITAS
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-BZVPM
- identifier_scheme: GHCID_UUID
identifier_value: 2bc7f6fb-4623-5d84-bc09-5af68d2b30d5
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 7ee23211-b1ce-87fe-b290-acc47c3d2166
- identifier_scheme: GHCID_NUMERIC
identifier_value: '9142925245094483966'
- identifier_scheme: RECORD_ID
identifier_value: 8ce281e5-83f2-413e-83e0-286609a9437c
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.917237+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917237+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: BARUM Zlín - výrobní podnik MITAS
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917237+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917237+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 10
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917237+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF219
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.917237+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.166411+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 106 24
street_address: Komarovova 1900
normalization_timestamp: '2025-12-09T20:41:05.614114+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.614068+00:00'

View file

@ -0,0 +1,233 @@
original_entry:
name: ČKAIT
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE121
- &id002
identifier_scheme: IČO
identifier_value: '45770743'
- &id003
identifier_scheme: DIČ
identifier_value: CZ45770743
- &id004
identifier_scheme: Website
identifier_value: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
identifier_url: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
locations:
- street_address: Sokolská 15
postal_code: 120 00
city: Praha 2
region: Hlavní město Praha
country: CZ
latitude: 50.073689
longitude: 14.428981
processing_timestamp: '2025-12-06T23:37:24.220409+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-C
ghcid_original: CZ-HL-PRA-L-Č
ghcid_uuid: fc03c3b2-07eb-59c3-8cec-76a02a126334
ghcid_uuid_sha256: 69fa5c4f-fa31-8052-a145-b97bc584a8c4
ghcid_numeric: 7636517616729829458
record_id: e4649a2e-d847-4fff-988c-474675df99b5
generation_timestamp: '2025-12-06T23:37:24.220409+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-C
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-C
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-C
ghcid_numeric: 7636517616729829458
valid_from: '2025-12-08T11:21:30.867239+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-Č)'
- ghcid: CZ-HL-PRA-L-Č
ghcid_numeric: 6753267552530310287
valid_from: '2025-12-06T23:37:24.220409+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:30.867239+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ČKAIT
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-C
- identifier_scheme: GHCID_UUID
identifier_value: fc03c3b2-07eb-59c3-8cec-76a02a126334
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 69fa5c4f-fa31-8052-a145-b97bc584a8c4
- identifier_scheme: GHCID_NUMERIC
identifier_value: '7636517616729829458'
- identifier_scheme: RECORD_ID
identifier_value: e4649a2e-d847-4fff-988c-474675df99b5
- *id001
- *id002
- *id003
- *id004
- identifier_scheme: Wikidata
identifier_value: Q114619194
identifier_url: https://www.wikidata.org/wiki/Q114619194
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.035396+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-9flePgPRModhuSmKlHwWMjPrGn5nVoXBoUUCCLzfbjw=
verified_at: '2025-12-28T19:58:23.355427+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619194
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.355445+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035396+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ČKAIT
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035396+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035396+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035396+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE121
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.035396+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:24.220409+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
latitude: 50.073689
longitude: 14.428981
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 2
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 120 00
street_address: Sokolská 15
normalization_timestamp: '2025-12-09T06:50:47.895364+00:00'
wikidata_enrichment:
wikidata_id: Q114619194
wikidata_url: https://www.wikidata.org/wiki/Q114619194
matched_by: sigla_identifier
matched_sigla: ABE121
wikidata_label: ČKAIT (knihovna)
wikidata_description: knihovna v Praze
enrichment_date: '2025-12-19T10:13:02.020089+00:00'
enrichment_version: 2.1.0
instance_of:
- Q385994
_provenance:
content_hash:
algorithm: sha256
value: sha256-9flePgPRModhuSmKlHwWMjPrGn5nVoXBoUUCCLzfbjw=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.355427+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619194
generatedAtTime: '2025-12-19T10:13:02.020089+00:00'
verification:
status: verified
last_verified: '2025-12-28T19:58:23.355435+00:00'
wikidata_entity_id: Q114619194
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:59:18.694376+00:00'
source_url: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
extraction_method: crawl4ai
claims:
- claim_type: og_image_url
claim_value: https://www.ckait.cz/sites/default/files/logo_0.png
source_url: https://www.ckait.cz/content/knihovna-studovna-vyuzivejte-zdroje-komory
css_selector: '[document] > html.js > head > meta:nth-of-type(4)'
retrieved_on: '2025-12-23T21:59:18.694376+00:00'
extraction_method: crawl4ai_meta_og
summary:
total_claims: 1
has_primary_logo: false
has_favicon: false
has_og_image: true
favicon_count: 0

View file

@ -0,0 +1,165 @@
original_entry:
name: Československé aerolinie
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF013
locations:
- street_address: Revoluční 1
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.757670+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CA
ghcid_original: CZ-HL-PRA-L-ČA
ghcid_uuid: ec8f4d30-cab6-563a-9e95-68ecde7d50d2
ghcid_uuid_sha256: 0e090c6c-ff12-8a37-b143-e016cadec154
ghcid_numeric: 1011353253596809783
record_id: 2eef5618-dd76-4088-98bd-052f9a7f1daa
generation_timestamp: '2025-12-06T23:37:25.757670+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CA
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CA
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CA
ghcid_numeric: 1011353253596809783
valid_from: '2025-12-08T11:21:40.305739+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČA)'
- ghcid: CZ-HL-PRA-L-ČA
ghcid_numeric: 14316824294466373603
valid_from: '2025-12-06T23:37:25.757670+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:40.305739+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Československé aerolinie
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CA
- identifier_scheme: GHCID_UUID
identifier_value: ec8f4d30-cab6-563a-9e95-68ecde7d50d2
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 0e090c6c-ff12-8a37-b143-e016cadec154
- identifier_scheme: GHCID_NUMERIC
identifier_value: '1011353253596809783'
- identifier_scheme: RECORD_ID
identifier_value: 2eef5618-dd76-4088-98bd-052f9a7f1daa
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.047770+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047770+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Československé aerolinie
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047770+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047770+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047770+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF013
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047770+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.757670+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Revoluční 1
normalization_timestamp: '2025-12-09T20:41:05.659406+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.659389+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Česká agentura na podporu obchodu / CzechTrade
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC078
locations:
- street_address: Politických vězňů 20
postal_code: 111 21
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:14.535093+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CAPOC
ghcid_original: CZ-HL-PRA-L-ČAPOC
ghcid_uuid: bd63e385-f54a-59f0-8fc2-9a816ff748dc
ghcid_uuid_sha256: bf51eb6c-2cd1-8f54-9b13-21677ac27e17
ghcid_numeric: 13786058784198733652
record_id: 76cf57a2-2e08-4564-ab23-7d6a2916eb29
generation_timestamp: '2025-12-06T23:37:14.535093+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CAPOC
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CAPOC
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CAPOC
ghcid_numeric: 13786058784198733652
valid_from: '2025-12-08T11:21:23.125025+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČAPOC)'
- ghcid: CZ-HL-PRA-L-ČAPOC
ghcid_numeric: 436287342414373497
valid_from: '2025-12-06T23:37:14.535093+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:23.125025+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Česká agentura na podporu obchodu / CzechTrade
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CAPOC
- identifier_scheme: GHCID_UUID
identifier_value: bd63e385-f54a-59f0-8fc2-9a816ff748dc
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: bf51eb6c-2cd1-8f54-9b13-21677ac27e17
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13786058784198733652'
- identifier_scheme: RECORD_ID
identifier_value: 76cf57a2-2e08-4564-ab23-7d6a2916eb29
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.803043+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803043+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Česká agentura na podporu obchodu / CzechTrade
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803043+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803043+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803043+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABC078
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803043+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.535093+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 111 21
street_address: Politických vězňů 20
normalization_timestamp: '2025-12-09T20:41:05.703407+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.703378+00:00'

View file

@ -0,0 +1,173 @@
original_entry:
name: České aerolinie, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF014
- &id002
identifier_scheme: IČO
identifier_value: '45795908'
- &id003
identifier_scheme: DIČ
identifier_value: CZ45795908
locations:
- street_address: Ruzyně - letiště
postal_code: 160 08
city: Praha 6
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.792471+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CAS
ghcid_original: CZ-HL-PRA-L-ČAS
ghcid_uuid: e1fd3676-d4e0-5959-bb25-ff516014a9dc
ghcid_uuid_sha256: b9edb1dd-c76f-853f-bd4b-3b74130c84ad
ghcid_numeric: 13397560032588944703
record_id: 6f5b847f-5531-45b6-994f-fde37203ff3b
generation_timestamp: '2025-12-06T23:37:15.792471+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CAS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CAS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CAS
ghcid_numeric: 13397560032588944703
valid_from: '2025-12-08T11:21:21.233118+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČAS)'
- ghcid: CZ-HL-PRA-L-ČAS
ghcid_numeric: 16494477793071435589
valid_from: '2025-12-06T23:37:15.792471+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.233118+00:00'
custodian_name:
claim_type: custodian_name
claim_value: České aerolinie, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CAS
- identifier_scheme: GHCID_UUID
identifier_value: e1fd3676-d4e0-5959-bb25-ff516014a9dc
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b9edb1dd-c76f-853f-bd4b-3b74130c84ad
- identifier_scheme: GHCID_NUMERIC
identifier_value: '13397560032588944703'
- identifier_scheme: RECORD_ID
identifier_value: 6f5b847f-5531-45b6-994f-fde37203ff3b
- *id001
- *id002
- *id003
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914209+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914209+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: České aerolinie, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914209+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914209+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 6
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914209+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF014
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914209+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.792471+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 160 08
street_address: Ruzyně - letiště
normalization_timestamp: '2025-12-09T20:41:05.751437+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.751422+00:00'

View file

@ -0,0 +1,270 @@
original_entry:
name: Česká botanická společnost - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB108
- &id002
identifier_scheme: IČO
identifier_value: '00444553'
- &id003
identifier_scheme: Website
identifier_value: https://katalog.botanospol.cz/
identifier_url: https://katalog.botanospol.cz/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114617585
identifier_url: https://www.wikidata.org/wiki/Q114617585
locations:
- street_address: Benátská 2
postal_code: 128 00
city: Praha 2
region: Hlavní město Praha
country: CZ
latitude: 50.071464
longitude: 14.420864
processing_timestamp: '2025-12-06T23:35:06.433942+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CBSK
ghcid_original: CZ-HL-PRA-L-ČBSK
ghcid_uuid: 8c696492-e42e-5b10-a0b0-d67d7be12203
ghcid_uuid_sha256: 9ec8dfa1-4d46-84b2-9a72-a4a7483f2012
ghcid_numeric: 11441640737213916338
record_id: a52123de-0f83-4b07-a53a-e96b07d5b66e
generation_timestamp: '2025-12-06T23:35:06.433942+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CBSK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CBSK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CBSK
ghcid_numeric: 11441640737213916338
valid_from: '2025-12-08T11:21:27.632820+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČBSK)'
- ghcid: CZ-HL-PRA-L-ČBSK
ghcid_numeric: 3946897244846925513
valid_from: '2025-12-06T23:35:06.433942+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:27.632820+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Česká botanická společnost - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CBSK
- identifier_scheme: GHCID_UUID
identifier_value: 8c696492-e42e-5b10-a0b0-d67d7be12203
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 9ec8dfa1-4d46-84b2-9a72-a4a7483f2012
- identifier_scheme: GHCID_NUMERIC
identifier_value: '11441640737213916338'
- identifier_scheme: RECORD_ID
identifier_value: a52123de-0f83-4b07-a53a-e96b07d5b66e
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.800191+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:47Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-oY4wPu7P3YKsrWpr2H3FmyYjbSLqUh+rxXz32uzFU2c=
verified_at: '2025-12-28T19:58:23.405533+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617585
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.405552+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800191+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Česká botanická společnost - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800191+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800191+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800191+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617585
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617585
timestamp: '2025-11-19T10:14:29.800191+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABB108
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.800191+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.433942+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617585
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:33.397393+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Česká botanická společnost, knihovna
en: Czech Botanical Society
wikidata_label_en: Czech Botanical Society
wikidata_label_cs: Česká botanická společnost, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- ČBS
wikidata_classification:
instance_of: &id005
- id: Q385994
label: special library
description: specialized library providing resources on a particular topic or discipline
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2444636
label: Prague 2
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:botspol@natur.cuni.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-oY4wPu7P3YKsrWpr2H3FmyYjbSLqUh+rxXz32uzFU2c=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.405533+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617585
verification:
status: verified
last_verified: '2025-12-28T19:58:23.405542+00:00'
location:
latitude: 50.071464
longitude: 14.420864
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 2
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 128 00
street_address: Benátská 2
normalization_timestamp: '2025-12-09T06:50:47.995759+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T21:59:31.706423+00:00'
source_url: https://katalog.botanospol.cz
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://katalog.botanospol.cz/themes/root/images/vufind-favicon.ico
source_url: https://katalog.botanospol.cz
css_selector: '[document] > html > head > link:nth-of-type(7)'
retrieved_on: '2025-12-23T21:59:31.706423+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/x-icon
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,161 @@
original_entry:
name: Československá botanická společnost při AV ČR
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE333
locations: []
processing_timestamp: '2025-12-06T23:37:25.736072+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CBSPAC
ghcid_original: XX-XX-XXX-L-ČBSPAČ
ghcid_uuid: 1cd8e994-d075-551d-978c-51d7f78ce6cb
ghcid_uuid_sha256: b1e65467-4d97-84c9-89ec-7d46502ff5d6
ghcid_numeric: 12819026192016864457
record_id: 5e1f9dc6-7fc7-4a56-83cd-9150353177d9
generation_timestamp: '2025-12-06T23:37:25.736072+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: NAME_PATTERN_EXTRACTION
resolution_timestamp: '2025-12-06T23:56:13.484431+00:00'
resolution_notes: City 'AV ČR' extracted from institution name
ghcid_history:
- ghcid: CZ-10-PRA-L-CBSPAC
ghcid_numeric: 12819026192016864457
valid_from: '2025-12-08T11:21:22.614180+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-ČBSPAČ)'
- ghcid: XX-XX-XXX-L-ČBSPAČ
ghcid_numeric: 10375321127213274876
valid_from: '2025-12-06T23:37:25.736072+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:22.614180+00:00'
- ghcid: CZ-XX-XXX-L-ČBSPAČ
valid_from: '2025-12-06T23:56:13.484437+00:00'
reason: 'Country resolved via Wikidata P17: XX→CZ'
- ghcid: CZ-10-PRA-L-ČBSPAČ
valid_from: '2025-12-07T12:38:22.576530+00:00'
reason: 'Location resolved from institution name pattern: ''AV ČR'' → region 10, city PRA'
custodian_name:
claim_type: custodian_name
claim_value: Československá botanická společnost při AV ČR
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CBSPAC
- identifier_scheme: GHCID_UUID
identifier_value: 1cd8e994-d075-551d-978c-51d7f78ce6cb
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b1e65467-4d97-84c9-89ec-7d46502ff5d6
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12819026192016864457'
- identifier_scheme: RECORD_ID
identifier_value: 5e1f9dc6-7fc7-4a56-83cd-9150353177d9
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.047635+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'Country resolved 2025-12-06T23:56:13Z: XX→CZ via Wikidata P17'
- 'YouTube/Google Maps enrichment 2025-12-08T19:49:58Z: Maps: rejected by LLM; YouTube: not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047635+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Československá botanická společnost při AV ČR
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047635+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047635+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: ghcid
claim_value: CZ-ABE333
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047635+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.736072+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Česká botanická společnost
rejection_reason: 'NAME MATCH: Names are similar, likely referring to the same entity (''Československá'' vs ''Česká'' are
close equivalents). LOCATION MATCH: Location is a strong match (both are in Prague, Czechia). TYPE MATCH: Google Place
types (''point_of_interest'', ''establishment'') do not match the expected heritage institution types (museum, library,
etc.). ENTITY TYPE: The entity is a learned society (a botanical society), not a heritage custodian institution like a
museum, library, gallery, or archive. Therefore, it is not a GRP.HER.'
timestamp: '2025-12-08T19:49:57.786616+00:00'
youtube_status: NOT_FOUND
youtube_search_query: Československá botanická společnost při AV ČR official
youtube_search_timestamp: '2025-12-08T19:49:58.157855+00:00'
location:
region_code: 10
country: CZ
normalization_timestamp: '2025-12-09T20:40:58.889046+00:00'
latitude: 50.08804
longitude: 14.42076
city: Prague
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.889035+00:00'

View file

@ -0,0 +1,164 @@
custodian_name:
claim_type: custodian_name
claim_value: ČSAV - Botanický ústav - úsek ekologie rostlin
source_type: ch_annotator
original_entry:
name: ČSAV - Botanický ústav - úsek ekologie rostlin
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: BOE031
locations: []
processing_timestamp: '2025-12-06T23:37:25.837611+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CBUUER
ghcid_original: XX-XX-XXX-L-ČBÚÚER
ghcid_uuid: ceb02c28-0dec-5d1d-aee9-843abb460aa7
ghcid_uuid_sha256: 27d03f00-17d2-8dd1-be41-bdd369b306d4
ghcid_numeric: 2868862232267226577
record_id: d0f92280-9bec-4289-a47b-9a90766cd98f
generation_timestamp: '2025-12-06T23:37:25.837611+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: NAME_PATTERN_EXTRACTION
resolution_timestamp: '2025-12-06T23:56:13.163034+00:00'
resolution_notes: City 'ČSAV' extracted from institution name
ghcid_history:
- ghcid: CZ-10-PRA-L-CBUUER
ghcid_numeric: 2868862232267226577
valid_from: '2025-12-08T11:21:34.560987+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-ČBÚÚER)'
- ghcid: XX-XX-XXX-L-ČBÚÚER
ghcid_numeric: 8403561736984873728
valid_from: '2025-12-06T23:37:25.837611+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:34.560987+00:00'
- ghcid: CZ-XX-XXX-L-ČBÚÚER
valid_from: '2025-12-06T23:56:13.163043+00:00'
reason: 'Country resolved via Wikidata P17: XX→CZ'
- ghcid: CZ-10-PRA-L-ČBÚÚER
valid_from: '2025-12-07T12:38:22.577753+00:00'
reason: 'Location resolved from institution name pattern: ''ČSAV'' → region 10,
city PRA'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CBUUER
- identifier_scheme: GHCID_UUID
identifier_value: ceb02c28-0dec-5d1d-aee9-843abb460aa7
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 27d03f00-17d2-8dd1-be41-bdd369b306d4
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2868862232267226577'
- identifier_scheme: RECORD_ID
identifier_value: d0f92280-9bec-4289-a47b-9a90766cd98f
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.048230+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'Country resolved 2025-12-06T23:56:13Z: XX→CZ via Wikidata P17'
- 'YouTube/Google Maps enrichment 2025-12-08T19:50:03Z: Maps: rejected by LLM; YouTube:
not found'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.048230+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ČSAV - Botanický ústav - úsek ekologie rostlin
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.048230+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.048230+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: ghcid
claim_value: CZ-BOE031
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.048230+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.837611+00:00'
creation_method: create_custodian_from_ch_annotator.py
google_maps_status: NO_MATCH
google_maps_rejected:
candidate_name: Oddělení vegetační ekologie, Botanický ústav AV ČR
rejection_reason: 'REJECT: TYPE_MISMATCH. The Google Place candidate, ''Oddělení
vegetační ekologie, Botanický ústav AV ČR'', is a department of a scientific research
institute. Although the names and location indicate it is the correct institution,
its function is scientific research, not heritage curation like a museum, library,
or gallery. The Google Place types ''point_of_interest'' and ''establishment''
are too generic and do not support classification as a heritage institution.'
timestamp: '2025-12-08T19:50:02.915435+00:00'
youtube_status: NOT_FOUND
youtube_search_query: ČSAV - Botanický ústav - úsek ekologie rostlin official
youtube_search_timestamp: '2025-12-08T19:50:03.282523+00:00'
location:
region_code: 10
country: CZ
normalization_timestamp: '2025-12-09T20:40:58.926923+00:00'
latitude: 50.08804
longitude: 14.42076
city: Prague
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.926901+00:00'

View file

@ -0,0 +1,163 @@
original_entry:
name: Českomoravský cement, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF505
locations:
- postal_code: 153 02
city: Praha 5 - Radotín
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:16.356857+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CCS
ghcid_original: CZ-HL-PRA-L-ČCS
ghcid_uuid: 3a9e1eed-4461-5523-a050-78eba32bc1c6
ghcid_uuid_sha256: 426cee88-c5b2-8ef0-bcef-963c5ef4d2c5
ghcid_numeric: 4786462775187738352
record_id: c1903e67-931d-4cc0-bc08-b53aa007c3c4
generation_timestamp: '2025-12-06T23:37:16.356857+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CCS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CCS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CCS
ghcid_numeric: 4786462775187738352
valid_from: '2025-12-08T11:21:21.969957+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČCS)'
- ghcid: CZ-HL-PRA-L-ČCS
ghcid_numeric: 5999739163448361722
valid_from: '2025-12-06T23:37:16.356857+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.969957+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Českomoravský cement, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CCS
- identifier_scheme: GHCID_UUID
identifier_value: 3a9e1eed-4461-5523-a050-78eba32bc1c6
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 426cee88-c5b2-8ef0-bcef-963c5ef4d2c5
- identifier_scheme: GHCID_NUMERIC
identifier_value: '4786462775187738352'
- identifier_scheme: RECORD_ID
identifier_value: c1903e67-931d-4cc0-bc08-b53aa007c3c4
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.918721+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918721+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Českomoravský cement, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918721+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918721+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 5 - Radotín
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918721+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF505
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.918721+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.356857+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 153 02
normalization_timestamp: '2025-12-09T20:41:05.794726+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.794713+00:00'

View file

@ -0,0 +1,161 @@
custodian_name:
claim_type: custodian_name
claim_value: Centrální dispečerská organizace propojených energetických soustav
source_type: ch_annotator
original_entry:
name: Centrální dispečerská organizace propojených energetických soustav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE185
locations:
- street_address: Jungmannova 29
postal_code: 111 32
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.370319+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDOPES
ghcid_original: CZ-HL-PRA-L-CDOPES
ghcid_uuid: 93df1df2-b55b-50e1-aa91-2f01d98c37d7
ghcid_uuid_sha256: 0c623fff-4d22-8192-1ce2-ea63343a0f2f
ghcid_numeric: 892346041916084626
record_id: ef77331d-980d-417a-9abe-d32f177a8e14
generation_timestamp: '2025-12-06T23:37:15.370319+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDOPES
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CDOPES
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CDOPES
ghcid_numeric: 892346041916084626
valid_from: '2025-12-06T23:37:15.370319+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDOPES
- identifier_scheme: GHCID_UUID
identifier_value: 93df1df2-b55b-50e1-aa91-2f01d98c37d7
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 0c623fff-4d22-8192-1ce2-ea63343a0f2f
- identifier_scheme: GHCID_NUMERIC
identifier_value: '892346041916084626'
- identifier_scheme: RECORD_ID
identifier_value: ef77331d-980d-417a-9abe-d32f177a8e14
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.908755+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908755+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: Centrální dispečerská organizace propojených energetických soustav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908755+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908755+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908755+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE185
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.908755+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.370319+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 111 32
street_address: Jungmannova 29
normalization_timestamp: '2025-12-09T20:41:05.845268+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.845243+00:00'

View file

@ -0,0 +1,166 @@
custodian_name:
claim_type: custodian_name
claim_value: ČKD DUKLA, a.s.
source_type: ch_annotator
original_entry:
name: ČKD DUKLA, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF021
locations:
- street_address: Českomoravská 21
postal_code: 190 05
city: Praha 9
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.802317+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDS
ghcid_original: CZ-HL-PRA-L-ČDS
ghcid_uuid: 4717b660-a36b-5e93-86a5-bd2a0e2cbaf2
ghcid_uuid_sha256: 941ae1c7-9bb9-8f34-904d-a441c6c32476
ghcid_numeric: 10672090514435149620
record_id: 1d1b204d-f167-46b7-829f-1fbfea2ea1f3
generation_timestamp: '2025-12-06T23:37:15.802317+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDS
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CDS
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CDS
ghcid_numeric: 10672090514435149620
valid_from: '2025-12-08T11:21:26.723638+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDS)'
- ghcid: CZ-HL-PRA-L-ČDS
ghcid_numeric: 8990548551803364279
valid_from: '2025-12-06T23:37:15.802317+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:26.723638+00:00'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDS
- identifier_scheme: GHCID_UUID
identifier_value: 4717b660-a36b-5e93-86a5-bd2a0e2cbaf2
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 941ae1c7-9bb9-8f34-904d-a441c6c32476
- identifier_scheme: GHCID_NUMERIC
identifier_value: '10672090514435149620'
- identifier_scheme: RECORD_ID
identifier_value: 1d1b204d-f167-46b7-829f-1fbfea2ea1f3
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914285+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914285+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ČKD DUKLA, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914285+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914285+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914285+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF021
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914285+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.802317+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 190 05
street_address: Českomoravská 21
normalization_timestamp: '2025-12-09T20:41:05.913228+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.913214+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: ČKD Dopravní systémy - ČKD Lokomotivka, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC002
locations:
- street_address: Českomoravská 205
postal_code: 190 00
city: Praha 9
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.476508+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDSCLS
ghcid_original: CZ-HL-PRA-L-ČDSČLS
ghcid_uuid: 03c112af-44cd-58c6-9d03-6f1c8c533159
ghcid_uuid_sha256: f4f8a01e-c0d3-87e4-ab3b-577f2490c2ec
ghcid_numeric: 17652034793423189988
record_id: 6415bb37-6c79-4475-9db9-1e6d9f3fba0c
generation_timestamp: '2025-12-06T23:37:25.476508+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDSCLS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CDSCLS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CDSCLS
ghcid_numeric: 17652034793423189988
valid_from: '2025-12-08T11:21:39.771994+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSČLS)'
- ghcid: CZ-HL-PRA-L-ČDSČLS
ghcid_numeric: 14907291261378136606
valid_from: '2025-12-06T23:37:25.476508+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:39.771994+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ČKD Dopravní systémy - ČKD Lokomotivka, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDSCLS
- identifier_scheme: GHCID_UUID
identifier_value: 03c112af-44cd-58c6-9d03-6f1c8c533159
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: f4f8a01e-c0d3-87e4-ab3b-577f2490c2ec
- identifier_scheme: GHCID_NUMERIC
identifier_value: '17652034793423189988'
- identifier_scheme: RECORD_ID
identifier_value: 6415bb37-6c79-4475-9db9-1e6d9f3fba0c
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.045814+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.045814+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ČKD Dopravní systémy - ČKD Lokomotivka, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.045814+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.045814+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.045814+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABC002
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.045814+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.476508+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 190 00
street_address: Českomoravská 205
normalization_timestamp: '2025-12-09T20:41:05.964036+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:05.964008+00:00'

View file

@ -0,0 +1,166 @@
custodian_name:
claim_type: custodian_name
claim_value: ČKD Dopravní systémy - ČKD Trakce, a.s.
source_type: ch_annotator
original_entry:
name: ČKD Dopravní systémy - ČKD Trakce, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF026
locations:
- street_address: Kolbenova 40
postal_code: 190 00
city: Praha 9
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.762860+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDSCTS
ghcid_original: CZ-HL-PRA-L-ČDSČTS
ghcid_uuid: 939b075a-63fe-5905-951a-74d436d37d64
ghcid_uuid_sha256: 6f504902-f4f7-83b0-8cd9-7ac2b03daa22
ghcid_numeric: 8020991213395477424
record_id: e25f2850-1721-40b3-ab48-c0631658ca67
generation_timestamp: '2025-12-06T23:37:25.762860+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDSCTS
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CDSCTS
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CDSCTS
ghcid_numeric: 8020991213395477424
valid_from: '2025-12-08T11:21:33.916666+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSČTS)'
- ghcid: CZ-HL-PRA-L-ČDSČTS
ghcid_numeric: 10608205474449462001
valid_from: '2025-12-06T23:37:25.762860+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:33.916666+00:00'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDSCTS
- identifier_scheme: GHCID_UUID
identifier_value: 939b075a-63fe-5905-951a-74d436d37d64
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 6f504902-f4f7-83b0-8cd9-7ac2b03daa22
- identifier_scheme: GHCID_NUMERIC
identifier_value: '8020991213395477424'
- identifier_scheme: RECORD_ID
identifier_value: e25f2850-1721-40b3-ab48-c0631658ca67
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.047807+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047807+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ČKD Dopravní systémy - ČKD Trakce, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047807+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047807+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047807+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF026
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047807+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.762860+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 190 00
street_address: Kolbenova 40
normalization_timestamp: '2025-12-09T20:41:06.020234+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.020205+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: ČKD Dopravní systémy, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF025
locations:
- street_address: Chabská 6
postal_code: 155 41
city: Praha 5 - Zličín
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.811932+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDSS
ghcid_original: CZ-HL-PRA-L-ČDSS
ghcid_uuid: c8a463f0-4628-5712-a4d2-03fbb449c631
ghcid_uuid_sha256: 34e3d3cc-a477-8d52-892a-988fadd5ba4b
ghcid_numeric: 3811122585571683666
record_id: 547d3bb1-b8b1-4574-821f-0bcb62163961
generation_timestamp: '2025-12-06T23:37:15.811932+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDSS
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CDSS
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CDSS
ghcid_numeric: 3811122585571683666
valid_from: '2025-12-08T11:21:21.353542+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSS)'
- ghcid: CZ-HL-PRA-L-ČDSS
ghcid_numeric: 13591157510468734875
valid_from: '2025-12-06T23:37:15.811932+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.353542+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ČKD Dopravní systémy, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDSS
- identifier_scheme: GHCID_UUID
identifier_value: c8a463f0-4628-5712-a4d2-03fbb449c631
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 34e3d3cc-a477-8d52-892a-988fadd5ba4b
- identifier_scheme: GHCID_NUMERIC
identifier_value: '3811122585571683666'
- identifier_scheme: RECORD_ID
identifier_value: 547d3bb1-b8b1-4574-821f-0bcb62163961
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914362+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914362+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ČKD Dopravní systémy, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914362+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914362+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 5 - Zličín
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914362+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF025
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914362+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.811932+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 155 41
street_address: Chabská 6
normalization_timestamp: '2025-12-09T20:41:06.063750+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.063728+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: České dráhy - Správa výstavby železnic Praha - odbor koncepce a výhledu
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE103
locations:
- street_address: Italská 45
postal_code: 121 31
city: Praha 2
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.256615+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDSVZPOKV
ghcid_original: CZ-HL-PRA-L-ČDSVŽPOKV
ghcid_uuid: d184e724-ad62-5b75-a412-96575f9db4d6
ghcid_uuid_sha256: df9f1b2b-ea42-8e8c-bfae-51de5d291665
ghcid_numeric: 16113627867182718604
record_id: 002b54b4-5382-4745-bc71-fc9577a3d54f
generation_timestamp: '2025-12-06T23:37:15.256615+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDSVZPOKV
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CDSVZPOKV
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CDSVZPOKV
ghcid_numeric: 16113627867182718604
valid_from: '2025-12-08T11:21:26.472989+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČDSVŽPOKV)'
- ghcid: CZ-HL-PRA-L-ČDSVŽPOKV
ghcid_numeric: 12616989030595101585
valid_from: '2025-12-06T23:37:15.256615+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:26.472989+00:00'
custodian_name:
claim_type: custodian_name
claim_value: České dráhy - Správa výstavby železnic Praha - odbor koncepce a výhledu
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDSVZPOKV
- identifier_scheme: GHCID_UUID
identifier_value: d184e724-ad62-5b75-a412-96575f9db4d6
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: df9f1b2b-ea42-8e8c-bfae-51de5d291665
- identifier_scheme: GHCID_NUMERIC
identifier_value: '16113627867182718604'
- identifier_scheme: RECORD_ID
identifier_value: 002b54b4-5382-4745-bc71-fc9577a3d54f
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.907676+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.907676+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: České dráhy - Správa výstavby železnic Praha - odbor koncepce a výhledu
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.907676+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.907676+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.907676+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABE103
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.907676+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.256615+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 121 31
street_address: Italská 45
normalization_timestamp: '2025-12-09T20:41:06.110303+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.110275+00:00'

View file

@ -0,0 +1,258 @@
original_entry:
name: Cesta domů, z.ú. - Knihovna Cesty domů
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE288
- &id002
identifier_scheme: IČO
identifier_value: '26528843'
- &id003
identifier_scheme: DIČ
identifier_value: CZ26528843
- &id004
identifier_scheme: Website
identifier_value: https://raks.mlp.cz/library/cestydomu?device=1308
identifier_url: https://raks.mlp.cz/library/cestydomu?device=1308
- &id005
identifier_scheme: Wikidata
identifier_value: Q114619385
identifier_url: https://www.wikidata.org/wiki/Q114619385
locations:
- street_address: Heleny Kočvarové 1
postal_code: 140 00
city: Praha 4
region: Hlavní město Praha
country: CZ
latitude: 50.053757
longitude: 14.454272
processing_timestamp: '2025-12-06T23:37:26.110340+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CDZUKCD
ghcid_original: CZ-HL-PRA-L-CDZÚKCD
ghcid_uuid: 7d9842ae-57b4-5afc-ac24-4e92b583320e
ghcid_uuid_sha256: b0479407-ba06-8eff-8765-3748b7af304a
ghcid_numeric: 12702284034928430847
record_id: 569a8cc3-8623-4865-b078-f5f358b8dac8
generation_timestamp: '2025-12-06T23:37:26.110340+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CDZUKCD
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CDZUKCD
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CDZUKCD
ghcid_numeric: 12702284034928430847
valid_from: '2025-12-08T11:21:37.155204+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-CDZÚKCD)'
- ghcid: CZ-HL-PRA-L-CDZÚKCD
ghcid_numeric: 1611998794903873172
valid_from: '2025-12-06T23:37:26.110340+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:37.155204+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Cesta domů, z.ú. - Knihovna Cesty domů
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CDZUKCD
- identifier_scheme: GHCID_UUID
identifier_value: 7d9842ae-57b4-5afc-ac24-4e92b583320e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b0479407-ba06-8eff-8765-3748b7af304a
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12702284034928430847'
- identifier_scheme: RECORD_ID
identifier_value: 569a8cc3-8623-4865-b078-f5f358b8dac8
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.050046+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:12Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-uyxljZX/ehXA7EicsBnW5SpiqATAabtCWAOUuIFLxJo=
verified_at: '2025-12-28T19:58:23.516984+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619385
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.517020+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.050046+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Cesta domů, z.ú. - Knihovna Cesty domů
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.050046+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.050046+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 4
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.050046+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114619385
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114619385
timestamp: '2025-11-19T10:14:30.050046+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE288
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.050046+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:26.110340+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114619385
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:41.458852+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
- P463
wikidata_labels:
cs: Cesta domů, z.ú., Knihovna Cesty domů
en: Home Care Hospice Cesta domů and Advisory Centre, Library
wikidata_label_en: Home Care Hospice Cesta domů and Advisory Centre, Library
wikidata_label_cs: Cesta domů, z.ú., Knihovna Cesty domů
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id006
- id: Q6150991
label: medical library
description: library that focuses on medical or health information
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q2686587
label: Prague 4
description: municipal part of Prague
wikidata_country: *id007
wikidata_organization:
member_of:
- id: Q4809890
label: Association of Library and Information Professionals of the Czech Republic
description: organization
wikidata_contact:
email: mailto:knihovna@cestadomu.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-uyxljZX/ehXA7EicsBnW5SpiqATAabtCWAOUuIFLxJo=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.516984+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114619385
verification:
status: verified
last_verified: '2025-12-28T19:58:23.517010+00:00'
location:
latitude: 50.053757
longitude: 14.454272
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 4
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 140 00
street_address: Heleny Kočvarové 1
normalization_timestamp: '2025-12-09T06:50:48.166008+00:00'

View file

@ -0,0 +1,266 @@
original_entry:
name: CERGE-EI knihovna Jana Kmenty
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB005
- &id002
identifier_scheme: IČO
identifier_value: 00216208
- &id003
identifier_scheme: DIČ
identifier_value: CZ00216208
- &id004
identifier_scheme: Website
identifier_value: https://www.cerge-ei.cz/cs/knihovna
identifier_url: https://www.cerge-ei.cz/cs/knihovna
- &id005
identifier_scheme: Wikidata
identifier_value: Q114617537
identifier_url: https://www.wikidata.org/wiki/Q114617537
locations:
- street_address: Politických vězňů 7
postal_code: 111 21
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.083233
longitude: 14.429039
processing_timestamp: '2025-12-06T23:35:06.269914+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CEKJK
ghcid_original: CZ-HL-PRA-L-CEKJK
ghcid_uuid: 7c1d5854-eedc-5e41-830d-5c1a3f22bee1
ghcid_uuid_sha256: b3925072-f548-81b2-f37c-a560d63bf363
ghcid_numeric: 12939493134060507570
record_id: 2d687a48-630e-4b43-8179-1027f3fe1a8b
generation_timestamp: '2025-12-06T23:35:06.269914+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CEKJK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CEKJK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CEKJK
ghcid_numeric: 12939493134060507570
valid_from: '2025-12-06T23:35:06.269914+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
custodian_name:
claim_type: custodian_name
claim_value: CERGE-EI knihovna Jana Kmenty
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CEKJK
- identifier_scheme: GHCID_UUID
identifier_value: 7c1d5854-eedc-5e41-830d-5c1a3f22bee1
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b3925072-f548-81b2-f37c-a560d63bf363
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12939493134060507570'
- identifier_scheme: RECORD_ID
identifier_value: 2d687a48-630e-4b43-8179-1027f3fe1a8b
- *id001
- *id002
- *id003
- *id004
- *id005
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.796972+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-8mQ/wjiQEl+5oUWSFyMM+Cr/yQYAW/e5lI7abSoyAXU=
verified_at: '2025-12-28T19:58:23.537241+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617537
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.537260+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796972+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: CERGE-EI knihovna Jana Kmenty
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796972+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796972+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796972+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617537
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617537
timestamp: '2025-11-19T10:14:29.796972+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABB005
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.796972+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:35:06.269914+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617537
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:49.559323+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: CERGE-EI knihovna Jana Kmenty
en: Jan Kmenta CERGE-EI Library
wikidata_label_en: Jan Kmenta CERGE-EI Library
wikidata_label_cs: CERGE-EI knihovna Jana Kmenty
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id006
- id: Q1438040
label: research library
description: library that collects materials on one or more subjects to support scholarly or scientific research
wikidata_instance_of: *id006
wikidata_location:
country: &id007
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id007
wikidata_contact:
email: mailto:library@cerge-ei.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-8mQ/wjiQEl+5oUWSFyMM+Cr/yQYAW/e5lI7abSoyAXU=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.537241+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617537
verification:
status: verified
last_verified: '2025-12-28T19:58:23.537250+00:00'
location:
latitude: 50.083233
longitude: 14.429039
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 111 21
street_address: Politických vězňů 7
normalization_timestamp: '2025-12-09T06:50:48.191069+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T22:00:01.805561+00:00'
source_url: https://www.cerge-ei.cz/cs/knihovna
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://www.cerge-ei.cz/favicon.ico
source_url: https://www.cerge-ei.cz/cs/knihovna
css_selector: '[document] > html > head > link:nth-of-type(3)'
retrieved_on: '2025-12-23T22:00:01.805561+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/vnd.microsoft.icon
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,165 @@
original_entry:
name: ČKD Elektrotechnika, a.s.
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF023
locations:
- street_address: U Kolbenky 159
postal_code: 190 00
city: Praha 9
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.760334+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CES
ghcid_original: CZ-HL-PRA-L-ČES
ghcid_uuid: 3008a3a2-1521-5246-a2cd-e7329405be87
ghcid_uuid_sha256: faa39735-7625-842a-8573-45cc88b567d4
ghcid_numeric: 18060445186557060138
record_id: 94c290d6-fca3-4ebd-b390-c83112c6721d
generation_timestamp: '2025-12-06T23:37:25.760334+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CES
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CES
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CES
ghcid_numeric: 18060445186557060138
valid_from: '2025-12-08T11:21:23.607373+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČES)'
- ghcid: CZ-HL-PRA-L-ČES
ghcid_numeric: 1379789753405388328
valid_from: '2025-12-06T23:37:25.760334+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:23.607373+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ČKD Elektrotechnika, a.s.
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CES
- identifier_scheme: GHCID_UUID
identifier_value: 3008a3a2-1521-5246-a2cd-e7329405be87
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: faa39735-7625-842a-8573-45cc88b567d4
- identifier_scheme: GHCID_NUMERIC
identifier_value: '18060445186557060138'
- identifier_scheme: RECORD_ID
identifier_value: 94c290d6-fca3-4ebd-b390-c83112c6721d
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.047788+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047788+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ČKD Elektrotechnika, a.s.
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047788+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047788+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 9
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047788+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF023
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.047788+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.760334+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 190 00
street_address: U Kolbenky 159
normalization_timestamp: '2025-12-09T20:41:06.157711+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.157684+00:00'

View file

@ -0,0 +1,166 @@
custodian_name:
claim_type: custodian_name
claim_value: ČSAV - Entomologický ústav
source_type: ch_annotator
original_entry:
name: ČSAV - Entomologický ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB008
locations:
- street_address: Mánesova 55
postal_code: 120 00
city: Praha 2
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.516148+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CEU
ghcid_original: CZ-HL-PRA-L-ČEÚ
ghcid_uuid: 504431ec-ac7b-5800-a430-bd4bd2d6cf07
ghcid_uuid_sha256: 0354bb33-d832-8ba5-a76e-4e3210db321c
ghcid_numeric: 240022511502416805
record_id: db702a6d-f686-4857-b3df-0462837fd430
generation_timestamp: '2025-12-06T23:37:25.516148+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CEU
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CEU
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CEU
ghcid_numeric: 240022511502416805
valid_from: '2025-12-08T11:21:23.566900+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČEÚ)'
- ghcid: CZ-HL-PRA-L-ČEÚ
ghcid_numeric: 10740319947005259027
valid_from: '2025-12-06T23:37:25.516148+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:23.566900+00:00'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CEU
- identifier_scheme: GHCID_UUID
identifier_value: 504431ec-ac7b-5800-a430-bd4bd2d6cf07
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 0354bb33-d832-8ba5-a76e-4e3210db321c
- identifier_scheme: GHCID_NUMERIC
identifier_value: '240022511502416805'
- identifier_scheme: RECORD_ID
identifier_value: db702a6d-f686-4857-b3df-0462837fd430
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046042+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046042+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ČSAV - Entomologický ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046042+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046042+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 2
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046042+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB008
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046042+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.516148+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 120 00
street_address: Mánesova 55
normalization_timestamp: '2025-12-09T20:41:06.197560+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.197537+00:00'

View file

@ -0,0 +1,163 @@
original_entry:
name: ČSAV - Ekonomický ústav - Ekonomicko-matematická laboratoř
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB006
locations:
- postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.500404+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CEUEML
ghcid_original: CZ-HL-PRA-L-ČEÚEML
ghcid_uuid: b3ecd9fb-08cb-5b6d-9620-01ecd8ec398e
ghcid_uuid_sha256: 8e609de9-9345-8dd4-a723-695e6ce433dd
ghcid_numeric: 10259373577673739732
record_id: 11c7b490-8879-47e4-bdc8-4c3f778228c1
generation_timestamp: '2025-12-06T23:37:25.500404+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CEUEML
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CEUEML
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CEUEML
ghcid_numeric: 10259373577673739732
valid_from: '2025-12-08T11:21:41.451434+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČEÚEML)'
- ghcid: CZ-HL-PRA-L-ČEÚEML
ghcid_numeric: 1512922320452027432
valid_from: '2025-12-06T23:37:25.500404+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:41.451434+00:00'
custodian_name:
claim_type: custodian_name
claim_value: ČSAV - Ekonomický ústav - Ekonomicko-matematická laboratoř
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CEUEML
- identifier_scheme: GHCID_UUID
identifier_value: b3ecd9fb-08cb-5b6d-9620-01ecd8ec398e
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 8e609de9-9345-8dd4-a723-695e6ce433dd
- identifier_scheme: GHCID_NUMERIC
identifier_value: '10259373577673739732'
- identifier_scheme: RECORD_ID
identifier_value: 11c7b490-8879-47e4-bdc8-4c3f778228c1
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046001+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046001+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ČSAV - Ekonomický ústav - Ekonomicko-matematická laboratoř
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046001+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046001+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046001+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB006
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046001+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.500404+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
normalization_timestamp: '2025-12-09T20:41:06.242627+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.242612+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Československý filmový ústav
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC117
locations:
- street_address: Národní 40
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.611839+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CFU
ghcid_original: CZ-HL-PRA-L-ČFÚ
ghcid_uuid: ecf730c1-9b99-5072-9d7d-db744bc75a6d
ghcid_uuid_sha256: 0fbc3c26-a026-88c1-beae-a8f2a427a7d7
ghcid_numeric: 1133847342783834305
record_id: d88ba827-90e8-4d79-8758-6f99098470a1
generation_timestamp: '2025-12-06T23:37:25.611839+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CFU
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CFU
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CFU
ghcid_numeric: 1133847342783834305
valid_from: '2025-12-08T11:21:31.115869+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČFÚ)'
- ghcid: CZ-HL-PRA-L-ČFÚ
ghcid_numeric: 14053249839522649669
valid_from: '2025-12-06T23:37:25.611839+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:31.115869+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Československý filmový ústav
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CFU
- identifier_scheme: GHCID_UUID
identifier_value: ecf730c1-9b99-5072-9d7d-db744bc75a6d
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 0fbc3c26-a026-88c1-beae-a8f2a427a7d7
- identifier_scheme: GHCID_NUMERIC
identifier_value: '1133847342783834305'
- identifier_scheme: RECORD_ID
identifier_value: d88ba827-90e8-4d79-8758-6f99098470a1
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046736+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046736+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Československý filmový ústav
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046736+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046736+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046736+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABC117
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046736+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.611839+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Národní 40
normalization_timestamp: '2025-12-09T20:41:06.286052+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.286029+00:00'

View file

@ -0,0 +1,276 @@
original_entry:
name: CEFRES - Francouzský ústav pro výzkum ve společenských vědách
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC163
- &id002
identifier_scheme: IČO
identifier_value: neuvedeno
- &id003
identifier_scheme: Website
identifier_value: https://cefres.kpsys.cz/#!/
identifier_url: https://cefres.kpsys.cz/#!/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114617627
identifier_url: https://www.wikidata.org/wiki/Q114617627
locations:
- street_address: Na Florenci 3
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.088931
longitude: 14.433407
processing_timestamp: '2025-12-06T23:37:14.629294+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CFUPVVSV
ghcid_original: CZ-HL-PRA-L-CFÚPVVSV
ghcid_uuid: d4761e5f-b824-5577-9836-cccfeab220cc
ghcid_uuid_sha256: 215b886d-e14d-839c-88e9-1c5dc4a47909
ghcid_numeric: 2403664831645053852
record_id: 75be2349-64a5-400a-b210-15b115273345
generation_timestamp: '2025-12-06T23:37:14.629294+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CFUPVVSV
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CFUPVVSV
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CFUPVVSV
ghcid_numeric: 2403664831645053852
valid_from: '2025-12-08T11:21:29.529071+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-CFÚPVVSV)'
- ghcid: CZ-HL-PRA-L-CFÚPVVSV
ghcid_numeric: 1416280962864530885
valid_from: '2025-12-06T23:37:14.629294+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:29.529071+00:00'
custodian_name:
claim_type: custodian_name
claim_value: CEFRES - Francouzský ústav pro výzkum ve společenských vědách
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CFUPVVSV
- identifier_scheme: GHCID_UUID
identifier_value: d4761e5f-b824-5577-9836-cccfeab220cc
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 215b886d-e14d-839c-88e9-1c5dc4a47909
- identifier_scheme: GHCID_NUMERIC
identifier_value: '2403664831645053852'
- identifier_scheme: RECORD_ID
identifier_value: 75be2349-64a5-400a-b210-15b115273345
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.803999+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-6nroyhIMUTL56XzH1D+9ek2r604EZrx9EOMKi2yRxSo=
verified_at: '2025-12-28T19:58:23.597078+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617627
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.597094+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803999+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: CEFRES - Francouzský ústav pro výzkum ve společenských vědách
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803999+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803999+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803999+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617627
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617627
timestamp: '2025-11-19T10:14:29.803999+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABC163
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.803999+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.629294+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617627
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:27:57.590410+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: CEFRES, Francouzský ústav pro výzkum ve společenských vědách, knihovna
fr: bibliothèque du CEFRES
mul: Bibliothèque du CEFRES
wikidata_label_fr: bibliothèque du CEFRES
wikidata_label_cs: CEFRES, Francouzský ústav pro výzkum ve společenských vědách, knihovna
wikidata_descriptions:
cs: knihovna v Praze
fr: bibliothèque scientifique à Prague
en: library in Prague
wikidata_description_en: library in Prague
wikidata_aliases:
fr:
- bibliothèque du Centre français de recherche en sciences sociales
mul:
- Bibliothèque du Centre français de recherche en sciences sociales
wikidata_classification:
instance_of: &id005
- id: Q1438040
label: research library
description: library that collects materials on one or more subjects to support scholarly or scientific research
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:claire@cefres.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-6nroyhIMUTL56XzH1D+9ek2r604EZrx9EOMKi2yRxSo=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.597078+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617627
verification:
status: verified
last_verified: '2025-12-28T19:58:23.597085+00:00'
location:
latitude: 50.088931
longitude: 14.433407
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Na Florenci 3
normalization_timestamp: '2025-12-09T06:50:48.297317+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T22:00:16.423383+00:00'
source_url: https://cefres.kpsys.cz/#!
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://cefres.kpsys.cz/favicon.png?v=2.3.0-32021
source_url: https://cefres.kpsys.cz/#!
css_selector: '#ng-app > head > link:nth-of-type(9)'
retrieved_on: '2025-12-23T22:00:16.423383+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: image/png
favicon_sizes: 256x256
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 1

View file

@ -0,0 +1,267 @@
original_entry:
name: Česká geologická služba - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABC023
- &id002
identifier_scheme: IČO
identifier_value: 00025798
- &id003
identifier_scheme: Website
identifier_value: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
identifier_url: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
- &id004
identifier_scheme: Wikidata
identifier_value: Q114617601
identifier_url: https://www.wikidata.org/wiki/Q114617601
locations:
- street_address: Klárov 3/131
postal_code: 118 21
city: Praha 1
region: Hlavní město Praha
country: CZ
latitude: 50.091722
longitude: 14.409786
processing_timestamp: '2025-12-06T23:37:14.421598+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CGSK
ghcid_original: CZ-HL-PRA-L-ČGSK
ghcid_uuid: 48ef0c1d-6c9e-51e7-94e0-c2d1d898ea88
ghcid_uuid_sha256: d8231fa2-9b99-8b7d-aa55-30380a785fe6
ghcid_numeric: 15574326719633001341
record_id: bd17fa8c-fd47-46f5-98be-66c76dedb878
generation_timestamp: '2025-12-06T23:37:14.421598+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CGSK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CGSK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CGSK
ghcid_numeric: 15574326719633001341
valid_from: '2025-12-08T11:21:30.084883+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČGSK)'
- ghcid: CZ-HL-PRA-L-ČGSK
ghcid_numeric: 10769415008464129773
valid_from: '2025-12-06T23:37:14.421598+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:30.084883+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Česká geologická služba - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CGSK
- identifier_scheme: GHCID_UUID
identifier_value: 48ef0c1d-6c9e-51e7-94e0-c2d1d898ea88
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: d8231fa2-9b99-8b7d-aa55-30380a785fe6
- identifier_scheme: GHCID_NUMERIC
identifier_value: '15574326719633001341'
- identifier_scheme: RECORD_ID
identifier_value: bd17fa8c-fd47-46f5-98be-66c76dedb878
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.801378+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-oj3JlTi5DQWD5zLxOMay+soc3UFUP0XsdO0Fu8qh3Eg=
verified_at: '2025-12-28T19:58:23.618005+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617601
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.618025+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.801378+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Česká geologická služba - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.801378+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.801378+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.801378+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617601
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617601
timestamp: '2025-11-19T10:14:29.801378+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABC023
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.801378+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:14.421598+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617601
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:28:05.533308+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Česká geologická služba, knihovna
en: Czech Geological Survey
wikidata_label_en: Czech Geological Survey
wikidata_label_cs: Česká geologická služba, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_classification:
instance_of: &id005
- id: Q1438040
label: research library
description: library that collects materials on one or more subjects to support scholarly or scientific research
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q973974
label: Prague 1
description: administrative district, municipal district and municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:knihovna@geology.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-oj3JlTi5DQWD5zLxOMay+soc3UFUP0XsdO0Fu8qh3Eg=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.618005+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617601
verification:
status: verified
last_verified: '2025-12-28T19:58:23.618015+00:00'
location:
latitude: 50.091722
longitude: 14.409786
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 1
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 118 21
street_address: Klárov 3/131
normalization_timestamp: '2025-12-09T06:50:48.318122+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T22:00:25.647871+00:00'
source_url: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
extraction_method: crawl4ai
claims:
- claim_type: favicon_url
claim_value: https://cgs.gov.cz/themes/star_cgs/assets/img/favicon/safari-pinned-tab.svg
source_url: https://cgs.gov.cz/knihovna-archivy-sbirky/knihovna
css_selector: '[document] > html.js > head > link:nth-of-type(9)'
retrieved_on: '2025-12-23T22:00:25.647871+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: ''
summary:
total_claims: 1
has_primary_logo: false
has_favicon: true
has_og_image: false
favicon_count: 4

View file

@ -0,0 +1,152 @@
original_entry:
name: ČSAV - Geografický ústav - Mapová sbírka
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB014
locations: []
processing_timestamp: '2025-12-06T23:37:25.519173+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CGUMS
ghcid_original: XX-XX-XXX-L-ČGÚMS
ghcid_uuid: 59693ad2-a909-598f-aeb0-4676e4c92ac9
ghcid_uuid_sha256: b3d06caa-9df4-8f35-87e4-f49890fe973e
ghcid_numeric: 12956975607995236149
record_id: 6b124526-af21-4124-8ca8-0ca145b8a250
generation_timestamp: '2025-12-06T23:37:25.519173+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: NAME_PATTERN_EXTRACTION
resolution_timestamp: '2025-12-06T23:56:13.377356+00:00'
resolution_notes: City 'ČSAV' extracted from institution name
ghcid_history:
- ghcid: CZ-10-PRA-L-CGUMS
ghcid_numeric: 12956975607995236149
valid_from: '2025-12-08T11:21:21.983376+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-10-PRA-L-ČGÚMS)'
- ghcid: XX-XX-XXX-L-ČGÚMS
ghcid_numeric: 12855336241998218597
valid_from: '2025-12-06T23:37:25.519173+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:21.983376+00:00'
- ghcid: CZ-XX-XXX-L-ČGÚMS
valid_from: '2025-12-06T23:56:13.377367+00:00'
reason: 'Country resolved via Wikidata P17: XX→CZ'
- ghcid: CZ-10-PRA-L-ČGÚMS
valid_from: '2025-12-07T12:38:22.578582+00:00'
reason: 'Location resolved from institution name pattern: ''ČSAV'' → region 10, city PRA'
custodian_name:
claim_type: custodian_name
claim_value: ČSAV - Geografický ústav - Mapová sbírka
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CGUMS
- identifier_scheme: GHCID_UUID
identifier_value: 59693ad2-a909-598f-aeb0-4676e4c92ac9
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: b3d06caa-9df4-8f35-87e4-f49890fe973e
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12956975607995236149'
- identifier_scheme: RECORD_ID
identifier_value: 6b124526-af21-4124-8ca8-0ca145b8a250
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046059+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- 'Country resolved 2025-12-06T23:56:13Z: XX→CZ via Wikidata P17'
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:19Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:48:05Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:49:29Z
- 'YouTube/Google Maps enrichment 2025-12-09T09:28:27Z: YouTube: not found'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046059+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: ČSAV - Geografický ústav - Mapová sbírka
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046059+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046059+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: ghcid
claim_value: CZ-ABB014
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046059+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.519173+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
region_code: 10
country: CZ
normalization_timestamp: '2025-12-09T20:40:58.958925+00:00'
latitude: 50.08804
longitude: 14.42076
city: Prague
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:40:58.958906+00:00'
youtube_status: NOT_FOUND
youtube_search_query: ČSAV - Geografický ústav - Mapová sbírka official
youtube_search_timestamp: '2025-12-09T09:28:27.414759+00:00'

View file

@ -0,0 +1,284 @@
original_entry:
name: Český helsinský výbor - Dokumentační a informační středisko - Knihovna
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABE176
- &id002
identifier_scheme: IČO
identifier_value: 00539708
- &id003
identifier_scheme: Website
identifier_value: https://helcom.cz/
identifier_url: https://helcom.cz/
- &id004
identifier_scheme: Wikidata
identifier_value: Q114617893
identifier_url: https://www.wikidata.org/wiki/Q114617893
locations:
- street_address: Štefánikova 216 (budova Úřadu práce MČ P-5)
postal_code: 150 00
city: Praha 5
region: Hlavní město Praha
country: CZ
latitude: 50.074003
longitude: 14.403544
processing_timestamp: '2025-12-06T23:37:16.685316+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CHVDISK
ghcid_original: CZ-HL-PRA-L-ČHVDISK
ghcid_uuid: 43a6b403-88de-5b82-bd5a-e4f57a21499f
ghcid_uuid_sha256: ad1d63b6-02f3-85bd-a40f-7937f5633624
ghcid_numeric: 12474236176270820797
record_id: 7b2f3e1f-bcaa-48bf-bbf3-4581fc90812f
generation_timestamp: '2025-12-06T23:37:16.685316+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CHVDISK
valid_from: '2025-12-10T09:46:58Z'
valid_to:
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CHVDISK
valid_from:
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CHVDISK
ghcid_numeric: 12474236176270820797
valid_from: '2025-12-08T11:21:29.433582+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČHVDISK)'
- ghcid: CZ-HL-PRA-L-ČHVDISK
ghcid_numeric: 16044111670572416104
valid_from: '2025-12-06T23:37:16.685316+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:29.433582+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Český helsinský výbor - Dokumentační a informační středisko - Knihovna
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CHVDISK
- identifier_scheme: GHCID_UUID
identifier_value: 43a6b403-88de-5b82-bd5a-e4f57a21499f
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: ad1d63b6-02f3-85bd-a40f-7937f5633624
- identifier_scheme: GHCID_NUMERIC
identifier_value: '12474236176270820797'
- identifier_scheme: RECORD_ID
identifier_value: 7b2f3e1f-bcaa-48bf-bbf3-4581fc90812f
- *id001
- *id002
- *id003
- *id004
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.924463+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
schema_version: 2.0.0
enrichment_provenance:
wikidata_enrichment:
content_hash: sha256-Y14TjAbdy6m5GqNpEVxY2uzIEqVxK6JjuvP/gG/ogFY=
verified_at: '2025-12-28T19:58:23.649479+00:00'
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617893
provenance_schema_version: '2.0'
standards_compliance:
- W3C PROV-O
- W3C SRI (content hashes)
generated_at: '2025-12-28T19:58:23.649498+00:00'
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.924463+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Český helsinský výbor - Dokumentační a informační středisko - Knihovna
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.924463+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.924463+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 5
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.924463+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: wikidata_id
claim_value: Q114617893
property_uri: owl:sameAs
provenance:
namespace: wikidata
path: /entity/Q114617893
timestamp: '2025-11-19T10:14:29.924463+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.98
- claim_type: ghcid
claim_value: CZ-ABE176
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.924463+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:16.685316+00:00'
creation_method: create_custodian_from_ch_annotator.py
wikidata_enrichment:
wikidata_entity_id: Q114617893
api_metadata:
api_endpoint: https://www.wikidata.org/w/rest.php/wikibase/v1
fetch_timestamp: '2025-12-18T22:28:13.493073+00:00'
user_agent: GLAMDataExtractor/1.2 (glam-data@example.com) Python/httpx
enrichment_version: 2.1_generic
properties_found:
- P31
- P17
- P159
- P968
wikidata_labels:
cs: Český helsinský výbor, Dokumentační a informační středisko, knihovna
en: Czech Helsinki Committee, Documentation and Information Centre, Library
wikidata_label_en: Czech Helsinki Committee, Documentation and Information Centre, Library
wikidata_label_cs: Český helsinský výbor, Dokumentační a informační středisko, knihovna
wikidata_descriptions:
cs: knihovna v Praze
wikidata_aliases:
cs:
- DIS ČHV
wikidata_classification:
instance_of: &id005
- id: Q385994
label: special library
description: specialized library providing resources on a particular topic or discipline
wikidata_instance_of: *id005
wikidata_location:
country: &id006
id: Q213
label: Czech Republic
description: country in Central Europe
headquarters_location:
id: Q577602
label: Prague 5
description: municipal part of Prague
wikidata_country: *id006
wikidata_contact:
email: mailto:info@helcom.cz
_provenance:
content_hash:
algorithm: sha256
value: sha256-Y14TjAbdy6m5GqNpEVxY2uzIEqVxK6JjuvP/gG/ogFY=
scope: enrichment_section
computed_at: '2025-12-28T19:58:23.649479+00:00'
prov:
wasGeneratedBy:
'@type': prov:Activity
name: wikidata_api_fetch
used: https://www.wikidata.org/w/rest.php/wikibase/v1
wasDerivedFrom: https://www.wikidata.org/wiki/Q114617893
verification:
status: verified
last_verified: '2025-12-28T19:58:23.649489+00:00'
location:
latitude: 50.074003
longitude: 14.403544
coordinate_provenance:
source_type: ORIGINAL_ENTRY
source_path: original_entry.locations[0]
city: Praha 5
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 150 00
street_address: Štefánikova 216 (budova Úřadu práce MČ P-5)
normalization_timestamp: '2025-12-09T06:50:48.344961+00:00'
logo_enrichment:
enrichment_timestamp: '2025-12-23T22:00:34.684584+00:00'
source_url: https://helcom.cz
extraction_method: crawl4ai
claims:
- claim_type: logo_url
claim_value: https://helcom.cz/wp-content/uploads/2022/12/Logo_CHV_motto_cz.svg
source_url: https://helcom.cz
css_selector: '#et-boc > header.et-l.et-l--header > div.et_builder_inner_content.et_pb_gutters3 > div.et_pb_section.et_pb_section_0_tb_header > div.et_pb_with_border.et_pb_row > div.et_pb_column.et_pb_column_1_3 > div.et_pb_module.et_pb_image > a > span.et_pb_image_wrap > img.wp-image-111102'
retrieved_on: '2025-12-23T22:00:34.684584+00:00'
extraction_method: crawl4ai_header_logo
detection_confidence: high
alt_text: flexile-white-logo
- claim_type: favicon_url
claim_value: https://helcom.cz/wp-content/uploads/2022/09/favicon-1.png
source_url: https://helcom.cz
css_selector: '[document] > html.js > head > link:nth-of-type(19)'
retrieved_on: '2025-12-23T22:00:34.684584+00:00'
extraction_method: crawl4ai_link_rel
favicon_type: ''
favicon_sizes: 32x32
- claim_type: og_image_url
claim_value: https://helcom.cz/wp-content/uploads/2022/08/Logo_CHV_cz.jpg
source_url: https://helcom.cz
css_selector: '[document] > html.js > head > meta:nth-of-type(12)'
retrieved_on: '2025-12-23T22:00:34.684584+00:00'
extraction_method: crawl4ai_meta_og
summary:
total_claims: 3
has_primary_logo: true
has_favicon: true
has_og_image: true
favicon_count: 1

View file

@ -0,0 +1,166 @@
custodian_name:
claim_type: custodian_name
claim_value: ČSAV - Kabinet cizích jazyků
source_type: ch_annotator
original_entry:
name: ČSAV - Kabinet cizích jazyků
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABB021
locations:
- street_address: Valentinská 1
postal_code: 110 00
city: Praha 1
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:25.524014+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CKCJ
ghcid_original: CZ-HL-PRA-L-ČKCJ
ghcid_uuid: 78f22202-db45-5112-8927-dbbfc3eca42c
ghcid_uuid_sha256: 4878a8ef-7251-87c1-980e-2a787b8ca612
ghcid_numeric: 5222109514304681921
record_id: 9bc9afc6-3356-4113-a8ae-2e4aa26e8363
generation_timestamp: '2025-12-06T23:37:25.524014+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CKCJ
valid_from: '2025-12-10T09:46:58Z'
valid_to: null
reason: Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha))
per ISO 3166-2:CZ
- ghcid: CZ-HL-PRA-L-CKCJ
valid_from: null
valid_to: '2025-12-10T09:46:58Z'
reason: Previous GHCID with incorrect region code
- ghcid: CZ-HL-PRA-L-CKCJ
ghcid_numeric: 5222109514304681921
valid_from: '2025-12-08T11:21:23.154226+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČKCJ)'
- ghcid: CZ-HL-PRA-L-ČKCJ
ghcid_numeric: 5098596963913306570
valid_from: '2025-12-06T23:37:25.524014+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:23.154226+00:00'
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CKCJ
- identifier_scheme: GHCID_UUID
identifier_value: 78f22202-db45-5112-8927-dbbfc3eca42c
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 4878a8ef-7251-87c1-980e-2a787b8ca612
- identifier_scheme: GHCID_NUMERIC
identifier_value: '5222109514304681921'
- identifier_scheme: RECORD_ID
identifier_value: 9bc9afc6-3356-4113-a8ae-2e4aa26e8363
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:30.046139+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046139+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date: null
verified_by: null
entity_claims:
- claim_type: full_name
claim_value: ČSAV - Kabinet cizích jazyků
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046139+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046139+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 1
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046139+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABB021
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:30.046139+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:25.524014+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 110 00
street_address: Valentinská 1
normalization_timestamp: '2025-12-09T20:41:06.322739+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.322714+00:00'

View file

@ -0,0 +1,165 @@
original_entry:
name: Československý kamenoprůmysl Praha
institution_type: LIBRARY
source: CH-Annotator (czech_unified_ch_annotator.yaml)
identifiers:
- &id001
identifier_scheme: Sigla
identifier_value: ABF020
locations:
- street_address: Liebknechtova 61
postal_code: 150 00
city: Praha 5
region: Hlavní město Praha
country: CZ
processing_timestamp: '2025-12-06T23:37:15.799986+00:00'
ghcid:
ghcid_current: CZ-10-PRA-L-CKP
ghcid_original: CZ-HL-PRA-L-ČKP
ghcid_uuid: 00ac345f-83fb-5055-9659-92cffb54a22c
ghcid_uuid_sha256: 3ca6be9a-da46-86de-91ae-070f72f5bc49
ghcid_numeric: 4370390060705949406
record_id: 5d12a8e8-7773-4093-99b0-3c6de6d30aa8
generation_timestamp: '2025-12-06T23:37:15.799986+00:00'
location_resolution:
country_code: CZ
region_code: 10
city_code: PRA
method: CH_ANNOTATOR_SOURCE
ghcid_history:
- ghcid: CZ-10-PRA-L-CKP
valid_from: "2025-12-10T09:46:58Z"
valid_to: null
reason: "Corrected region code from CZ-HL to CZ-10 (Prague (Hlavní město Praha)) per ISO 3166-2:CZ"
- ghcid: CZ-HL-PRA-L-CKP
valid_from: null
valid_to: "2025-12-10T09:46:58Z"
reason: "Previous GHCID with incorrect region code"
- ghcid: CZ-HL-PRA-L-CKP
ghcid_numeric: 4370390060705949406
valid_from: '2025-12-08T11:21:23.110362+00:00'
reason: 'Normalized diacritics to ASCII per ABBREV-DIACRITICS rule (was: CZ-HL-PRA-L-ČKP)'
- ghcid: CZ-HL-PRA-L-ČKP
ghcid_numeric: 12709429357571624434
valid_from: '2025-12-06T23:37:15.799986+00:00'
reason: Initial GHCID from CH-Annotator (czech_unified_ch_annotator.yaml)
valid_to: '2025-12-08T11:21:23.110362+00:00'
custodian_name:
claim_type: custodian_name
claim_value: Československý kamenoprůmysl Praha
source_type: ch_annotator
identifiers:
- identifier_scheme: GHCID
identifier_value: CZ-10-PRA-L-CKP
- identifier_scheme: GHCID_UUID
identifier_value: 00ac345f-83fb-5055-9659-92cffb54a22c
- identifier_scheme: GHCID_UUID_SHA256
identifier_value: 3ca6be9a-da46-86de-91ae-070f72f5bc49
- identifier_scheme: GHCID_NUMERIC
identifier_value: '4370390060705949406'
- identifier_scheme: RECORD_ID
identifier_value: 5d12a8e8-7773-4093-99b0-3c6de6d30aa8
- *id001
provenance:
data_source: API_SCRAPING
data_tier: TIER_1_AUTHORITATIVE
extraction_date: '2025-11-19T10:14:29.914267+00:00'
extraction_method: 'Created from CH-Annotator file: czech_unified_ch_annotator.yaml'
confidence_score: 0.95
notes:
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:11:47Z
- Canonical location added via normalize_custodian_files.py on 2025-12-08T23:49:13Z
- Canonical location added via normalize_custodian_files.py on 2025-12-09T06:50:48Z
ch_annotator:
convention_id: ch_annotator-v1_7_0
convention_version: 1.7.0
entity_classification:
hypernym: GRP
hypernym_label: GROUP
subtype: GRP.HER.LIB
subtype_label: LIBRARY
ontology_class: schema:Library
alternative_classes:
- org:FormalOrganization
- rov:RegisteredOrganization
- glam:HeritageCustodian
extraction_provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914267+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
annotation_provenance:
annotation_agent: opencode-claude-sonnet-4
annotation_date: '2025-12-06T21:12:32.389491+00:00'
annotation_method: retroactive CH-Annotator application via batch script
source_file: czech_unified.yaml
annotation_metadata:
confidence_score: 0.95
verified: false
verification_date:
verified_by:
entity_claims:
- claim_type: full_name
claim_value: Československý kamenoprůmysl Praha
property_uri: skos:prefLabel
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914267+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: institution_type
claim_value: LIBRARY
property_uri: rdf:type
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914267+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.95
- claim_type: located_in_city
claim_value: Praha 5
property_uri: schema:addressLocality
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914267+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 0.9
- claim_type: ghcid
claim_value: CZ-ABF020
property_uri: glam:ghcid
provenance:
namespace: glam
path: /files/czech_unified.yaml
timestamp: '2025-11-19T10:14:29.914267+00:00'
agent: claude-conversation
context_convention: ch_annotator-v1_7_0
confidence: 1.0
integration_note:
created_from: czech_unified_ch_annotator.yaml
creation_date: '2025-12-06T23:37:15.799986+00:00'
creation_method: create_custodian_from_ch_annotator.py
location:
city: Prague
region: Hlavní město Praha
region_code: 10
country: CZ
postal_code: 150 00
street_address: Liebknechtova 61
normalization_timestamp: '2025-12-09T20:41:06.370465+00:00'
latitude: 50.08804
longitude: 14.42076
geonames_id: 3067696
geonames_name: Prague
feature_code: PPLC
coordinate_provenance:
source_type: GEONAMES_GHCID_CITY_CODE
source_path: data/reference/geonames.db
entity_id: 3067696
city_code: PRA
original_timestamp: '2025-12-09T20:41:06.370441+00:00'

Some files were not shown because too many files have changed in this diff Show more