glam/data/isil/czech_republic/czech_isil_analysis.md
2025-11-19 23:25:22 +01:00

4.3 KiB

Czech Republic ISIL Database Analysis

Source Information

  • Authority: National Library of the Czech Republic (Národní knihovna České republiky)
  • Database: ADR (Adresář knihoven a informačních institucí v ČR)
  • Download URL: https://aleph.nkp.cz/data/adr.xml.gz
  • License: CC0 (Public Domain)
  • Format: MARC21 XML
  • File Size: 27 MB (decompressed), 1.9 MB (compressed)
  • Last Updated: Generated weekly (each Monday)
  • Download Date: November 19, 2025

Database Statistics

  • Total Records: 8,145 institutions
  • Coverage: All types of libraries and information centers in Czech Republic
    • National libraries (NK)
    • Academic libraries
    • Public libraries
    • Special libraries
    • Archives
    • Museums with library collections

Data Structure (MARC21 Custom Fields)

Key Fields:

  • SGL: Sigla (Library Code) - equivalent to ISIL identifier
  • NAZ: Name of institution (3 levels: main/department/division)
  • VAR: Name variants (Czech and foreign language)
  • TYP: Institution type
  • ADR: Address (street, postal code, city, GPS coordinates)
  • MES: City/municipality
  • KRJ: Region and district
  • ICO: Company registration number (IČO and DIČ)
  • TEL: Telephone
  • EML: Email
  • URL: Website URLs (main site, catalogs, databases)
  • JMN: Contact persons with titles and roles
  • OTD: Opening hours (by day of week)
  • FND: Collection size (books, periodicals, other materials)
  • KNS: Library system software used

Institution Types (TYP field):

According to documentation, includes various codes for:

  • NK: National library
  • VK: Academic library
  • SVK: Regional library
  • MK: Municipal library
  • OPVK: Specialized public library
  • And many more types

Sample Records

Record 1: National Library of Czech Republic

  • Sigla: ABA000
  • Name: Národní knihovna České republiky
  • English Name: National Library of the Czech Republic
  • Type: NK (National library)
  • Founded: 1602
  • Address: Mariánské náměstí 190/5, 110 00 Praha 1
  • GPS: 50°5'11.12"N, 14°24'56.61"E
  • Phone: 221 663 111
  • Website: https://www.nkp.cz
  • Director: Mgr. Tomáš Foltýn (generální ředitel)
  • Collections: 6,919,075 volumes, 10,449 periodical titles (2015)
  • System: ALEPH

Data Quality Assessment

Strengths:

Comprehensive coverage of Czech institutions
Rich metadata (GPS coordinates, opening hours, collection statistics)
Multiple contact methods (phone, email, web)
Hierarchical institution structure (departments, branches)
Available as open data (CC0 license)
Regular updates (weekly)
Well-documented structure

Limitations:

⚠️ Custom MARC21 format (not standard MARC21 bibliographic)
⚠️ Documentation primarily in Czech
⚠️ Library codes (siglas) are not standard ISIL format (CZ-XXXXX)
⚠️ Some records may lack ISIL codes entirely

Conversion Requirements

To integrate with GLAM project:

  1. Parse MARC21 XML structure
  2. Map SGL (sigla) codes to ISIL format (CZ-[sigla])
  3. Extract institution types and map to GLAMORCUBESFIXPHDNT taxonomy
  4. Geocode addresses (GPS already provided!)
  5. Handle Czech-specific metadata (IČO, regional structure)
  6. Create LinkML-compliant records with provenance

ISIL Code Investigation

The database uses "siglas" (library codes) like:

  • ABA000 - National Library
  • ABA001 - National Library - Library Services Division
  • etc.

These are NOT standard ISIL codes (format: CZ-XXXXX). Need to:

  1. Check if there's a mapping between siglas and official ISIL codes
  2. Contact NK ČR for clarification on ISIL code assignment
  3. Potentially cross-reference with ISO 15511 registry

Next Steps

  1. Download complete database (DONE)
  2. Parse MARC21 XML to extract all institutions
  3. Map institution types to GLAM taxonomy
  4. Geocode addresses (GPS coordinates already available)
  5. Generate LinkML-compliant YAML instances
  6. Cross-reference with official ISIL registry
  7. Validate data quality and completeness

Additional Resources