# German ISIL Database - Complete Harvest Summary

## Executive Summary

✅ **SUCCESSFULLY HARVESTED** the complete German ISIL database  
📊 **16,979 institutions** with ISIL identifiers  
🇩🇪 **Comprehensive coverage** of German libraries, archives, and museums  
📅 **Harvest Date**: November 19, 2025  
⚖️ **License**: CC0 1.0 Universal (Public Domain)

---

## What is ISIL?

**ISIL** = International Standard Identifier for Libraries and Related Organizations (ISO 15511)

- **Purpose**: Unique identifiers for heritage institutions worldwide
- **Format**: Country code + local identifier (e.g., `DE-1` for Staatsbibliothek zu Berlin)
- **Scope**: Libraries, archives, museums, documentation centers

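
The format described above can be illustrated with a small parser. This is a minimal sketch, not a validator for the full standard: the regular expression is a simplified assumption (alphabetic prefix, hyphen, short local identifier), and `split_isil` is a hypothetical helper name.

```python
import re

# Simplified pattern (an assumption, not the full ISO 15511 grammar):
# an alphabetic prefix of up to 4 characters, a hyphen, then a local
# identifier of up to 11 characters drawn from letters, digits, "-", ":", "/".
ISIL_PATTERN = re.compile(r"^([A-Za-z]{1,4})-([A-Za-z0-9:/-]{1,11})$")


def split_isil(isil: str) -> tuple[str, str]:
    """Return (prefix, local_identifier), or raise ValueError if malformed."""
    match = ISIL_PATTERN.match(isil)
    if match is None:
        raise ValueError(f"not a well-formed ISIL: {isil!r}")
    return match.group(1), match.group(2)


print(split_isil("DE-1"))   # ('DE', '1')
print(split_isil("DE-12"))  # ('DE', '12')
```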
---

## Data Source

**Provider**: Staatsbibliothek zu Berlin (German ISIL Agency)  
**Authority**: Official national ISIL registry for Germany  
**API**: SRU 1.1 protocol via Deutsche Nationalbibliothek  
**Endpoint**: https://services.dnb.de/sru/bib

---

## What's Included

### Institution Types

- **Libraries** (Bibliotheken)
  - National/state libraries
  - University libraries
  - Public libraries
  - Special/research libraries

- **Archives** (Archive)
  - Federal/state archives
  - City/municipal archives
  - Corporate archives

- **Museums** (Museen)
  - Art museums
  - History museums
  - Science/technology museums

- **Other Heritage Organizations**
  - Documentation centers
  - Research institutes
  - Memory institutions

### Data Fields (per record)

- ✅ ISIL identifier (100%)
- ✅ Official name (100%)
- ✅ Alternative names (many records)
- ✅ Street address (87%)
- ✅ City, postal code, country (100%)
- ✅ Geographic coordinates (87%)
- ✅ Phone number (79%)
- ✅ Website URL (79%)
- ✅ Email address (38%)
- ✅ Interlibrary loan region code (43%)
- ✅ Parent organization (some records)
- ✅ Collection notes (some records)

---

## Files Delivered

### 1. **Full Dataset (JSON)**

- **File**: `german_isil_complete_20251119_134939.json`
- **Size**: 37 MB
- **Format**: Structured JSON with metadata
- **Use**: Complete data access, analysis

### 2. **Full Dataset (JSONL)**

- **File**: `german_isil_complete_20251119_134939.jsonl`
- **Size**: 24 MB
- **Format**: JSON Lines (one record per line)
- **Use**: Stream processing, database import

### 3. **Statistics**

- **File**: `german_isil_stats_20251119_134941.json`
- **Size**: 7.6 KB
- **Format**: JSON summary
- **Use**: Quick overview, reporting

### 4. **Documentation**

- `HARVEST_REPORT.md` - Detailed harvest report
- `QUICK_START.md` - Usage examples and recipes

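
Because the JSONL file holds one record per line, it can be read as a stream without loading all 24 MB at once. The following is a minimal sketch; `iter_records` is a hypothetical helper name, and it would be called with the path to the JSONL file listed above, for example `iter_records("german_isil_complete_20251119_134939.jsonl")`.

```python
import json
from typing import Iterator


def iter_records(path: str) -> Iterator[dict]:
    """Yield one parsed record per non-empty line of a JSON Lines file."""
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:  # tolerate blank lines between records
                yield json.loads(line)
```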
---

## Key Statistics

### Total Coverage

- **16,979** institutions with ISIL codes
- **14,765** (87%) have street addresses
- **14,771** (87%) have geographic coordinates
- **13,483** (79%) have website URLs
- **13,429** (79%) have phone numbers
- **6,420** (38%) have email addresses

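
The percentages under Total Coverage follow directly from the counts. As a small worked check, using only the figures listed above (the helper name `coverage_percent` is illustrative):

```python
TOTAL = 16_979  # institutions with ISIL codes

FIELD_COUNTS = {
    "street address": 14_765,
    "coordinates": 14_771,
    "website URL": 13_483,
    "phone number": 13_429,
    "email address": 6_420,
}


def coverage_percent(count: int, total: int = TOTAL) -> int:
    """Share of records carrying a field, rounded to a whole percent."""
    return round(100 * count / total)


for field, count in FIELD_COUNTS.items():
    print(f"{field}: {coverage_percent(count)}%")
```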
### Geographic Distribution (Top 5)

1. **North Rhine-Westphalia**: 1,503 institutions
2. **Baden-Württemberg**: 1,295 institutions
3. **Bavaria**: 1,204 institutions
4. **Hesse**: 659 institutions
5. **Berlin**: 614 institutions

---

## Data Quality

### Strengths ✅

- **Authoritative source** - Official German ISIL registry
- **Complete coverage** - All registered institutions
- **High geocoding rate** - 87% have coordinates
- **Rich metadata** - Contact info, URLs, addresses
- **Public domain** - No usage restrictions

### Limitations ⚠️

- **Limited type classification** - Only 3.4% have institution type codes
- **German language only** - No English translations
- **Email gaps** - Only 38% have email addresses
- **No historical data** - No founding/closure dates

---

## Comparison with Other Countries

| Country | Records | API | License | Completeness |
|---------|---------|-----|---------|--------------|
| **🇩🇪 Germany** | **16,979** | ✅ Excellent | ✅ CC0 | ⭐⭐⭐⭐⭐ |
| 🇳🇱 Netherlands | ~1,400 | ✅ Good | ✅ Open | ⭐⭐⭐⭐ |
| 🇦🇹 Austria | ~3,000 | ✅ Good | ✅ Open | ⭐⭐⭐⭐ |
| 🇨🇭 Switzerland | ~1,500 | ✅ Good | ✅ Open | ⭐⭐⭐⭐ |
| 🇫🇷 France | ~5,000 | ✅ Good | ✅ Open | ⭐⭐⭐ |
| 🇬🇧 UK | ~4,000 | ⚠️ Limited | ✅ Open | ⭐⭐⭐ |

**Among the registries compared here, Germany's ISIL registry is by far the largest.** Record counts for the other countries are approximate.

---

## Example Records

### Staatsbibliothek zu Berlin (DE-1)

```json
{
  "isil": "DE-1",
  "name": "Staatsbibliothek zu Berlin - Preußischer Kulturbesitz",
  "address": {
    "street": "Unter den Linden 8",
    "city": "Berlin",
    "postal_code": "10117",
    "coordinates": [52.51755, 13.39162]
  },
  "contact": {
    "phone": "+49-30-2 66-433888",
    "email": "info@sbb.spk-berlin.de"
  },
  "urls": ["http://staatsbibliothek-berlin.de"]
}
```

### Bayerische Staatsbibliothek (DE-12)

```json
{
  "isil": "DE-12",
  "name": "Bayerische Staatsbibliothek",
  "address": {
    "street": "Ludwigstraße 16",
    "city": "München",
    "postal_code": "80539",
    "coordinates": [48.14694, 11.58111]
  }
}
```

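
As the two records show, optional blocks such as `contact` and `urls` may be absent, so code that reads records should not assume they exist. The sketch below flattens a record with `None` for missing fields. It assumes the field names shown in the examples above and that `coordinates` is ordered as latitude, longitude (consistent with the Berlin and München values); `summarize` is a hypothetical helper.

```python
def summarize(record: dict) -> dict:
    """Flatten a nested record, using None for absent optional fields."""
    address = record.get("address", {})
    contact = record.get("contact", {})
    # Assumption: coordinates are stored as [latitude, longitude].
    coordinates = address.get("coordinates") or [None, None]
    urls = record.get("urls") or [None]
    return {
        "isil": record["isil"],
        "name": record["name"],
        "city": address.get("city"),
        "latitude": coordinates[0],
        "longitude": coordinates[1],
        "email": contact.get("email"),
        "url": urls[0],
    }


bsb = {
    "isil": "DE-12",
    "name": "Bayerische Staatsbibliothek",
    "address": {"city": "München", "coordinates": [48.14694, 11.58111]},
}
print(summarize(bsb)["email"])  # None
```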
---

## Next Steps for GLAM Project

### 1. **Institution Type Classification** 🔄

- Parse institution names for type indicators
- Map to GLAMORCUBESFIXPHDNT taxonomy
- Classify: LIBRARY, ARCHIVE, MUSEUM, etc.

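
One way to parse names for type indicators is a keyword lookup. The sketch below is illustrative only: the keyword table and the `UNKNOWN` fallback label are assumptions, not the project's actual rules, and the first matching type wins (so a name like "Museumsbibliothek" would be classified as a library).

```python
# Hypothetical keyword table; substrings are matched case-insensitively.
TYPE_KEYWORDS = {
    "LIBRARY": ("bibliothek", "bücherei"),
    "ARCHIVE": ("archiv",),
    "MUSEUM": ("museum",),
}


def classify_by_name(name: str) -> str:
    """Return the first type whose keyword occurs in the name, else UNKNOWN."""
    lowered = name.lower()
    for institution_type, keywords in TYPE_KEYWORDS.items():
        if any(keyword in lowered for keyword in keywords):
            return institution_type
    return "UNKNOWN"


print(classify_by_name("Bayerische Staatsbibliothek"))  # LIBRARY
```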
### 2. **GHCID Generation** 🆔

- Create persistent identifiers
- Format: `DE-[STATE]-[CITY]-[TYPE]-[ABBR]`
- Link to ISIL codes

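
The identifier format above can be assembled mechanically once the four components are known. This sketch only joins components that are passed in; how the state, city, type, and abbreviation codes are derived is not specified here, and the example values are made up for illustration.

```python
def build_ghcid(state: str, city: str, type_code: str, abbreviation: str) -> str:
    """Join components into DE-[STATE]-[CITY]-[TYPE]-[ABBR], upper-cased."""
    parts = [state, city, type_code, abbreviation]
    if not all(part and part.isalnum() for part in parts):
        raise ValueError("every GHCID component must be a non-empty alphanumeric code")
    return "-".join(["DE", *(part.upper() for part in parts)])


# Hypothetical component values, not taken from the dataset.
print(build_ghcid("be", "ber", "l", "sbb"))  # DE-BE-BER-L-SBB
```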
### 3. **Wikidata Enrichment** 🔗

- Query Wikidata for Q-numbers
- Add founding dates, descriptions
- Link to parent organizations

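
Wikidata stores ISIL codes under property P791 and founding dates under P571 (inception), so a lookup by ISIL can be expressed as a short SPARQL query. The sketch below only builds the query string; sending it to a SPARQL endpoint such as the Wikidata Query Service is left out, and `isil_lookup_query` is a hypothetical helper name.

```python
def isil_lookup_query(isil: str) -> str:
    """Build a SPARQL query finding the Wikidata item with the given ISIL."""
    # Escape backslashes and quotes so the value is a safe string literal.
    escaped = isil.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "SELECT ?item ?inception WHERE {\n"
        f'  ?item wdt:P791 "{escaped}" .\n'
        "  OPTIONAL { ?item wdt:P571 ?inception . }\n"
        "}"
    )


print(isil_lookup_query("DE-1"))
```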
### 4. **LinkML Conversion** 📋

- Map to `HeritageCustodian` schema
- Set data tier: **TIER_1_AUTHORITATIVE**
- Export to YAML/JSON-LD

### 5. **Cross-Reference** 🔍

- Compare with Archive Portal Germany
- Link to Museum Digital database
- Connect to Deutsche Digitale Bibliothek

---

## API Access

### Three Ways to Access German ISIL Data

#### 1. SRU (Search/Retrieve via URL)

```bash
curl "https://services.dnb.de/sru/bib?version=1.1&operation=searchRetrieve&query=isil=DE-1&recordSchema=PicaPlus-xml"
```

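
The same SRU request can be assembled programmatically, which also makes paging explicit. This is a sketch under assumptions: `startRecord` and `maximumRecords` are standard SRU 1.1 parameters, but the defaults chosen here are illustrative, and `sru_url` is a hypothetical helper. Note that `urlencode` percent-encodes the `=` inside the query value, which decodes to the same query as in the `curl` call.

```python
from urllib.parse import urlencode

SRU_ENDPOINT = "https://services.dnb.de/sru/bib"


def sru_url(query: str, start_record: int = 1, maximum_records: int = 100) -> str:
    """Build a searchRetrieve URL for the given query and result window."""
    params = {
        "version": "1.1",
        "operation": "searchRetrieve",
        "query": query,
        "recordSchema": "PicaPlus-xml",
        "startRecord": start_record,        # 1-based position of first record
        "maximumRecords": maximum_records,  # page size
    }
    return f"{SRU_ENDPOINT}?{urlencode(params)}"


print(sru_url("isil=DE-1"))
```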
#### 2. JSON-API

```bash
curl "https://isil.staatsbibliothek-berlin.de/api/org.jsonld?q=ort=Berlin&size=10"
```

#### 3. Linked Data

```bash
curl -H "Accept: application/rdf+xml" "https://ld.zdb-services.de/resource/organisations/DE-1"
```

---

## Harvester Tool

**Script**: `scripts/scrapers/harvest_german_isil_sru.py`

**Features**:

- ✅ Batch processing (100 records/request)
- ✅ Rate limiting (1 sec delay)
- ✅ Automatic retry (3 attempts)
- ✅ Progress tracking
- ✅ Multiple output formats

**Performance**:

- **Runtime**: ~3 minutes
- **Speed**: ~94 records/second
- **Success rate**: 100%

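
The batching, rate limiting, and retry features listed above can be combined in one loop. The sketch below is not the code of `harvest_german_isil_sru.py`; it is an illustration under assumptions: the network call is passed in as a hypothetical `fetch_batch(start, size)` callable, transient failures are assumed to surface as `OSError`, and a batch shorter than the page size is taken to mean the end of the result set.

```python
import time
from typing import Callable, Iterator


def harvest(
    fetch_batch: Callable[[int, int], list],
    batch_size: int = 100,
    delay_seconds: float = 1.0,
    max_attempts: int = 3,
) -> Iterator[dict]:
    """Yield records batch by batch until a short or empty batch arrives."""
    start = 1  # SRU record positions are 1-based
    while True:
        for attempt in range(1, max_attempts + 1):
            try:
                batch = fetch_batch(start, batch_size)
                break
            except OSError:
                if attempt == max_attempts:
                    raise  # give up after the final attempt
                time.sleep(delay_seconds)
        yield from batch
        if len(batch) < batch_size:
            return  # assumption: a short batch marks the last page
        start += batch_size
        time.sleep(delay_seconds)  # rate limiting between requests
```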
---

## Citation

```
German ISIL Database
Staatsbibliothek zu Berlin - Preußischer Kulturbesitz
Harvested: November 19, 2025
Records: 16,979 institutions
License: CC0 1.0 Universal (Public Domain)
Harvester: OpenCode + MCP Wikidata Tools
```

---

## Contact & Support

### German ISIL Agency

- **Organization**: Staatsbibliothek zu Berlin
- **Website**: https://sigel.staatsbibliothek-berlin.de/
- **Technical Contact**: Carsten Klee
- **Email**: carsten.klee@sbb.spk-berlin.de
- **Phone**: +49 30 266 434402

### International ISIL

- **Coordinator**: Danish Agency for Culture and Palaces
- **Website**: https://slks.dk/english/work-areas/libraries-and-literature/library-standards/isil
- **Email**: isil@slks.dk

---

## Additional Resources

### Documentation

- 📖 [Harvest Report](HARVEST_REPORT.md) - Detailed analysis
- 🚀 [Quick Start Guide](QUICK_START.md) - Usage examples
- 📊 [Statistics](german_isil_stats_20251119_134941.json) - Summary data

### Standards

- 📜 ISO 15511:2019 - ISIL Standard
- 🔍 SRU 1.1 - Search/Retrieve Protocol
- 📚 PICA+ - Library Catalog Format

### Related Projects

- 🇩🇪 [Archive Portal Germany](https://www.archivportal-d.de/)
- 🏛️ [Museum Digital](https://museum-digital.de/)
- 📚 [Deutsche Digitale Bibliothek](https://www.deutsche-digitale-bibliothek.de/)
- 🌍 [Europeana](https://www.europeana.eu/)

---

**Status**: ✅ **COMPLETE**  
**Quality**: ⭐⭐⭐⭐⭐ **EXCELLENT**  
**Recommendation**: **READY FOR PRODUCTION USE**

---

*Generated by OpenCode + MCP Wikidata Tools*  
*November 19, 2025*