99 lines
No EOL
4 KiB
Text
99 lines
No EOL
4 KiB
Text
================================================================================
|
|
SWISS ISIL DATABASE - FINAL SCRAPING REPORT
|
|
================================================================================
|
|
Completed: 2025-11-18 21:18:38
|
|
Duration: 33 minutes 3 seconds
|
|
|
|
SUMMARY STATISTICS
|
|
--------------------------------------------------------------------------------
|
|
Total institutions scraped: 2379
|
|
Detail pages successfully scraped: 1929
|
|
Errors encountered: 0
|
|
|
|
DATA COMPLETENESS
|
|
--------------------------------------------------------------------------------
|
|
ISIL codes: 1,923 (80.8%)
|
|
Email addresses: 986 (41.4%)
|
|
Phone numbers: 1,168 (49.1%)
|
|
Websites: 934 (39.3%)
|
|
Physical addresses: 117 (4.9%)
|
|
Membership info: 0 (0.0%)
|
|
Dewey classifications: 0 (0.0%)
|
|
|
|
INSTITUTION STATUS
|
|
--------------------------------------------------------------------------------
|
|
Active: 2,379 (100.0%)
|
|
Inactive: 0 (0.0%)
|
|
|
|
GEOGRAPHIC DISTRIBUTION (BY CANTON)
|
|
--------------------------------------------------------------------------------
|
|
ZH : 479 ( 20.1%)
|
|
BE : 311 ( 13.1%)
|
|
GE : 227 ( 9.5%)
|
|
VD : 224 ( 9.4%)
|
|
BS : 139 ( 5.8%)
|
|
VS : 121 ( 5.1%)
|
|
NE : 121 ( 5.1%)
|
|
FR : 102 ( 4.3%)
|
|
SG : 87 ( 3.7%)
|
|
TG : 85 ( 3.6%)
|
|
AG : 81 ( 3.4%)
|
|
LU : 59 ( 2.5%)
|
|
GR : 56 ( 2.4%)
|
|
TI : 54 ( 2.3%)
|
|
ZG : 49 ( 2.1%)
|
|
SO : 45 ( 1.9%)
|
|
BL : 42 ( 1.8%)
|
|
SH : 18 ( 0.8%)
|
|
SZ : 18 ( 0.8%)
|
|
JU : 17 ( 0.7%)
|
|
Unknown : 9 ( 0.4%)
|
|
AR : 7 ( 0.3%)
|
|
OW : 7 ( 0.3%)
|
|
UR : 6 ( 0.3%)
|
|
GL : 6 ( 0.3%)
|
|
NW : 6 ( 0.3%)
|
|
AI : 2 ( 0.1%)
|
|
TessinMunicipal archives or county/local authority archives: 1 ( 0.0%)
|
|
|
|
INSTITUTION TYPES (TOP 20)
|
|
--------------------------------------------------------------------------------
|
|
University and research library : 764
|
|
Public library : 347
|
|
Special library : 339
|
|
Municipal archives or county/local authority archi: 190
|
|
Church and religious archives : 85
|
|
Regional archives : 45
|
|
Cantonal library : 37
|
|
Specialised non-governmental archives and archives: 36
|
|
University and research archives : 36
|
|
Business archives : 23
|
|
Private persons and family archives : 22
|
|
Regional and local museums : 22
|
|
Historical museums : 19
|
|
Art museums : 18
|
|
Media archives : 16
|
|
Natural science museums : 8
|
|
Other museums : 8
|
|
National archives : 7
|
|
National library : 5
|
|
Ethnographic museums : 3
|
|
|
|
MISSING DATA ANALYSIS
|
|
--------------------------------------------------------------------------------
|
|
Institutions without ISIL codes: 456
|
|
Institutions without any contact info: 1,211
|
|
|
|
FILES CREATED
|
|
--------------------------------------------------------------------------------
|
|
Main output: swiss_isil_complete_final.json (1.3 MB)
|
|
Batch files: 47 checkpoint files (every 50 institutions)
|
|
Statistics: scraping_stats_resume_20251118_211838.json
|
|
|
|
================================================================================
|
|
NEXT STEPS:
|
|
1. Export to CSV for spreadsheet analysis
|
|
2. Convert to LinkML format for GLAM project integration
|
|
3. Geocode addresses to obtain lat/lon coordinates
|
|
4. Cross-reference with other European ISIL registries
|
|
================================================================================ |