glam/data/custodian/NL-UT-UTR-M-FMU.yaml
kempersc 1f723fd5d7 feat(data): merge staff data from 35 PENDING files into enriched custodians
Merged LinkedIn-extracted staff sections from PENDING files into their
corresponding proper GHCID custodian files. This consolidates data from
two extraction sources:
- Existing enriched files: Google Maps, Museum Register, YouTube, etc.
- PENDING files: LinkedIn staff data extraction

Files modified:
- 28 custodian files enriched with staff data
- 35 PENDING files deleted (merged into proper locations)
- Originals archived to archive/pending_duplicates_20250109/

Key institutions enriched:
- Rijksmuseum (NL-NH-AMS-M-RM)
- Stedelijk Museum Amsterdam (NL-NH-AMS-M-SMA)
- Amsterdam Museum (NL-NH-AMS-M-AM)
- Regionaal Archief Alkmaar (NL-NH-ALK-A-RAA)
- Maritiem Museum Rotterdam (NL-ZH-ROT-M-MMR)
- And 23 more museums/archives across NL

New scripts:
- scripts/merge_staff_data.py: Automated staff data merger
- scripts/categorize_pending_files.py: PENDING file analysis utility
2026-01-09 14:51:17 +01:00

97 lines
3.4 KiB
YAML

custodian_name:
emic_name: Fietsers Museum Utrecht
emic_name_source: linkedin
institution_type:
- M
linkedin_enrichment:
linkedin_url: https://www.linkedin.com/company/fietsers-museum-utrecht
linkedin_slug: fietsers-museum-utrecht
industry: Museums
website: null
follower_count: '544'
staff_count: 2
heritage_staff_count: 2
heritage_staff:
- name: Fietsers Museum Utrecht
headline: ''
heritage_type: M
- name: Kaspar Hanenbergh
headline: Op zoek naar tegelijk rechtvaardig en rechtmatig handelen in de Publieke Sector. In mijn werk met een passie
voor de uitdagingen van beleidsuitvoering. Dat is immers de belangrijkste basis voor een betrouwbare overheid.
linkedin_url: https://www.linkedin.com/in/kaspar-hanenbergh-989a90b9
heritage_type: O
enrichment_timestamp: '2025-12-16T21:06:44.485093+00:00'
provenance:
source: linkedin_company_scrape
original_file: data/custodian/linkedin/fietsers-museum-utrecht.yaml
schema_version: 1.0.0
location:
city: Utrecht
region: UT
country: NL
coordinates:
latitude: 52.09083
longitude: 5.12222
source: geonames
ghcid:
ghcid_current: NL-UT-UTR-M-FMU
ghcid_original: NL-UT-UTR-M-FMU
ghcid_uuid: ccfac140-d5fa-5020-b2b3-8d9095836ee6
ghcid_uuid_sha256: 55d913bd-ba52-8d39-9029-053382447b65
ghcid_numeric: 6185997268765805881
record_id: fa439aae-a797-4168-aa59-41e7a73951aa
generation_timestamp: '2025-12-16T21:06:44.485093+00:00'
ghcid_history:
- ghcid: NL-UT-UTR-M-FMU
ghcid_numeric: 6185997268765805881
valid_from: '2025-12-16T21:06:44.485093+00:00'
valid_to: null
reason: Initial GHCID assignment from LinkedIn batch import
location_resolution:
method: CITY_INFERRED_FROM_NAME
city_code: UTR
region_code: UT
country_code: NL
geonames_id: 2745912
geonames_name: Utrecht
feature_code: PPLA
admin1_code: 09
provenance:
schema_version: 1.0.0
generated_at: '2025-12-16T21:06:44.485093+00:00'
sources:
linkedin:
- source_type: linkedin_company_profile
data_tier: TIER_4_INFERRED
source_file: data/custodian/linkedin/fietsers-museum-utrecht.yaml
extraction_timestamp: '2025-12-16T21:06:44.485093+00:00'
claims_extracted:
- name
- industry
- location
- website
- staff_count
- heritage_staff
data_tier_summary:
TIER_4_INFERRED:
- linkedin_company_profile
notes:
- Created from unmatched LinkedIn company profile
- 'Location resolution method: CITY_INFERRED_FROM_NAME'
- Staff data merged from NL-XX-XXX-PENDING-FIETSERS-MUSEUM-UTRECHT.yaml on 2026-01-09T11:41:34.105176+00:00
staff:
provenance:
source_type: linkedin_company_people_page_html
registered_timestamp: '2025-12-30T09:59:29Z'
registration_method: html_parsing_with_full_staff_data
total_staff_extracted: 1
staff_list:
- staff_id: fietsers-museum-utrecht_staff_0001_kaspar_hanenbergh
person_name: Kaspar Hanenbergh
person_profile_path: data/custodian/person/entity/kaspar-hanenbergh-989a90b9_*.json
role_title: Op zoek naar tegelijk rechtvaardig en rechtmatig handelen in de Publieke Sector. In mijn werk met een passie
voor de uitdagingen van beleidsuitvoering. Dat is immers de belangrijkste basis voor een betrouwbare overheid.
heritage_relevant: true
heritage_type: O
linkedin_profile_url: https://www.linkedin.com/in/kaspar-hanenbergh-989a90b9
linkedin_slug: kaspar-hanenbergh-989a90b9