glam/data/entity_annotation/modules/hypernyms/wrk.yaml
2025-12-05 15:30:23 +01:00


# =============================================================================
# WORK Hypernym Module (FRBR/LRM Model)
# =============================================================================
# Version: 1.7.0
# Module: hypernyms/wrk.yaml
#
# BREAKING CHANGE v1.7.0: Renamed from TEXTUAL_REFERENCE (TXT) to WORK (WRK)
# and restructured following FRBR/IFLA-LRM model:
# - WORK: Abstract intellectual creation (the concept)
# - EXPRESSION: Realization in a specific form (the text)
# - MANIFESTATION: Physical embodiment (the publication)
# - ITEM: Single exemplar (the specific copy)
#
# NERD's "Product" class is semantically wrong for intellectual works.
# FRBR provides the bibliographic and archival standard.
# =============================================================================
id: hypernym_work
name: "WORK Hypernym (FRBR/LRM)"
description: |
  References to intellectual and creative works, spanning the FRBR/LRM
  abstraction levels from abstract concept (Work) to physical copy (Item).
hypernym:
  WORK:
    code: "WRK"
    definition: |
      References to intellectual and creative works, spanning the FRBR/LRM
      abstraction levels from abstract concept (Work) to physical copy (Item).
      The FRBR model distinguishes:
      - WORK: Distinct intellectual creation ("Hamlet" as a concept)
      - EXPRESSION: Specific intellectual realization ("Hamlet in Dutch translation")
      - MANIFESTATION: Physical embodiment ("2023 Penguin edition of Hamlet")
      - ITEM: Single exemplar ("the copy in Amsterdam Public Library")
      Most textual mentions reference WORK level (titles, concepts).
      Archival contexts often reference MANIFESTATION or ITEM level.
    design_rationale: |
      NERD's "Product" class conflates:
      - Creative works (books, music, films)
      - Commercial products (software, merchandise)
      - Services (subscriptions, access)
      For heritage contexts, FRBR (now IFLA-LRM) is the bibliographic standard:
      - Library catalogs use FRBR for bibliographic relationships
      - FRBRoo integrates with CIDOC-CRM for museum contexts
      - RiC-O (archival) aligns with FRBR for records
      This enables:
      - Linking editions, translations, and copies of the same work
      - Tracking provenance of specific items
      - Distinguishing conceptual references from physical objects
    ontology_mappings:
      primary_class: "frbroo:F1_Work"
      primary_class_definition: |
        FRBRoo F1 Work: "This class comprises distinct concepts or combination
        of concepts identified in artistic and intellectual expressions."
        Equivalent to IFLA-LRM Work.
      alternative_classes:
        - "lrm:Work" # IFLA Library Reference Model
        - "crm:E89_Propositional_Object" # CIDOC-CRM intellectual content
        - "schema:CreativeWork" # Schema.org
        - "dct:BibliographicResource" # Dublin Core
      linkml_mapping:
        class_uri: "frbroo:F1_Work"
        exact_mappings:
          - "lrm:Work"
        close_mappings:
          - "schema:CreativeWork"
          - "crm:E89_Propositional_Object"
        related_mappings:
          - "bf:Work" # BIBFRAME
      nerd_class: "nerd:Product"
      nerd_deprecation_note: |
        DEPRECATED: NERD's Product class is semantically incorrect for
        intellectual works. "Product" implies commercial goods, not creative
        expressions. Use FRBR/FRBRoo/LRM for bibliographic precision.
        Retain NERD mapping ONLY for basic NLP pipeline interchange.
      frbr_mapping:
        work: "frbroo:F1_Work | lrm:Work"
        expression: "frbroo:F2_Expression | lrm:Expression"
        manifestation: "frbroo:F3_Manifestation | lrm:Manifestation"
        item: "frbroo:F5_Item | lrm:Item"
        note: |
          FRBRoo is the object-oriented version of FRBR, aligned with CIDOC-CRM.
          IFLA-LRM (2017) is the consolidated successor to FRBR/FRAD/FRSAD.
          BIBFRAME (bf:) is the Library of Congress RDF vocabulary for bibliographic data.
    subcategories:
      # ----- FRBR WORK LEVEL (abstract concept) -----
      WORK_ABSTRACT:
        code: "WRK.ABS"
        definition: |
          Abstract intellectual works referenced by title or concept,
          independent of specific editions or copies.
        examples:
          - "Hamlet"
          - "The Diary of Anne Frank"
          - "Beethoven's Ninth Symphony"
          - "the Quran"
        ontology_class: "frbroo:F1_Work"
        alternative_classes:
          - "lrm:Work"
          - "bf:Work"
        linkml_mapping:
          class_uri: "frbroo:F1_Work"
          exact_mappings:
            - "lrm:Work"
            - "bf:Work"
        note: |
          Use for references to works as concepts, not specific editions:
          - "Shakespeare's Hamlet" (the work, not a specific edition)
          - "Anne Frank's diary" (the conceptual work)
          Link to authority files (VIAF, Wikidata) for work identification.
      # ----- FRBR EXPRESSION LEVEL (specific realization) -----
      EXPRESSION:
        code: "WRK.EXP"
        definition: |
          Specific realizations of works: translations, versions, arrangements,
          performances captured as recordings.
        examples:
          - "the Dutch translation of Hamlet"
          - "the 1603 quarto text of Hamlet"
          - "Karajan's 1962 recording of Beethoven's Ninth"
          - "the King James Bible"
        ontology_class: "frbroo:F2_Expression"
        alternative_classes:
          - "lrm:Expression"
          - "bf:Work" # BIBFRAME conflates Work/Expression
        note: |
          Expressions are realizations of Works:
          - Same Work, different language → different Expression
          - Same Work, different arrangement → different Expression
          - Same musical Work, different performance → different Expression
      # ----- FRBR MANIFESTATION LEVEL (publication/edition) -----
      MANIFESTATION:
        code: "WRK.MAN"
        definition: |
          Physical or digital embodiments: editions, publications, pressings,
          broadcasts. Manifestations are producible in multiple copies.
        examples:
          - "the 2023 Penguin edition of Hamlet"
          - "Museum Journal vol. 15, no. 2"
          - "Nature magazine, March 2023"
          - "the 1609 Quarto of Shakespeare's Sonnets"
        ontology_class: "frbroo:F3_Manifestation"
        alternative_classes:
          - "lrm:Manifestation"
          - "bf:Instance" # BIBFRAME equivalent
          - "schema:Book"
          - "schema:Periodical"
        linkml_mapping:
          class_uri: "frbroo:F3_Manifestation"
          exact_mappings:
            - "lrm:Manifestation"
            - "bf:Instance"
          close_mappings:
            - "schema:Book"
        note: |
          Manifestations are what libraries catalog and acquire:
          - Publisher, date, ISBN, format are Manifestation attributes
          - Multiple identical copies = one Manifestation, many Items
      # ----- FRBR ITEM LEVEL (specific copy) -----
      ITEM:
        code: "WRK.ITM"
        definition: |
          Single exemplars: a specific copy, with provenance, annotations,
          damage. Items are unique physical objects.
        examples:
          - "the British Library's copy of the First Folio"
          - "manuscript KB 128 A 14 in the Royal Library"
          - "the annotated copy formerly owned by Van Gogh"
        ontology_class: "frbroo:F5_Item"
        alternative_classes:
          - "lrm:Item"
          - "bf:Item"
          - "crm:E22_Human-Made_Object"
        note: |
          Items have individual provenance and condition:
          - Ownership history, annotations, damage
          - Physical location (shelf mark, repository)
          - For manuscripts, each copy is a unique Item
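      # Sketch (illustrative, not part of the schema): the four FRBR levels
      # above applied to mentions of a single work, reusing the examples from
      # this module's definition. The keys `text` and `code` are hypothetical;
      # the codes are the ones defined in this module.
      #
      #   - text: "Hamlet"                                # the concept
      #     code: "WRK.ABS"
      #   - text: "Hamlet in Dutch translation"           # one realization
      #     code: "WRK.EXP"
      #   - text: "2023 Penguin edition of Hamlet"        # one publication
      #     code: "WRK.MAN"
      #   - text: "the copy in Amsterdam Public Library"  # one physical copy
      #     code: "WRK.ITM"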
      # ----- MANUSCRIPT (always Item level) -----
      MANUSCRIPT:
        code: "WRK.MSS"
        definition: |
          Handwritten documents and manuscripts. Manuscripts are unique
          Items (FRBR Item level), not reproducible Manifestations.
        examples:
          - "Codex Manesse"
          - "the Book of Kells"
          - "Leonardo's notebooks"
          - "the Dead Sea Scrolls"
        ontology_class: "crm:E22_Human-Made_Object"
        alternative_classes:
          - "frbroo:F5_Item"
          - "rico:Record"
        note: |
          Manuscripts are physical objects that embody intellectual content:
          - Use crm:E22_Human-Made_Object for the physical codex/scroll
          - Use frbroo:F1_Work for the textual content
          - Link via frbroo:R4_comprises_carriers_of
      # ----- ARCHIVAL RECORDS -----
      ARCHIVAL:
        code: "WRK.ARC"
        definition: |
          Archival records, documents, and files. May be single items or
          aggregations (fonds, series).
        examples:
          - "deed of sale dated 1650"
          - "birth certificate of Rembrandt"
          - "notarial act NL-SAA 5075/2135"
          - "the VOC archives"
        ontology_class: "rico:Record"
        alternative_classes:
          - "rico:RecordSet" # For aggregations (fonds, series)
          - "crm:E31_Document"
        rico_levels:
          fonds: "rico:RecordSet with rico:hasRecordSetType 'fonds'"
          series: "rico:RecordSet with rico:hasRecordSetType 'series'"
          file: "rico:RecordSet with rico:hasRecordSetType 'file'"
          item: "rico:Record"
        note: |
          RiC-O provides archival hierarchy:
          - Fonds → Series → File → Item
          - Use rico:isOrWasIncludedIn for containment
          - Use rico:hasOrHadIdentifier for call numbers
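      # Sketch (illustrative, not part of the schema): the Fonds → Series →
      # File → Item chain written as Turtle. All ex: IRIs, including the
      # identifier node, are hypothetical placeholders; only the rico: terms
      # are taken from this module.
      #
      #   ex:item_1    a rico:Record ;
      #                rico:isOrWasIncludedIn  ex:file_1 ;
      #                rico:hasOrHadIdentifier ex:call_number_1 .
      #   ex:file_1    a rico:RecordSet ; rico:isOrWasIncludedIn ex:series_1 .
      #   ex:series_1  a rico:RecordSet ; rico:isOrWasIncludedIn ex:fonds_1 .
      #   ex:fonds_1   a rico:RecordSet .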
      # ----- DIGITAL RESOURCES -----
      WEBSITE:
        code: "WRK.WEB"
        definition: "Web pages and online resources as intellectual works"
        examples:
          - "the museum's online collection"
          - "the Wikipedia article on Rembrandt"
          - "the Rijksmuseum website"
        ontology_class: "schema:WebPage"
        alternative_classes:
          - "schema:WebSite"
          - "fabio:WebPage" # FRBR-aligned Bibliographic Ontology
        note: |
          Websites are Manifestations of digital Works.
          Archived versions (Internet Archive) are different Manifestations
          of the same Expression.
      URL:
        code: "WRK.URL"
        definition: "URLs as identifiers for digital resources"
        examples:
          - "www.rijksmuseum.nl"
          - "https://www.britishmuseum.org"
          - "https://doi.org/10.1000/xyz123"
        ontology_class: "schema:URL"
        note: |
          URLs are locators, not works themselves.
          Links to GROUP hypernym via schema:url.
          DOIs are persistent identifiers for Works/Expressions.
      # ----- CONTACT INFORMATION -----
      EMAIL:
        code: "WRK.EML"
        definition: "Email addresses as contact identifiers"
        examples:
          - "info@rijksmuseum.nl"
          - "contact@museum.org"
        ontology_class: "schema:email"
        note: |
          Email addresses are identifiers, not intellectual works.
          Included here for practical extraction convenience.
          Links to GROUP hypernym via schema:email.
      SOCIAL:
        code: "WRK.SOC"
        definition: "Social media handles and profiles"
        examples:
          - "@rijksmuseum"
          - "facebook.com/britishmuseum"
          - "Instagram: @vangoghmuseum"
        ontology_class: "foaf:OnlineAccount"
        foaf_mapping:
          class: "foaf:OnlineAccount"
          properties:
            account_name: "foaf:accountName"
            service_homepage: "foaf:accountServiceHomepage"
        note: |
          Social accounts are identifiers for agent presence on platforms.
          Links to GROUP hypernym via foaf:account.
      # ----- CITATIONS -----
      CITATION:
        code: "WRK.CIT"
        definition: "Bibliographic citations and references"
        examples:
          - "Smith (2020)"
          - "cf. Van Gogh Letters, no. 532"
          - "[ibid., p. 42]"
        ontology_class: "dct:bibliographicCitation"
        alternative_classes:
          - "cito:Citation" # Citation Typing Ontology
        note: |
          Citations are references to Works/Expressions/Manifestations.
          CiTO (Citation Typing Ontology) provides citation intent types:
          - cito:cites, cito:critiques, cito:supports, etc.
    inclusion_rules:
      - id: "WRK_INC001"
        rule: "Tag work titles at appropriate FRBR level"
        examples:
          - "'Hamlet' (Work level - the concept)"
          - "'2023 Penguin edition' (Manifestation level)"
          - "'the British Library's copy' (Item level)"
      - id: "WRK_INC002"
        rule: "Tag URLs and identifiers as complete strings"
        examples:
          - "info@rijksmuseum.nl"
          - "https://www.rijksmuseum.nl"
          - "doi:10.1000/xyz123"
      - id: "WRK_INC003"
        rule: "Tag archival references with call numbers when present"
        examples:
          - "NL-SAA 5075/2135"
          - "KB 128 A 14"
      - id: "WRK_INC004"
        rule: "Tag manuscripts as Items (unique objects)"
        examples:
          - "Codex Manesse"
          - "the Book of Kells"
    exclusion_rules:
      - id: "WRK_EXC001"
        rule: "Do NOT tag generic work type references"
        examples:
          - "the document (no title)"
          - "a manuscript (not named)"
          - "some book"
      - id: "WRK_EXC002"
        rule: "Do NOT tag UI navigation elements"
        examples:
          - "Click here"
          - "Read more"
          - "Download PDF"
      - id: "WRK_EXC003"
        rule: "Do NOT confuse physical carrier with intellectual work"
        examples:
          - "the book (physical object → use THING if just the object)"
          - "the paper (material, not work)"
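    # Sketch (illustrative, not part of the schema): how the inclusion and
    # exclusion rules above might play out on individual mentions. The keys
    # `text` and `code` are hypothetical; a null code means "leave untagged".
    #
    #   - text: "the Book of Kells"
    #     code: "WRK.MSS"   # WRK_INC004: a named manuscript is a unique Item
    #   - text: "https://www.rijksmuseum.nl"
    #     code: "WRK.URL"   # WRK_INC002: tag the complete string
    #   - text: "some book"
    #     code: null        # WRK_EXC001: generic reference, no title
    #   - text: "Read more"
    #     code: null        # WRK_EXC002: UI navigation element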