glam/tests/fixtures/publications/journal_article_example.yaml
kempersc e5a532a8bc Add comprehensive tests for NLP institution extraction and RDF partnership integration
- Introduced `test_nlp_extractor.py` with unit tests for the InstitutionExtractor, covering various extraction patterns (ISIL, Wikidata, VIAF, city names) and ensuring proper classification of institutions (museum, library, archive).
- Added tests for extracted entities and result handling to validate the extraction process.
- Created `test_partnership_rdf_integration.py` to validate the end-to-end process of extracting partnerships from a conversation and exporting them to RDF format.
- Implemented tests for temporal properties in partnerships and ensured compliance with W3C Organization Ontology patterns.
- Verified that extracted partnerships are correctly linked with PROV-O provenance metadata.
2025-11-19 23:20:47 +01:00

107 lines
3.7 KiB
YAML

---
# Example: Journal article from Semantic Web Journal
# Reference: Peroni, S., & Shotton, D. (2018). The SPAR Ontologies.
# Semantic Web Journal, 9(2), 119-136. https://doi.org/10.3233/SW-170277
- publication_id: https://doi.org/10.3233/SW-170277
title: "The SPAR Ontologies"
publication_type: JOURNAL_ARTICLE
publishing_status: PUBLISHED
authors:
- person_id: https://orcid.org/0000-0003-0530-4305
person_name: "Silvio Peroni"
orcid: "0000-0003-0530-4305"
affiliation:
organization_id: https://ror.org/01111rn36
organization_name: "University of Bologna"
organization_type: "University"
location: "Bologna, IT"
ror_id: "https://ror.org/01111rn36"
- person_id: https://orcid.org/0000-0001-5506-523X
person_name: "David Shotton"
orcid: "0000-0001-5506-523X"
affiliation:
organization_id: https://ror.org/052gg0110
organization_name: "University of Oxford"
organization_type: "University"
location: "Oxford, GB"
ror_id: "https://ror.org/052gg0110"
published_in: https://w3id.org/heritage/journal/swj
publication_date: "2018-01-01"
publication_year: 2018
volume: "9"
issue: "2"
page_range: "119-136"
doi: "10.3233/SW-170277"
url: "http://www.semantic-web-journal.net/content/spar-ontologies-1"
abstract: >-
The SPAR Ontologies (Semantic Publishing and Referencing Ontologies) are
a suite of orthogonal and complementary ontologies that enable all aspects
of the publishing process to be described in machine-readable metadata
statements, encoded using RDF. They enable the creation of semantic
descriptions of publications, such as journal articles and conference papers.
keywords:
- "SPAR Ontologies"
- "FaBiO"
- "CiTO"
- "BiRO"
- "DoCO"
- "Semantic Publishing"
- "Bibliographic Ontologies"
open_access_status: FULLY_OPEN_ACCESS
citations:
- citation_id: https://w3id.org/heritage/citation/sw-170277-cites-fabio
citing_work: https://doi.org/10.3233/SW-170277
cited_work: https://doi.org/10.3233/SW-2011-0057
citation_type: CITES_AS_AUTHORITY
citation_context: "FaBiO is the foundational bibliographic ontology in the SPAR suite"
- citation_id: https://w3id.org/heritage/citation/sw-170277-cites-cito
citing_work: https://doi.org/10.3233/SW-170277
cited_work: https://doi.org/10.1186/2041-1480-1-S1-S6
citation_type: CITES_AS_AUTHORITY
citation_context: "CiTO enables semantic description of citation relationships"
document_sections:
- section_id: https://w3id.org/heritage/section/sw-170277-intro
section_type: INTRODUCTION
section_title: "Introduction"
section_order: 1
- section_id: https://w3id.org/heritage/section/sw-170277-methods
section_type: METHODS
section_title: "The SPAR Ontologies Suite"
section_order: 2
- section_id: https://w3id.org/heritage/section/sw-170277-discussion
section_type: DISCUSSION
section_title: "Use Cases and Applications"
section_order: 4
- section_id: https://w3id.org/heritage/section/sw-170277-conclusion
section_type: CONCLUSION
section_title: "Conclusions and Future Work"
section_order: 5
- section_id: https://w3id.org/heritage/section/sw-170277-refs
section_type: BIBLIOGRAPHY
section_title: "References"
section_order: 6
provenance:
data_source: CSV_REGISTRY
data_tier: TIER_1_AUTHORITATIVE
extraction_date: "2025-11-09T15:00:00Z"
extraction_method: "Manual curation from Semantic Web Journal metadata"
confidence_score: 1.0
verified_date: "2025-11-09T00:00:00Z"
verified_by: "GLAM bibliographic team"