Finnish National Bibliography Fennica as Linked Data
Osma Suominen
SWIB17, Hamburg 6 Dec 2017
Finnish National Bibliography Fennica as Linked Data Osma Suominen - - PowerPoint PPT Presentation
Finnish National Bibliography Fennica as Linked Data Osma Suominen SWIB17, Hamburg 6 Dec 2017 NATIONAL BIBLIOGRAPHY with apologies to Scott Adams Why? 1. Making our data more visible, also internationally 2. Improving the quality and
SWIB17, Hamburg 6 Dec 2017
NATIONAL BIBLIOGRAPHY
with apologies to Scott Adams
bib record bib record bib record bib record auth record auth record auth record bib record bib record auth record auth record auth record
Work Instance Person Subject
1M bib records 125k person names 40k corporate names 35k subjects (YSA) bib record bib record
Place Organization
Work Instance Person Subject
Image credit: MaryMaking blog bib record bib record bib record bib record auth record auth record auth record bib record bib record auth record auth record auth record 125k person names 40k corporate names 35k subjects (YSA) bib record bib record 1M bib records
As seen in: SWIB16 talk DCMI webinar
“From MARC silos to Linked Data silos”
with separate Works and Instances like BIBFRAME, as enabled by the bibliographic extensions because it allows us to describe our resources from a common-sense, Web user perspective (and we get a metadata haircut for free!) Special thanks to Richard Wallis for help with applying schema.org!
MARCXML BIBFRAME RDF Schema.org RDF Linked to external URIs MARC / Aleph seq With deduplicated works Work keys With deduplicated agents Agent keys
Convert & clean using Catmandu Convert using marc2bibframe2 Convert to Schema.org using SPARQL CONSTRUCT
YSA subjects YSO subjects Corporate names RDA Media, Content, Carrier
Link against controlled vocabularies using SPARQL Generate work keys for merging using SPARQL Merge works using SPARQL Merge agents (person, org) using SPARQL RDF store
https://github.com/NatLibFi/bib-rdf-pipeline
Data dump downloads
RDF HDT
Jena Fuseki bib-lod-ui Flask app HTML+JSON-LD OpenSearch API Linked Data RDF
RDF store
RDF N-Triples MARC records
Linked Data Fragments server SPARQL LDF
http://data.nationallibrary.fi/bib/me/W00009584100
Spelunking UI...maybe?
Not so easy in practice. Lots of problems in the metadata that cause inconsistencies in the output.
Work Instance Person Subject Place Organization
LCSH Finnish Place Name Registry Wikidata
Work Instance Person Subject Place Organization
LCSH Finnish Place Name Registry Wikidata WorldCat Other national libraries WorldCat Works LIBRIS XL ISNI VIAF ISNI Wikidata
...but we rely on conversion of MARC records that change all the time!
1. Findable: URIs as identifiers, with rich metadata 2. Accessible: URI lookup, SPARQL and LDF endpoints, downloadable data dumps 3. Interoperable: RDF represenation using Schema.org and a little bit of RDAu 4. Reusable: CC0 license. Entities that are references also from other metadata
1. Enriching and cleaning the RDF data, e.g. using subclasses like Map 2. More links to other Linked Data sets 3. Expanding to new data sets: Viola discography, Arto article database
The Finnish Declaration of Independence was adopted by the Parliament of Finland on 6 December 1917
http://data.nationallibrary.fi - @NatLibFiData Code: https://github.com/NatLibFi/bib-rdf-pipeline https://github.com/NatLibFi/bib-lod-ui These slides: http://tinyurl.com/fennica-ld