“My colleagues Jean Godby, Karen Smith-Yoshimura, and Bruce Washburn, along with a host of partners, have just released Creating Library Linked Data with Wikibase: Lessons Learned from Project Passage, a fascinating account of their experiences working with a customized instance of Wikibase to create resource descriptions in the form of linked data. In the spirit of their report, I’d like to offer a modest yet illustrative use case showing how access to the relationships and properties of the linked data in another Wikibase environment – Wikidata – smoothed the way for OCLC Research’s recent study of the Canadian presence in the published record.
Maple Leaves: Discovering Canada Through the Published Record is the latest in a series of OCLC Research studies that explore national contributions to the world’s accumulated body of published materials. A national contribution is defined as materials published in, about, and/or by the people of that country. The last category presents a special challenge: how to assemble a list of entities – people and organizations – associated with a particular country from which authors, musicians, film makers, and other creators of published works can be identified?…”
Abstract: Library catalogues may be connected to the linked data cloud through various types of thesauri. For name authority thesauri in particular I would like to suggest a fundamental break with the current distributed linked data paradigm: to make a transition from a multitude of different identifiers to using a single, universal identifier for all relevant named entities, in the form of the Wikidata identifier. Wikidata (https://wikidata.org) seems to be evolving into a major authority hub that is lowering barriers to access the web of data for everyone. Using the Wikidata identifier of notable entities as a common identifier for connecting resources has significant benefits compared to traversing the ever-growing linked data cloud. When the use of Wikidata reaches a critical mass, for some institutions, Wikidata could even serve as an authority control mechanism.
“There has been a growing interest from libraries and other cultural heritage organizations in Wikidata. Of the many potential uses for Wikidata, one emerging area of focus has been using Wikidata as a hub for institutional identifiers. Many organizations maintain unique identifiers for people, subjects, works, etc. If these IDs are all added to Wikidata then you could seamlessly access data from dozens of sources if you know the Wikidata ID. If we return to the author example from above you can see the Wikidata page for Virginia Woolf has ninety external links to various organizations. Many of these are national libraries, museums, and other cultural heritage institutions including the Library of Congress.
The Library of Congress maintains many authority files that are widely used. Two of the largest are the Name Authority File (NAF) and Library of Congress Subject Headings (LCSH). The Network Development and MARC Standard Office maintains the Linked Open Data version of these files at the site id.loc.gov. For example, authority data for Virginia Woolf is located at //id.loc.gov/authorities/names/n79041870. This data ensures that items being cataloged are all referencing the same person. One of the goals of linked data is to make sure you link out to other’s data. With id.loc.gov we maintain links to many other institutions authority files including the French and German national libraries, other government services such as Department of Agriculture and other cultural institutions like the Getty Museum. You’ll notice these links on the page and are also present in the machine readable data. With the potential of Wikidata being a hub of identifiers we wanted to also include links in our authority record out to Wikidata….
Using records from the Library of Congress Prints & Photographs Division I built an interface that combines Library of Congress collection items with Wikidata information. This tool demonstrates the possibilities in connecting these two knowledge systems….”
Abstract: Knowledge workers like researchers, students, journalists, research evaluators or funders need tools to explore what is known, how it was discovered, who made which contributions, and where the scholarly record has gaps. Existing tools and services of this kind are not available as Linked Open Data, but Wikidata is. It has the technology, active contributor base, and content to build a large-scale knowledge graph for scholarship, also known as WikiCite. Scholia visualizes this graph in an exploratory interface with profiles and links to the literature. However, it is just a working prototype. This project aims to “robustify Scholia” with back-end development and testing based on pilot corpora. The main objective at this stage is to attain stability in challenging cases such as server throttling and handling of large or incomplete datasets. Further goals include integrating Scholia with data curation and manuscript writing workflows, serving more languages, generating usage stats, and documentation.
“In a new Association of Research Libraries (ARL)white paper, a task force of expert Wikidata users recommend a variety of ways for librarians to use the open knowledge base in advancing global discovery of their collections, faculty, and institutions.
Librarians are using Wikidata’s structured data about people, topics, concepts, and objects to populate open source faculty profiling systems, to enhance bibliographic records in online catalogs, and to collaborate with communities on meaningful, culturally relevant, descriptive metadata for special collections and archives. The white paper, circulated for public comment in fall 2018, contains examples of Wikidata applications, screenshots, and recommendations for involvement on an individual or organizational level….”
“[Recommendations] For individual librarians: • Use and experiment with Wikidata, for example: • Contribute local name authorities to Wikidata, particularly for underrepresented creators and organizations. • Add institutional holdings to existing Wikidata items using the “archives at”13 property.14 • Create items for faculty in an institution. • Explore and experiment with Wikidata editing tools such as Mix’n’match, batch uploading, and database dumps.15 • Create a “hub of hubs” for authority controls, metadata vocabularies, and other data sources, to facilitate the connection between existing external metadata sources and Wikidata. • Get involved in the greater Wikimedia community by holding edit-a-thons and workshops, participating in discussions on email lists and in social media channels, and by joining the Wikimedia and Libraries User Group.16 • Advocate within your research communities and organizations for open, compatible licensing of data sets so that they can be incorporated into Wikidata.17 …
[Recommendations] For library leadership and organizations: • Give staff time to experiment and contribute to Wikidata, including by determining tasks that can be added to existing positions and workflows, or incorporating Wikidata participation into existing incentive and reward structures. • Expand capacity with Wikimedians in Residence or fellowships. • Inform and advocate with your patrons/scholars/research community to use LOD for their research projects that involve data/data sets. • Make data sets and scholarship from existing institutional projects visible on Wikidata as part of a global network of knowledge. Large-scale cooperative projects like Social Networks and Archival Context (SNAC),18 and VIAF, the Virtual International Authority File,19 for example, have added identifiers to Wikidata. • Provide linked data support to researchers, academics, and other patrons wishing to expand the context of their own research and data or to develop web applications representing knowledge from their field. • Engage scholars and communities working in underrepresented knowledge areas to help extend existing sets of knowledge in Wikidata. • Explore and advocate for the use of Wikidata identifiers (“Q IDs”) or equivalent uniform resource identifiers (URIs) in library and archival systems, repositories, and platforms. • Consider the use of Wikibase as a LOD store for local identifiers and authority-like data….”
Abstract: Interdisciplinary collaborations and data sharing are essential to addressing the long history of human-environmental interactions underlying the modern biodiversity crisis. Such collaborations are increasingly facilitated by, and dependent upon, sharing open access data from a variety of disciplinary communities and data sources, including those within biology, paleontology, and archaeology. Significant advances in biodiversity open data sharing have focused on neontological and paleontological specimen records, making available over a billion records through the Global Biodiversity Information Facility. But to date, less effort has been placed on the integration of important archaeological sources of biodiversity, such as zooarchaeological specimens. Zooarchaeological specimens are rich with both biological and cultural heritage data documenting nearly all phases of human interaction with animals and the surrounding environment through time, filling a critical gap between paleontological and neontological sources of data within biodiversity networks. Here we describe technical advances for mobilizing zooarchaeological specimen-specific biological and cultural data. In particular, we demonstrate adaptations in the workflow used by biodiversity publisher VertNet to mobilize Darwin Core formatted zooarchaeological data to the GBIF network. We also show how a linked open data approach can be used to connect existing biodiversity publishing mechanisms with archaeoinformatics publishing mechanisms through collaboration with the Open Context platform. Examples of ZooArchNet published datasets are used to show the efficacy of creating this critically needed bridge between biological and archaeological sources of open access data. These technical advances and efforts to support data publication are placed in the larger context of ZooarchNet, a new project meant to build community around new approaches to interconnect zoorchaeological data and knowledge across disciplines.
“In June 2018, the Association of Research Libraries (ARL) charged a task force to look at Wikidata. The task force emerged from several years of discussion between ARL and the Wikimedia Foundation on where the two communities can effectively collaborate. The focus on Wikidata and Wikibase came from two points of alignment in particular: interest in linked open data for both library discovery systems and Wikipedia, and advancing a diversity and inclusion agenda in the cultures of both libraries and Wikimedia….”
“As I analyze the responses to the OCLC Research 2018 International Linked Data Survey for Implementers, I’m looking out for significant differences with the responses to the previous, 2015 survey. One change that jumped out at me was the surge of using Wikidata as a linked data source consumed by linked data projects or services.”
“This archive contains the dump of the OpenCitations Corpus (OCC, http://opencitations.net) triplestore (Blazegraph, https://www.blazegraph.com/, licensed in GPLv2) containing all the data of the corpus, and created regularly every month.”