There’s A PID For That! Next Steps in Establishing a National PID Strategy – Jisc scholarly communications

“A stakeholder group was therefore formed earlier this year, with representatives from all disciplines and sectors — funders, HEIs, infrastructure providers, libraries, publishers, researchers, research managers, and more. At an initial meeting of this group in April, participants discussed the five persistent identifiers (PIDs) that have been deemed high priority for improving access to UK research. These are ORCID iDs for people, Crossref and DataCite DOIs for outputs, Crossref grant DOIs, ROR identifiers for organisations, and RAiDs for projects.  This was followed by five focus group meetings during May and June, each focused on one of the priority PIDs….”

The Data Nutrition Project

“A “nutrition label” for datasets.

The Data Nutrition Project aims to create a standard label for interrogating datasets for measures that will ultimately drive the creation of better, more inclusive algorithms.

Our current prototype includes a highly-generalizable interactive data diagnostic label that allows for exploring any number of domain-specific aspects in datasets. Similar to a nutrition label on food, our Dataset Nutrition Label aims to highlight the key ingredients in a dataset such as meta-data and populations, as well as unique or anomalous features regarding distributions, missing data, and comparisons to other ‘ground truth’ datasets. We are currently testing our label on several datasets, with an eye towards open sourcing this effort and gathering community feedback.

The design utilizes a ‘modular’ framework that can be leveraged to add or remove areas of investigation based on the domain of the dataset. For example, Dataset Nutrition Labels for data about people may include modules about the representation of race and gender, while Nutrition Labels for data about trees may not require that module.

To learn more, check out our live prototype built on the Dollars for Docs dataset from ProPublica. A first draft of our paper can be found here….”

Blog – Europe PMC: Announcing the new version of SciLite – the Europe PMC tool for highlighting annotations

“This month, Europe PMC released a new version of SciLite, a powerful tool for highlighting annotations in life sciences publications. SciLite is powered by the Europe PMC annotation platform via the open annotation API, which provides access to over 1.3 billion annotations. Highlighting annotations in the text enables users to easily scan the article and locate key biological entities, such as genes/proteins, accession numbers, protein interactions, diseases, gene-disease relationship and more….”

4Science, share your knowledge: research data and digital libraries

“Our mission is to support universities, research and cultural institutes in managing the different phases of a digital project.

To successfully fulfill this mission 4Science chose DSpace, the most widely used repository software in the world.

As a DSpace Registered Service Provider and thanks to our Team of experts, that includes 2 DSpace Committers, we provide any kind of support to your repository.

4Science is constantly working with the DSpace Community on improving the platform, developing new functionalities and add-on modules and implementing compliancy with international standards.

Thanks to our natural inclination towards innovation and our deep understanding of the Research Data & Information and the Cultural Heritage domains, we developed two out-of-the-box configurations of DSpace that meet the requirements of these two areas….”

Sharing and organizing research products as R packages | SpringerLink

Abstract:  A consensus on the importance of open data and reproducible code is emerging. How should data and code be shared to maximize the key desiderata of reproducibility, permanence, and accessibility? Research assets should be stored persistently in formats that are not software restrictive, and documented so that others can reproduce and extend the required computations. The sharing method should be easy to adopt by already busy researchers. We suggest the R package standard as a solution for creating, curating, and communicating research assets. The R package standard, with extensions discussed herein, provides a format for assets and metadata that satisfies the above desiderata, facilitates reproducibility, open access, and sharing of materials through online platforms like GitHub and Open Science Framework. We discuss a stack of R resources that help users create reproducible collections of research assets, from experiments to manuscripts, in the RStudio interface. We created an R package, vertical, to help researchers incorporate these tools into their workflows, and discuss its functionality at length in an online supplement. Together, these tools may increase the reproducibility and openness of psychological science.


Being Fair about the Design of FAIR Data Standards | Zenodo

Abstract:  Since 2014 the FAIR data movement has been rapidly altering the landscape of data sharing and re-use. Support for the FAIR movement has seen the evolution of disciplinary-specific standards to foster data that are “finable, accessible, interoperable and reusable.” While these exciting developments should not be minimised, it is important to interrogate how these standards are set. Key questions to ask include how representation in standard setting communities is addressed; what infrastructures and resources these emergent standards are reliant on; and how standards dictate specific interpretations of “value” and “valuable data.” Asking such questions introduces a needed reflexivity into FAIR discussions, as standard setters interrogate what data practices commit present—and future—researchers to.

Celebrating the Five Year Anniversary of the UK ORCID Consortium – Jisc scholarly communications

“August 2020 sees the 5-year anniversary of the UK ORCID consortium. The evolution of ORCID and the UK Consortium can be viewed as a change programme. If we look back and reflect, what have been the drivers for change and what improvements can we celebrate?…

The range and complexity of outputs that ORCID identifiers are associated with has expanded as well, as new systems and ways of capturing information emerge – especially as we move to a data rich, information-centric open science model of scholarship. As such, the power of interconnected PIDs with the personal identifier of ORCID ID embedded, gives deeply intertwined and more useful information. These potential benefits can be realised as the various systems and identifiers mature and adoption improves. Examples of associations with unique persistent person identities are: works (e.g. works identified with a DOI); organisations (identified, for example with a ROR id); affiliations and workflows which can be examined via the events captured in PID Graphs. A project identifier such as RAiD allows you to associate people, data, works and funding with a long term effort, track the impact of efforts over the long term, and focus on the narrative, rather than a particular researcher or funding stream. This evolving landscape of interconnection allows us to build better, more effective scholarly machines, to do open research on a better, more cohesive and collaborative scale….”

Data Sharing in a Time of Pandemic: Patterns

“The resulting recommendations and guidelines on data sharing,


 published in final form on Jun 30, 2020, are a thorough and comprehensive overview of how to share data (and research software) from multiple disciplines to inform response to a pandemic, along with guidelines and recommendations on data sharing under the present COVID-19 circumstances. It is a long document (more than 140 pages) but is very thorough and well structured….”


“FAIRsFAIR – Fostering Fair Data Practices in Europe – aims to supply practical solutions for the use of the FAIR data principles throughout the research data life cycle. Emphasis is on fostering FAIR data culture and the uptake of good practices in making data FAIR. FAIRsFAIR will play a key role in the development of global standards for FAIR certification of repositories and the data within them contributing to those policies and practices that will turn the EOSC programme into a functioning infrastructure.

In the end, FAIRsFAIR will provide a platform for using and implementing the FAIR principles in the day to day work of European research data providers and repositories. FAIRsFAIR will also deliver essential FAIR dimensions of the Rules of Participation (RoP) and regulatory compliance for participation in the EOSC. The EOSC governance structure will use these FAIR aligned RoPs to establish whether components of the infrastructure function in a FAIR manner….”