OpenAIRE Monitoring Services – EC FP7 & H2020 and other national funders (presentation at the Open Science FAIR 2017 conference)

“Presentation at the FOSTERplus project workshop on “fostering the practical implementation of open science in horizon 2020 and beyond”, at the Open Science FAIR conference in Athens, September, 7 2017.”

Science Beam – using computer vision to extract PDF data | Labs | eLife

“There’s a vast trove of science out there locked inside the PDF format. From preprints to peer-reviewed literature and historical research, millions of scientific manuscripts today can only be found in a print-era format that is effectively inaccessible to the web of interconnected online services and APIs that are increasingly becoming the digital scaffold of today’s research infrastructure….Extracting key information from PDF files isn’t trivial. …It would therefore certainly be useful to be able to extract all key data from manuscript PDFs and store it in a more accessible, more reusable format such as XML (of the publishing industry standard JATS variety or otherwise). This would allow for the flexible conversion of the original manuscript into different forms, from mobile-friendly layouts to enhanced views like eLife’s side-by-side view (through eLife Lens). It will also make the research mineable and API-accessible to any number of tools, services and applications. From advanced search tools to the contextual presentation of semantic tags based on users’ interests, and from cross-domain mash-ups showing correlations between different papers to novel applications like ScienceFair, a move away from PDF and toward a more open and flexible format like XML would unlock a multitude of use cases for the discovery and reuse of existing research….We are embarking on a project to build on these existing open-source tools, and to improve the accuracy of the XML output. One aim of the project is to combine some of the existing tools in a modular PDF-to-XML conversion pipeline that achieves a better overall conversion result compared to using individual tools on their own. In addition, we are experimenting with a different approach to the problem: using computer vision to identify key components of the scientific manuscript in PDF format….To this end, we will be collaborating with other publishers to collate a broad corpus of valid PDF/XML pairs to help train and test our neural networks….”

Machine accessibility of Open Access scientific publications from publisher systems via ResourceSync

“Poster presented at OAI10, University of Geneva, 21 -23 June 2017.”

 

“The number of scholarly research papers being published is gradually growing; it is estimated that approximately 1.5 million of research papers are produced each year and about 4% of them are offered via Open Access journals[1]. The high volume of scientific papers introduces new opportunities for content discoverability and facilitates a growth in various scientific disciplines via text and data mining (TDM)[2]. One of the greatest barriers to TDM is caused by the difficulty of programmatically accessing open access content from a wide range of publishers[3]…”

Open Academic Society

“Open Academic Society is formed by a group of institutions to create a shared, open and expanding knowledge graph of research and education-focused entities and relationships. With the initial contributions from the Microsoft Academic by Microsoft Research and the AMiner graph from Tsinghua University, the reach and depth of the knowledge graph will come through the Society members’ contributions. The data set is available under a freely accessible cloud API, and the society will organize workshops, challenges, and data sharing activities for the benefit of the larger computer science community….”

LexisNexis Announces Acquisition of Ravel Law

“Finally, Ravel Law’s access to the Harvard case law content and PDF images of original case opinions will enrich the already expansive case law collection available from LexisNexis. LexisNexis is committed to continuing Ravel Law’s open access to this historical collection, giving the American public, and anyone with an internet connection, access to this vital collection of legal information….”

LexisNexis acquires case analytics firm Ravel Law

“LexisNexis Legal & Professional has acquired legal research and litigation analytics firm Ravel Law, and will integrate Ravel’s data visualization and profiling technology into LexisNexis services….In the next few months, the team will complete its project with Harvard University to digitize the school’s case law library, and Lewis notes that LexisNexis will consider ways to support the effort. “We’ll continue to provide public access and expand it with APIs,” Lewis says, referring to the application program interfaces that developers use to distribute information….”

“How automated workflows helped us ingest 600 faculty publications in t” by Shilpa Rele and Jessea Young

Abstract: “Conducting copyright clearance and ingesting appropriate versions of faculty publications can be a labor intensive and time consuming process. At Loyola Marymount University (LMU), a medium-size, private institution, the Digital Library Program (DLP) had been conducting copyright clearance one publication at a time. This meant that it took an enormous amount of time from start to finish to review and process the list of publications on a given faculty member’s CV. In October 2016, the Digital Program Librarian learned about the automated workflow developed by librarians at University of North Texas and decided to give it a try. At this time, the DLP hired a Library Assistant who then began exploring and experimenting with this automated workflow. The goal of such experimentation was to increase efficiency in our processes to ingest more faculty publications in LMU’s institutional repository.

In this session, we will share information about our workflows and tools used to manage our various processes. […]”