S2ORC: The Semantic Scholar Open Research Corpus

“S2ORC is a general-purpose corpus for NLP and text mining research over scientific papers.

We’ve curated a unified resource that combines aspects of citation graphs (i.e. rich paper metadata, abstracts, citation edges) with a full text corpus that preserves important scientific paper structure (i.e. sections, inline citation mentions, references to tables and figures).
Our corpus covers 136M+ paper nodes with 12.7M+ full text papers and connected by 467M+ citation edges by unifying data from many different sources covering many different academic disciplines and identifying open-access papers using services like Unpaywall. …”

Exploring the COVID-19 network of scientific research with SciSight | AI2 Blog

“Three months into the coronavirus pandemic, the world’s scientific knowledge of the SARS-CoV-2 virus is rapidly expanding. Reports of potential vaccines and treatments sprout up almost daily. Thousands of papers have been pouring into Semantic Scholar’s COVID-19 Open Research Dataset (CORD-19), a collection of nearly 60,000 scientific publications of potential relevance to the topic, both historical and cutting-edge….

To help accelerate scientific discovery with visualization, last month we launched SciSight, a framework of exploratory search and visualization tools for the COVID-19 literature. The first version of SciSight supported exploring associations between biomedical concepts appearing in the literature. In preliminary user interviews, the tool was found helpful in discovery-oriented search. We now release two important updates of SciSight….”

Exploring the COVID-19 network of scientific research with SciSight | AI2 Blog

“Three months into the coronavirus pandemic, the world’s scientific knowledge of the SARS-CoV-2 virus is rapidly expanding. Reports of potential vaccines and treatments sprout up almost daily. Thousands of papers have been pouring into Semantic Scholar’s COVID-19 Open Research Dataset (CORD-19), a collection of nearly 60,000 scientific publications of potential relevance to the topic, both historical and cutting-edge….

To help accelerate scientific discovery with visualization, last month we launched SciSight, a framework of exploratory search and visualization tools for the COVID-19 literature. The first version of SciSight supported exploring associations between biomedical concepts appearing in the literature. In preliminary user interviews, the tool was found helpful in discovery-oriented search. We now release two important updates of SciSight….”

The rise of the “open” discovery indexes? Lens.org, Semantic Scholar and Scinapse | Musings about librarianship oa.scite

“In this blog post, I will talk specifically on a very important source of data used by Academic Search engines – Microsoft Academic Graph (MAG) and do a brief review of four academic search engines – Microsoft Academic, Lens.org, Semantic Scholar and Scinapse ,which uses MAG among other sources….

We live in a time, where large (>50 million) Scholarly discovery indexes are no longer as hard to create as in the past, thanks to the availability of freely available Scholarly article index data like Crossref and MAG.”

SUPP.AI by AI2

“Dietary and herbal supplements are popular but unregulated. Supplements can interact or interfere with the action of prescription or over-the-counter medications. Currently, it is difficult to find accurate and timely scientific evidence for these interactions.

To solve this problem, Supp.AI automatically extracts evidence of supplement and drug interactions from the scientific literature and presents them here….

To find out more about this work, please read our publication….

Supp.AI is a free service of the non-profit Allen Institute for AI….”

Could This Search Engine Save Your Life? – The Chronicle of Higher Education

One of the Allen Institute’s priorities is an academically oriented search engine, established in 2015, called Semantic Scholar (slogan: “Cut through the clutter”). The need is great, with more than 34,000 peer-reviewed journals publishing 2.5 million articles a year. “What if a cure for an intractable cancer is hidden within the tedious reports on thousands of clinical studies?,” Etzioni once said.

Although Semantic Scholar has focused so far on computer and biomedical sciences, Etzioni says that the engine will soon push into the social sciences and the humanities as well. The Chronicle spoke with him about information overload, impact factors’ imperfect inevitability, and the promise and perils of AI….”