“Stanford Libraries seeks a full-time Director for the Office of Scholarly Communications, a unit that will support the Open Access Policy passed by Stanford University’s Faculty Senate in November 2020. With the passage of the OA Policy, the entire focus of which is academic articles, the University is poised to quickly move forward in support of open scholarship, increased access to scholarly information, and transforming the academic article publishing landscape. …”
“A woman who has in the past been described as “the spiritual successor to Aaron Swartz” – who was a US web pioneer hounded to suicide by US prosecutors for making academic research available to everyone – has now learned the FBI is investigating her….
Elbakyan included a screenshot of the conveniently “no-reply” email in her tweet, where Apple informed her that it in February 2019 received a request from the FBI for data pertaining to her account, and that the nature of the request was such that it only allowed the tech giant to notify the user with delay.
Apple also told Elbakyan that the requested data had been handed over, and washed its hands off the whole thing by advising the programmer that if she wanted to know more about the request and what kind of information the FBI wanted – she should talk to the FBI….”
“The Librarian for Scholarly Communications supports Open research, communication, and data on the Vanderbilt University campus through education, training, and analysis. As a member of the Digital Scholarship and Communications (DiSC) Office, the Librarian for Scholarly Communications consults with faculty, students, and staff about emergent scholarly communications models, analyzes faculty publishing patterns and impact metrics, and evaluates and advises on research communication tools. The Librarian for Scholarly Communications also manages the institutional repository, coordinating the ingestion workflow in dialogue with liaison librarians in the divisional libraries, and with LTDS in software development….”
Abstract: DataONE, funded from 2009-2019 by the U.S. National Science Foundation, is an early example of a large-scale project that built both a cyberinfrastructure and culture of data discovery, sharing, and reuse. DataONE used a Working Group model, where a diverse group of participants collaborated on targeted research and development activities to achieve broader project goals. This article summarizes the work carried out by two of DataONE’s working groups: Usability & Assessment (2009-2019) and Sociocultural Issues (2009-2014). The activities of these working groups provide a unique longitudinal look at how scientists, librarians, and other key stakeholders engaged in convergence research to identify and analyze practices around research data management through the development of boundary objects, an iterative assessment program, and reflection. Members of the working groups disseminated their findings widely in papers, presentations, and datasets, reaching international audiences through publications in 25 different journals and presentations to over 5,000 people at interdisciplinary venues. The working groups helped inform the DataONE cyberinfrastructure and influenced the evolving data management landscape. By studying working groups over time, the paper also presents lessons learned about the working group model for global large-scale projects that bring together participants from multiple disciplines and communities in convergence research.
“Ten months into the NIH Preprint Pilot, more than 2,100 preprints reporting NIH-supported research on COVID-19 are now discoverable in PubMed Central (PMC) and PubMed. Through early April 2021, these records have been viewed more than 1 million times in each of these databases (1.4 million in PMC; 1 million in PubMed). Of the preprints included in the pilot, ~60% are currently discoverable only as a preprint version, having not yet been linked to a published article. All articles are clearly identified as preprints. Preprints may be selected or excluded in searches by using the preprint filter.
The pilot launched in June 2020 with preprint records from medRxiv, bioRxiv, arXiv, ChemRxiv, Research Square, and SSRN. Phase 1 has focused on improving the discoverability of preprints relating to the ongoing public health emergency and accelerating dissemination of NIH-supported research on the SARS-CoV-2 virus and COVID-19. This narrowly scoped first phase has allowed the National Library of Medicine (NLM) to streamline curation and ingest workflows for NIH-supported preprints and refine the details of implementation with a set of articles for which there has been high demand for accelerated access and discovery. Since launching the pilot, NLM has made display of preprint records in PubMed search results more transparent. We have also automated checks for new preprint versions and preprint withdrawals, and reduced the steps required to report preprints as products of awards in My Bibliography….”
“genomeRxiv is a newly-funded US-UK collaboration to provide a public, web-accessible database of public genome sequences, accurately catalogued and classified by whole-genome similarity independent of their taxonomic affiliation. Our goal is to supply the basic and applied research community with rapid, precise and accurate identification of unknown isolates based on genome sequence alone, and with molecular tools for environmental analysis….”
“For decades, a special court—the Foreign Intelligence Surveillance Court, or “FISC”—has issued secret legal opinions authorizing the U.S. government to conduct sweeping programs of electronic surveillance. These opinions have had a profound impact on Americans’ rights to privacy, free expression, and free association. But many of them are entirely hidden from public view….”
“The archive’s catalog currently holds more than 120 million digital records, as well as “archival metadata and other types of records, including electronic databases.” However, the system has “an unsophisticated search” function, according to a request for information.
While NARA employees add metadata tags to digital records, “There is a delta between what NARA has been able to describe and the specific information that users want from our records,” the RFI states, asking, “Can AI fill the gap?”
During an informational day held in early April, NARA executives outlined some of the challenge, including a single search returning a flood of results from the same source—making it difficult to sift through to find multiple sources—and difficulty distinguishing between records with similar names, such as a search for “Truman” the president versus “Truman” the aircraft carrier.
The current search function also is not able to return accurate results if the search term input is not exactly the same as it exists in the metadata.
The RFI is seeking feedback on automated solutions that can analyze how users search the digital archives and associate those search terms with the appropriate record….”
“The [National Archives] Catalog currently has a large data set (over 100 million digital pages of records, plus archival metadata and other types of records, including electronic databases) and an unsophisticated search. The archival hierarchy of the records is intended to assist the user in discovery, but in the digital realm, users find it difficult to use. The metadata that we have entered manually cannot provide the granular information for users to get the search results they want and it has taken NARA decades to produce. There is a delta between what NARA has been able to describe, and the specific information that users want from our records. Can AI fill the gap?…”
“Data Management Plans
ARL is heartened to see Congress acknowledge the necessity of machine-readable data management plans (DMPs) and open repositories in supporting the academic research enterprise. At a National Science Foundation–funded conference on effective data practices in December 2019, ARL, along with the Association of American Universities, the Association of Public and Land-grant Universities, and the California Digital Library, convened stakeholders including university research officers, scientists, and librarians. Conference participants agreed that data management planning is important for sharing and use of research data and outputs. Participants suggested that the ability to update plans (“just in time”) across the project life cycle and as part of progress reporting would accelerate the value and adoption of DMPs among researchers, beyond what is required for compliance.
ARL encourages the development of a collaborative set of data repository criteria. Coordination among federal agencies will be necessary, as will stakeholder input from researchers, repository managers, librarians, and others. ARL looks forward to continuing these conversations and building upon work already underway within groups such as the Confederation of Open Access Repositories, the Research Data Alliance, and the World Data System….”