“The National Archives of Japan preserves government documents and records of importance as historical materials received by the Prime Minister from various government ministries and agencies, and makes them available to the public with the aim of achieving appropriate preservation and use of such government documents and records that are in the custody of the National Archives or government organs….”
“The Data Preservation Alliance for the Social Sciences (Data-PASS) is a voluntary partnership of organizations created to archive, catalog and preserve data used for social science research. Examples of social science data include: opinion polls; voting records; surveys on family growth and income; social network data; government statistics and indices; and GIS data measuring human activity….”
“I am writing to you about digital preservation, but this is a scholarly communication blog. So, let’s delve into what preservation has to do with open access. SPARC (the Scholarly Publishing and Academic Resources Coalition) defines open access as ‘the free, immediate, online availability of research articles, coupled with the rights to uses these articles fully in the digital environment.‘ The simple answer to what digital preservation has to do with access is that we are not only advocating for open access in the here and now but also for continued access in years to come.
In the traditional sense of open access, I will encourage you to pay attention to what open access publishers say about what they intend to do with the work that you submit to them. LOCKSS (Lots of Copies Keep Stuff Safe) and CLOCKSS (Controlled LOCKSS) are examples of programs designed to provide publishers with digital preservation tools and networks to ensure the safety of their content. If you are submitting your articles to a publisher who is openly involved with LOCKSS or CLOCKSS, then you can be reasonably assured that they have your best preservation interests at heart. But they’re not the only tools available to publishers, so be a good investigator when you explore your publication possibilities.
This introduction is the first part of two posts about digital preservation and access. Look out for the next post with four simple rules for incorporating digital preservation into your personal research routine.”
“Since 2010, Cornell’s sustainability planning initiative has aimed to reduce arXiv’s financial burden and dependence on a single institution, instead creating a broad-based, community-supported resource. arXiv’s funding and governance for the current operation (Classic arXiv) is based on a membership program engaging libraries and research laboratories worldwide that represent the repository’s heaviest institutional users. As of February 2017, we have 206 members representing 25 countries. arXiv’s sustainability plan is founded on and presents a business model for generating revenues and a set of governance, editorial, and financial principles. Cornell University Library (CUL), the Simons Foundation, and a global collective of institutional members support arXiv financially. The financial model for 2013–2017 entails three sources of revenues:
CUL provides a cash subsidy of $75,000 per year in support of arXiv’s operational costs. In addition, CUL makes an in-kind contribution of all indirect costs, which currently represents 37% of total operating expenses.
The Simons Foundation contributes $100,000 per year ($50,000 prior to 2016) in recognition of CUL’s stewardship of arXiv. In addition, the Foundation matches $300,000 per year of the funds generated through arXiv membership fees.
Each member institution pledges a five-year funding commitment to support arXiv. Based on institutional usage ranking, the annual fees are set in four tiers from $1,500 to $3,000.
In 2016, Cornell raised approximately $515,000 through membership fees from 201 institutions and the total revenue (including CUL, Simons Foundation direct contributions, and online fundraising) is around $1,015,000. We remain grateful for the support from the Simons Foundation that encouraged long-term community support by lowering arXiv membership fees and making participation affordable to a broad range of institutions. This model aims to ensure that the ultimate responsibility for sustaining arXiv remains with the research communities and institutions that benefit from the service most directly.”
“While it is illegal to destroy government data, removing data from accessible agency websites can effectively impede accessibility. Revising websites or creating other barriers to the underlying information can make it very difficult to find vital information. Also, much of the scientific information painstakingly collected over past decades, and costing hundreds of billions of dollars, remains held only by the government, and it is distributed through thousands of servers in hundreds of federal departments where it might not be backed up, making it difficult or impossible to find. Once information becomes sequestered, it becomes nearly impossible to know what has been lost if one doesn’t know what originally was there.
Thus, there is growing anxiety developing among many scientists who rely on the vast cache of data housed on government servers that key data may become sequestered or unavailable for public access. Many researchers further fear a crusade by the Trump administration against the scientific information provided to the public; the National Centers for Environmental Information may be one federal agency especially vulnerable to having vital information sequestered or removed from ready access. The proposed deep budget cuts for several government agencies have added to the fears of important databases being selectively reduced or removed….”
“The unfettered exchange and careful preservation of information are fundamental to democracy, progress, and intellectual freedom. The critical research and scholarship conducted by government entities and academic institutions worldwide safeguard and support human rights, public health, the environment, artistic and literary enterprise, scientific and technological innovation, and much more. This scholarship is critical for informed discourse and policy development throughout society. As such, the fruits of governmental and scholarly research—the data and documentation generated and released—must remain publicly available and must not be suppressed, endangered, or altered to serve political ends….OSC and the UC Libraries are working to protect public access to government data and research in the event that the original sources for these materials should be compromised. In the coming weeks, OSC and librarians on each of the UC campuses will identify specific actions to be taken to ensure that research data, publications, and scholarship remain accessible and discoverable. These efforts are not intended to supplant the authoritative sources for government data, publications, and information. Rather, we are working to make certain that these materials remain shielded from inappropriate political influence or suppression. We support similar information rescue and preservation efforts taking place around the country and encourage other institutions to join in this commitment. …”
“This paper explains the digital preservation challenge, examines various technical, legal, commercial, and governance elements, and recommends concrete proposals in each area. The research is based on a wide-ranging review of pertinent books, reports, studies, and articles familiar to experts in the digital preservation community….[W]hile open access publishing can lower the entry barriers for smaller publications to publish high-quality research, these same works face a greater risk of not being preserved than larger, well-established proprietary journals….”
“Scientists, librarians, and digital historians from a growing number of universities have begun a crowdsourced effort to copy and archive thousands of federal-government websites and data sets related to climate change, the environment, and other areas of scientific research that they fear could become compromised or inaccessible under the incoming Trump administration.
The movement, now known by its hashtag, #DataRefuge, took off in response to an essay in The Washington Post in which a meteorologist and journalist named Eric Holthaus described how he had begun “to systematically catalog and preserve as much of the federal government’s publicly available climate-science data as possible in the next five weeks.” …”
“Since the launch of ETDs @ Harvard in 2014, nineteen Schools, Departments, and Programs have adopted it to manage their student theses and dissertations….
ETDs @ Harvard streamlines the submission process for students, faculty, and administrators, and pipes student data and files to several downstream systems. Once a student submits her work, and it is approved, ETDs @ Harvard sends it to DASH, the Harvard open-access repository, HOLLIS+, the Harvard Library catalog, DRS, the Harvard Library digital preservation system, and the printer, for producing bound copies for the Harvard University Archives or Countway Library….
[W]e’ve deposited 2,853 ETDs [since 2011], which have been downloaded 624,594 times, an average of 219 downloads each….“