“As someone who’s uploaded hundreds of thousands of items to the Internet Archive’s stacks and who has probably done a few million transactions with the materials over the years, I just “know” about the Internet Archive python client, and if you’re someone who wants to interact with the site as a power user (or were looking for an excuse to), it’ll help you to know about it too.
You might even be the kind of power user who is elbowing me out of the way saying “show me the code and show me the documentation”. Well, the documentation is here and the code is here. Have a great time….”
“This zip file contains JATS-standard XMLcontent of every PLOS article, including all Articles and Front Matter. It does not include Figures or Supplemental Data. It’s just under five GB in size, and is updated every day with new articles. We also make our articles available through PubMed Central and our API….”
” The Association of American Publishers has reviewed the 2018 documents authored by David R. Hansen and Kyle K. Courtney on the subject of unauthorized library copying for the purpose of digital transmission of entire books to the public. These documents (collectively the “White Paper”) argue that libraries engaging in this activity do not infringe copyright in literary works because such copying and transmission falls within the fair use and first sale doctrines under the invented theory and White Paper definition of “controlled digital lending” (“CDL”).
AAP strongly disagrees with the analysis of the White Paper and its call to libraries to copy and transmit copies of entire books to the public in disregard of the law. CDL not only rationalizes what would amount to systematic infringement, it denigrates the incentives that copyright law provides to authors and publishers to document, write, invest in, and disseminate literary works for the benefit of the public ecosystem….”
“We’re very pleased to announce the release of two documents that we believe have the potential to help greatly expand digital access to print library collections by helping libraries do online what we have always done in print: lend books.
Both documents are aimed at addressing the legal and policy rationales for what we term “controlled digital lending” — a method by which libraries loan digitized print books to digital patrons in a “lend like print” fashion similar to how non-digital patrons check out books in-person. Through CDL, libraries use technical controls to ensure a consistent “owned-to-loaned” ratio, meaning the library circulates the exact number of copies of a specific title it owns, regardless of format, putting controls in place to prevent users from redistributing or copying the digitized version….”
” “Controlled Digital Lending” or “CDL” is a recently invented legal theory that allows libraries to justify the scanning (or obtaining of scans) of print books and e-lending those digital copies to users without obtaining authorization from the copyright owners. A position statement on CDL, along with an accompanying white paper, was issued this past October by legal scholars, the culmination of several academic meetings on the subject. The statement and paper argue that it is fair use for libraries to scan or obtain scans of physical books that they own and loan those books through e-lending technologies, provided they apply certain restrictions akin to physical library loans, such as lending only one copy (either the digital copy or the physical copy) at a time and only for a defined loan period.
A couple dozen organizations, including Internet Archive, as well as a number of academics and academic librarians, are listed as signing the position statement. Several major library systems, including the California State University libraries and the Boston and San Francisco public libraries, are signatories and apparently already rely on CDL to e-lend scanned copies of books….”
“The UC Berkeley Library took an important step forward today in improving widespread digital access to its vast collections. Along with the UC Davis and UCSF libraries and the California Digital Library, the Library became an early signatory to a newly released position statement supporting rights for libraries to digitize in-copyright works in their collections, then lend them according to the same lending terms as the original print copies. The position statement, developed by copyright scholars from multiple institutions, as well as policy counsel for the Internet Archive, is accompanied by a white paper that outlines legal rationale for how controlled digital lending can be implemented to enable electronic access to certain library collections….”
“The recent nomination and appointment of Brett Kavanaugh to the U.S. Supreme Court offered a timely opportunity to demonstrate how controlled digital lending can be used by libraries to circulate digital copies of books that are out of print or not widely held. The basic premise of controlled digital lending is “own one, loan one”—rather than loaning a physical book in their collection, libraries can choose instead to loan a scanned version of that book to one user at a time, while the physical book remains on the shelf.
A key player during the confirmation hearing was Mark Judge, a friend of Kavanaugh’s who wrote the book Wasted: Tales of a GenX Drunk, describing his raucous, alcohol-fueled high school years. Judge’s memoir was published in 1997 by Hazelden Publishing, the publishing arm of the Hazelden Betty Ford Foundation, which runs the recovery centers where Judge was treated for addiction. The book had a limited print run and subsequent shelf life—it was not widely held by libraries outside of those focusing on addiction and recovery. Interest skyrocketed once Judge’s book entered the public consciousness, but because the book was no longer being sold by the publisher and used copies were scarce, when available at all, its price on Amazon.com topped out just under $2,000.
Boston Public Library (BPL), a long-time scanning partner with Internet Archive, located a copy of Wasted in their research stacks. Those books are only available for use within the library, so the book was never going to circulate. Tom Blake, Manager of Content Discovery at BPL, sent the book down to be scanned by Internet Archive book scanners in their in-house digitization center. Internet Archive staff digitized the book using the same procedures and equipment that have been used to digitize more than 55,000 books from BPL’s collection since the partnership with Internet Archive began in 2007. Using existing workflows and post-production processes, the physical book was scanned and turned into a digital book complete with page images, OCR text, and mobile-friendly formats before being placed online at https://archive.org/details/wastedtalesofgen00judg….”
“It’s been a few years since we last had to write about the Authors Guild — a group that ostensibly represents authors’ interests, but really acts more like a front group forpublishers’interests (often in opposition to the actual interests of authors). As you may recall, the Authors Guild spent tons of the money authors gave it for dues on suing libraries. Specifically it sued and lost againstHathitrust (a collection of libraries which were scanning books to make a searchable index), and then had the same result with Googleand its book scanning project. In both cases, the courts deemed such scanning and indexing as fair use — a transformative use of the work.
Apparently, unable to comprehend that maybe it shouldn’t attack libraries, the Authors Guild is at it again, threatening the Internet Archive and other libraries for daring to start a carefully designed program to lend out copies of some of their scanned works. The system, called Controlled Digital Lending was put together by a bunch of libraries and the Internet Archive to lay out a system that they believe is clearly covered by fair use, by which digital scans of certain books could be made available on loan like any other library book. The whole setup of the Controlled Digital Lending system is carefully laid out and designed to mimic traditional library lending….”
“Co-hosted by theInternet Archive andCreative Commons, this celebration will feature a keynote addresses by Lawrence Lessig and Cory Doctorow, lightning talks, demos, multimedia displays and more to mark the “re-opening” of the public domain in the United States. The event will take place at the Internet Archive in San Francisco….”