Google Books
US Copyright office no fan of google book settlement
Submitted by jrjacobs on Thu, 2009-09-10 11:19.Building on our previous post about today's House hearing on digital books, it appears that Marybeth Peters, head of the US Copyright Office, is not supportive of the google book settlement. In written testimony (PDF) before the House Judiciary Committee, she wrote that the settlement...
“...inappropriately creates something similar to a compulsory license for works, unfairly alters the property interests of millions of rights-holders of out-of-print works without any Congressional oversight and has the capacity to create diplomatic stress for the United States.”
For more, see today's Wallstreet Journal blog: "Copyright Office No Fan of Google Books Settlement."
- jrjacobs's blog
- Add new comment
- 771 reads
September 10: House Hearing on Digital Books
Submitted by PGarvin on Wed, 2009-09-09 15:32.The House Judiciary Committee will hold a hearing on "Competition and Commerce in Digital Books" at 10 a.m. tomorrow, September 10. The hearing will be webcast; the link is on the committee's hearings calendar page.
ALA has some information available on its site. See:
Library Associations submit testimony... and the ALA Google Book Settlement page.
- PGarvin's blog
- Add new comment
- 892 reads
Tell Google Book Search to protect reader privacy
Submitted by jrjacobs on Sat, 2009-07-25 00:50.The Electronic Frontier Foundation (EFF), those defenders of online free speech, privacy, innovation, and consumer rights, have begun an action to tell Google to protect reader privacy. Please sign the petition and send a clear message to Google CEO Eric Schmidt to protect reader privacy.
You shouldn't be forced to pay for digital books with your privacy. Tell Google it needs to develop a robust privacy policy that gives you at least as much privacy in books online as you have in your neighborhood library or bookstore. Google must:
- Protect your reading records from government and third party fishing expeditions by responding only to properly-issued warrants and court orders, and by letting you know if someone has demanded access to information Google has collected about you.
- Make sure that you can still browse and read anonymously by not forcing you to register or give personal information and by deleting any logging information for all services after a maximum of 30 days.
- Separate data related to Google Book Search from any other information the company collects about you, unless you give it express permission.
- Give you the ability to edit and delete any information collected about you, transfer books from one account to another without tracking, and hide your "bookshelves" or other reading lists from others with access to your computer.
- Keep Google Book Search information private from third parties like credit card processors, book publishers, and advertisers.
And since Google is clearly angling itself as a "library" -- even publishing a "Google librarian newsletter"! -- I would ask all who submit an EFF petition to include the American Library Association's (ALA) Library Bill of Rights and a link to the ALA Intellectual Freedom Manual, which states:
Privacy is essential to the exercise of free speech, free thought, and free association. The courts have established a First Amendment right to receive information in a publicly funded library. Further, the courts have upheld the right to privacy based on the Bill of Rights of the U.S. Constitution. Many states provide guarantees of privacy in their constitutions and statute law. Numerous decisions in case law have defined and extended rights to privacy.
In a library (physical or virtual), the right to privacy is the right to open inquiry without having the subject of one’s interest examined or scrutinized by others. Confidentiality exists when a library is in possession of personally identifiable information about users and keeps that information private on their behalf.
Protecting user privacy and confidentiality has long been an integral part of the mission of libraries. The ALA has affirmed a right to privacy since 1939. Existing ALA policies affirm that confidentiality is crucial to freedom of inquiry. Rights to privacy and confidentiality also are implicit in the Library Bill of Rights’ guarantee of free access to library resources for all users.
--Privacy: an interpretation of the Library Bill of Rights
- jrjacobs's blog
- Add new comment
- 1033 reads
Brewster Kahle on Google Book Digitization and the Future of Libraries
Submitted by jajacobs on Thu, 2009-04-30 16:16.Of all the things I have read about the Google book digitization project and its consequences, this is one of the best. Listen to the interview (Lunchtime Listen!) or read the transcript.
- Google Faces Antitrust Investigation for Agreement to Digitize Millions of Books Online, interview with Brewster Kahle, Democracy Now!, April 30, 2009.
This is relevant to government documents since so many are in the project. The way they are treated and controlled by Google and Google's contracts and licenses and agreements will have lasting impact on long-term, free, public access.
Kahle highlights two things that, for me, are very important. First, at least some of the participating libraries are relying solely on Google and its restrictions and are not even getting digital copies from Google although they could.
BREWSTER KAHLE: Let's take the out of copyright, the stuff that's really--it's public domain, meaning belongs to the public. It's lived long enough to become part of the public sphere. But there are perpetual restrictions that the libraries must perform, that if they get these digital copies back, they must put up restrictions on use, such that they cannot be accessible by the general public.
AMY GOODMAN: Who can they be accessed by?
BREWSTER KAHLE: People on campus can use them, for the out-of-copyright works, but just on campus. And otherwise, they have to put up restrictions. And what's turning out is a lot of these libraries aren't even bothering to get copies back, because what can they use them for? I mean, in the future, people are going to want to have access to as many books as possible. And what Google is doing is pulling these together for many libraries to build a great collection. Terrific. But the bits and pieces that are going back to these libraries don't make up a great collection. And what they can do with them is very, very limited. So these libraries aren't, in many cases, even bothering to get the digital copies back.
Second, when Kahle asked if Google would share copies of digitized books with the Internet Archive, Google refused.
AMY GOODMAN: Conceivably, Google could give you the digitized copies, is that right?
BREWSTER KAHLE: Yes, Google could, but they have refused.
AMY GOODMAN: Why?
BREWSTER KAHLE: They say that they've paid for the work. They want to be the place that people go to get them. So they are going to be the proprietors of the public domain.
Although Google claims its mission is "to organize the world's information and make it universally accessible and useful," it would be more accurate to say its mission is to make money controlling the world's information.
- jajacobs's blog
- 1 comment
- 1385 reads
National Academies Reports (1863 to 1997) Now Available in Open Access
Submitted by jajacobs on Fri, 2009-04-10 18:49.The National Academies (The National Academy of Sciences, National Academy of Engineering, Institute of Medicine, and National Research Council) have a long history of advising the government. Now, they have announced "the completion of the first phase of a partnership with Google to digitize the library's collection of reports from 1863 to 1997, making them available – free, searchable, and in full text – through Google Book Search. The Academies plan to have their entire collection of nearly 11,000 reports digitized by 2011."
- More Than 9,000 National Academies Reports Now Available in Open Access, press release, April 10, 2009.
Some publications of the Academies are already available through Google Book Search, but not full text. (See for example: Realizing the information future By National Research Council). The announcement does not make clear whether some of these will become available full text or not.
- jajacobs's blog
- Add new comment
- 749 reads
Why is the 1957 Census of Govts deemed under copyright?
Submitted by jrjacobs on Wed, 2009-02-25 17:58.I just talked with a researcher who was interested in getting his hands on a digital copy of the 1957 Census of Governments. My momentary joy at finding a copy at the University of Michigan (my go-to library to find digital govt documents!) quickly turned to disappointment on seeing the message:
Page images and full text of this item are not available due to copyright restrictions.
There ought to be a way for people/librarians to check the document for copyrighted bits and then quickly flip a switch to release it into the public domain and make it accessible to everyone. Is that too much to ask? Over time, we could lessen the impact that Google's scorched earth copyright policy has on documents that should rightfully be in the public domain. And another thing, why didn't they scan statistical resources to .csv files?!
That is all.
- jrjacobs's blog
- 2 comments
- 1791 reads
Google Books/Fed Docs: Google Books Statistics--The Bigger Picture
Submitted by jtryon on Fri, 2007-08-17 08:10.Now that I had some statistics it dawned on me I had no idea whether or not this was a lot documents. So I was off to the FDLP desktop and the Catalog of U.S. Government Publications.
I looked around the desktop to see if GPO listed any statistics. On the "about" page for the CGP, GPO says merely that there are more than 500,000 records in the database. So I gave some thought to how I might get a better figure, and off I went to OCLC and the GPO database in FirstSearch. On the database info page, OCLC lists 507,000+ as the number of records and that the database had its last monthly update on August 8, 2007.
So I went back to the CGP and its advance search page. Searching for GPO in the publisher field is not terribly effective. Of course, in this database everything is a government document so that is not a problem.
But how to get a real number out of the database? I tried using the most common of words--a and the-- but to no great effect. A brings up 359,875 records and the brings up 411,493. Neither result comes close enough to the supposed 507,000.
I had another realization that the CGP now includes records for electronic titles--titles that would not be fodder for the Google Book Project. Using the New Electronic Titles page is not really an option to count them as it only goes back to April of 2005 and since early 2006 the monthly lists are not numbered (leaving me to do a lot of counting).
So back to the advanced searching page in the CGP. Happily here you can search for terms in the URL/PURL. I proceeded to search for every record that listed .gov, .mil, .us, .org, and .com. I came up with a total of 64,504 records. So approximately 13% of the records in the CGP are electronic titles or are titles with an electronic counterpart.
Unfortunately I had another realization that these figures really only represent documents published from 1976 on. This is a really big problem in that most of the documents I found in Google Books dated to before 1923. My only hope to get good numbers was to askGPO. So late on August 8th I shot off a query to GPO asking for statistics on the number of documents GPO has distributed both before and after 1976.
Surprisingly enough, GPO called me first thing the next morning. askGPO is notoriously slow in providing answers to queries so I was very surprised! I spoke with Nancy Faget at GPO and she was very pleasant though not exactly forthcoming with numbers. It struck me that I got the quick call back as GPO viewed my query as the first step in getting out of the program. As far as I know my director has no real intentions of doing that, but I don't think I convinced her on that point. But aside from that she told me that GPO really didn't know how many documents went to depositories since the beginning. Alas!!
I honestly don't know if would be fair to take the view that probably as much was distributed from 1813 to 1976 as was published after 1976. But if you did, that would lead to believe that over one million documents have been produced.
So the bigger picture suggests that the 167,878 titles in Google Books is only about 17% of all the documents that could be digitized. At a guess...
So I put a call out to everyone in GovDoc Land. If you are a full depository and have been one since 1813 and have kept really good records, could you please send me the statistics? Thank you very kindly in advance!
- jtryon's blog
- Add new comment
- 1594 reads
Google Books/Fed Docs: Google Statistics from Scratch
Submitted by jtryon on Fri, 2007-08-17 06:45.Having found no published statistics for numbers of digitized books in Google Books, and especially nothing about digitized government publications, I was left with coming up with them on my own.
So I went to the Advanced Book Search screen for Google Books. Looking at the search options provided there I decided that the only way I could get reasonably useful statistics was to search for books published by GPO. As you are all aware not all government documents are actually published by GPO. Many are merely distributed by them. So I knew that my numbers would not be exact. Another problem was that over the years GPO listed themselves as publishers using a variety of abbreviations and phrases.
My first try was to use GPO in publisher and on August 8th I retrieved 141,600 hits. However just now when I ran it again, I only got 117,600. Hmmm.
Next search was for Government Printing Office, which retrieved both today and on the 8th, 43,600 hits. This was followed by gov't, which on the 8th retrieved 2,322 titles but today only retrieved 2,258.
The grand total for using these three searches on August 8th was 187,522.
Today as I was double checking my results, I also tried gov. print. off. and got 4,420 hits. So as of this morning the grand total is 167,878. I find it rather disconcerting that the number as dropped so much in nine days!
- jtryon's blog
- Add new comment
- 1603 reads
Google Books/Fed Docs: Google Books and Documents Coverage--From the Beginning
Submitted by jtryon on Fri, 2007-08-17 06:11.I have a hard time trying to figure out where to begin this blog, so I have decided to start at the beginning even though I have written a bit about this in a message posted to GOVDOC-L on August 8th. So here goes...
I was asked to find out how much government information is available in the various Googles. Over the past few months I had saved posts from GOVDOC-L that had Google in the subject line; so I thought this would be an easy assignment. Turns out that the messages did not give statistics, instead they were questions about Google's practice of making the full text of all books published after 1923 unavailable.
Well I was a bit disappointed but I still thought that I would find the information on Google's website. I figured that Google would be tooting its own horn about the growth of this infamous project. Not so. There are no statistics anywhere, and there was very little that described the scope of the project.
Next I went into research mode. I checked for articles in EBSCO's Academic Search Premier and Lexis-Nexis. I found some interesting news articles on the project but again no statistics. I then tried to search the web pages of a few library partners I looked at the University I had a little luck on Stanford's web site on Robotic Book Scanning. There was a page a few statistics listed there but alas they dated from June2004.
I even Googled such keywords that I hoped would bring up statistics. But considering how many different way one might refer to statistical information, it was frustrating to do. I didn't find any statistics this way but I did find some intersting Blog entries about the full-text copyright issue.
So I was on my own.
- jtryon's blog
- Add new comment
- 1166 reads
Google Books/Fed Docs: More on Julia and her project
Submitted by dcornwall on Thu, 2007-08-16 06:28.Julia Tryon was kind enough to send us a fuller description of her Google Books research plan and a biography, which we are sharing with you today:
What this project is about:
At the request of my director, I am researching how much of government information is available online and in particular how much is available in Google Books. I have begun by checking out Google's info pages on Google Books and by looking at some of Google's partner's websites such as University of Michigan and Stanford University. I am also looking at Google Books' functionality and ease of use, which plan to compare with other full-text versions. I have no idea where all this will take me!
Biography:
I have been the Government Documents Librarian at the Phillips Memorial Library at Providence College since 2000. I have been a reference librarian here for 19 years. I am currently the chair of the CRIARL (Consortium of Rhode Island Academic and Research Libraries) Government Documents Committee and the treasurer for GPLNE (Government Publications Librarians of New England).On a personal note, I live in the quaint village of Ashaway in Rhode Island, I am a rabid reader of tacky novels, and I am being held hostage by four cats and a Victorian house.
-----------------------------
We and I hope you are eagerly awaiting to her more dispatches from Julia's explorations!
- dcornwall's blog
- Add new comment
- 1011 reads


Recent comments
2 days 4 hours ago
2 days 12 hours ago
3 days 23 hours ago
4 days 2 hours ago
4 days 23 hours ago
1 week 6 days ago
3 weeks 2 days ago
3 weeks 2 days ago
3 weeks 3 days ago
4 weeks 4 days ago