Submitted by jrjacobs on Sat, 2013-04-13 12:41.
I had a good time yesterday on a panel about Web archiving and digital preservation at the Society of California Archivists General meeting 2013 (slides to be posted there soon). The panel was organized by Scott Reed at the Internet Archive, and included Scott, Claude Zachary (University of Southern California), myself and my Stanford colleague Henry Lowood.
One of the coolest things -- other than the fascinating keynote by Dr. Michael Cohen, who talked about "Culture Wars: Engaging Undergraduates in Documenting the Crisis in California Through the Historian's Eye Project" -- was learning about the site archiveready.com. This is a handy little tool to test your Website's archivability. Paste in your url, and it goes through and checks things like standards compliance, accessibility, CSS, site maps, external media and proprietary objects like flash or quicktime, and lastly whether or not your site is already being collected by the Internet Archive's Wayback machine. Freegovinfo did pretty well in the test with an overall rating of 78%. We lost points for having some external images and external scripts (google analytics and a facebook badge), but I don't consider those things critical to the site for the long-term. How does your site do? Are you ready to be archived?!
Submitted by jajacobs on Wed, 2013-04-10 12:11.
If you provide public service for legal or government information, you have probably come across "private laws" and may have wondered what they are. These are not secret laws (which are laws that the public cannot even see!). These are private laws, which means that they usually deal with immigration issues or claims against the government. You might find this CRS report of interest:
Hat tip to Steven Aftergood!
Submitted by jajacobs on Wed, 2013-04-10 08:57.
Some libraries, library organizations, and library managements believe they can "manage" their collections better by first digitizing historic collections of books and other paper and ink information sources and then weeding their collections of these materials. Such projects will reduce the number of copies held in the aggregate by all libraries (Lavoie, Schonfeld, Schottlaender, Yano). One problem that these projects often overlook is the subtle (and not so subtle) differences between the legal standing of paper and digital objects with regard to access and use. Too often, creators of digital objects attempt to impose copyright restrictions on the digital objects even if the originals were in the public domain. Additionally, digital objects are often encumbered with licenses and technological restrictions that limit how they can be used and who can use them. The digital objects are often just not as accessible or as usable as the original print. How bad would it be if we threw away our print collections in favor of digital collections that are less accessible and less usable?
Randal C. Picker, who is Leffmann Professor of Commercial Law and Senior Fellow at the The Computation Institute of the University of Chicago and Argonne National Laboratory University of Chicago Law School, has written a paper and created a presentation on just this issue.
In the paper, he considers how legal issues affect digitization projects such as The Internet Archive, JSTOR, Google Book Search, HathiTrust, and THOMAS.
His take-aways from the presentation are:
- Access rights and use rights are different animals and operate in different legal settings.
- Even though the public domain is coming online, the financing models for the projects will result in efforts to restrict use ina variety of ways.
- Those efforts will be situated at the intersection of technology controls, contract, terms of use, copyright and the CFAA (Computer Fraud and Abuse Act).
- Perhaps a truly public public domain, something like the DPLA perhaps, is required to avoid the path of non-copyright control over the public domain.
Hat Tip: ARL Policy Notes.
Endnotes
Lavoie, Brian F., Constance Malpas, and J.D. Shipengrover. 2012. Print Management at “Mega-scale”: a Regional Perspective on Print Book Collections in North America. Dublin, OH: OCLC Research. http://www.oclc.org/research/publications/library/2012/2012-05.pdf (Accessed July 19, 2012).
Schonfeld, Roger C., and Ross Housewright. 2009. 28 What to Withdraw: Print Collections Management in the Wake of Digitization. Ithaka S+R. http://www.sr.ithaka.org/research-publications/what-withdraw-print-colle....
Schottlaender, Brian E.C. et al. 2004. 82 Collection Management Strategies In A Digital Environment, A Project Of The Collection Management Initiative Of The University Of California Libraries, Final Report to the Andrew W. Mellon Foundation. University of California, Office of the President, Office of Systemwide Library Planning. http://www.ucop.edu/cmi/finalreport/index.html.
Yano, Candace Arai, Z.J. Max Shen, and Stephen Chan. 2008. Optimizing the Number of Copies for Print Preservation of Research Journals. Berkeley, CA: University of California Berkeley, Industrial Engineering & Operations Research. http://www.ieor.berkeley.edu/~shen/webpapers/V.8.pdf.
Submitted by jajacobs on Wed, 2013-04-10 07:28.
The Digital Access to Legal Information Committee (DALIC) of the American Association of Law Libraries (AALL) has created a new website to host information about the status of online legal materials in every state with respect to authentication, official status, preservation, permanent public access, copyright, and universal citation.
- State Online Legal Information, American Association of Law Libraries.
AALL and chapter volunteers researched primary legal materials in their states to determine if online legal materials are trustworthy and preserved for permanent public access. This website brings together information from AALL's National Inventory of Legal Materials and updates AALL's Preliminary Analysis of AALL’s State Legal Inventories, 2007 State-by-State Report on Authentication of Online Legal Resources and 2009-2010 State Summary Updates. Information is provided about the online Administrative Code, Administrative Register, Statutes, Session Laws, High Court Opinions and Appellate Court Opinions in all 50 states and the District of Columbia in the following categories*:
Official Status
Authentication
Preservation
Permanent Public Access
Copyright
Universal Citation
The state pages will be updated as information changes and as we learn more about developments in the states. AALL’s Digital Access to Legal Information Committee (DALIC) will monitor this site and periodically check in with AALL’s state working groups to ensure the accuracy of the information. DALIC also welcomes your additions or corrections.
- New Website for State Online Legal Information, By Elizabeth Holland, American Association of Law Libraries, Washington Blawg (April 9, 2013).
See also:
State-by-State Report on Authentication of Online Legal Resources (2007).
Submitted by jrjacobs on Tue, 2013-04-09 07:20.
Sunlight Foundation's OpenGov Champion of the month is Sandra Moscoso. Sandra is a mom of two public school students in Washington DC, and a member of the Capitol Hill Public School Parent Organization (CHPSPO) -- oh and she just happens to manage an open data portal at the World Bank’s financial sector.
...she and other CHPSPO members were able to collect data to show how the schools that had a full time librarian had better test score results than those who had lost theirs due to budget cuts. The group was able to use that figure as an effective basis for their request to the city to restore funding for librarians.
Submitted by jrjacobs on Mon, 2013-04-08 14:18.
Wikileaks today announced the launch of the Public Library of US Diplomacy (PlusD), a searchable database with the release of Special Project K: the Kissinger cables -- ostensibly, PlusD will include other records in the future. WikiLeaks has published more than 1.7 million U.S. diplomatic records -- including cables from previously released Cablegate cables, intelligence reports, and congressional correspondence -- from January 1, 1973 to December 31, 1976, the period during which Henry Kissinger was secretary of state and national security advisor. The documents were formerly confidential, classified, or labeled "NODIS" ("no distribution") or "Eyes Only". The database can be accessed at http://search.wikileaks.org/plusd/.
According to Wikileaks:
...Most of the records were reviewed by the United States Department of State's systematic 25-year declassification process. At review, the records were assessed and either declassified or kept classified with some or all of the metadata records declassified. Both sets of records were then subject to an additional review by the National Archives and Records Administration (NARA). Once believed to be releasable, they were placed as individual PDFs at the National Archives as part of their Central Foreign Policy Files collection. Despite the review process supposedly assessing documents after 25 years there are no diplomatic records later than 1976. The formal declassification and review process of these extremely valuable historical documents is therefore currently running 12 years late.
According to the Guardian:
The data, which has not been leaked, comprises diplomatic records from the beginning of 1973 to the end of 1976, covering a variety of diplomatic traffic including cables, intelligence reports and congressional correspondence.
Julian Assange said WikiLeaks had been working for the past year to analyse and assess a vast amount of data held at the US national archives before releasing it in a searchable form.
WikiLeaks has called the collection the Public Library of US Diplomacy (PlusD), describing it as the world's largest searchable collection of US confidential, or formerly confidential, diplomatic communications.
Assange told Press Association the information showed the vast range and scope of US diplomatic and intelligence activity around the world.
Henry Kissinger was US secretary of state and national security adviser during the period covered by the collection, and many of the reports were written by him or were sent to him. Thousands of the documents are marked NODIS (no distribution) or Eyes Only, as well as cables originally classed as secret or confidential.
Assange said WikiLeaks had undertaken a detailed analysis of the communications, adding that the information eclipsed Cablegate, a set of more than 250,000 US diplomatic cables published by WikiLeaks from November 2010 and over the following year. He said WikiLeaks had developed sophisticated technical systems to deal with complex and voluminous data.
Top secret documents were not available, while some others were lost or irreversibly corrupted for periods including December 1975 and March and June 1976, said Assange.
Submitted by dcornwall on Sun, 2013-04-07 07:37.
As March link checking was completed, activity has slowed at the State Agency Databases project at http://wikis.ala.org/godort/index.php/State_Agency_Databases. But we reached a a major milestone as we once again have full coverage of the fifty states and the District of Columbia.
We started the year with six or seven "orphan" states but that was quickly whittled down to just Hawaii, Minnesota and Oklahoma. Those stayed in the orphanage for a couple of months and recruiting here and govdoc-l wasn't helping.
So three of our project volunteers stepped up and took on second states. I'd like to offer a loud round of applause to:
- Jenn Zuccaro, for taking on Hawaii in addition to West Virgina.
- Paul J McDonough, for taking on Minnesota in addition to Vermont
- April Sheppard, for taking on Oklahoma in addition to Arkansas.
Now, for this week's activity. You can find a full listing of all the week's changes by visiting http://tinyurl.com/statedbs. Here are some highlights:
DATABASES ADDED:
MICHIGAN (Michael McDonnell)
GeoWebFace - GeoWebFace is a mapping service provided by the Michigan Department of Environmental Quality. It can be used to access all kinds of geologic information. Accessible layers are too numerous to list here but include; mine and quarry data, geologic and hydrological information, land ownership and leasing information, and the location and type of oil and gas wells. There are also links to USGS topographic maps.
WASHINGTON (Marilyn Von Seggern)
Fertilizer Product Database - Database of fertilizer products currently registered for distribution in Washington. Two major pieces of information in the database are the nutrient guarantees and the levels of nine heavy metals (arsenic, cadmium, cobalt, mercury, molybdenum, lead, nickel, selenium, and zinc) found in the product and for which the state has developed soil loading standards.
SUBJECT PAGE ACTIVITY
HEALTHCARE PRACTITIONER DATABASES (Lynn McClelland)
Louisiana
LPN Disciplinary Actions - Current disciplinary actions, by type
Historical LPN Disciplinary Actions] - Lists of disciplinary actions, back to November 1997.
Submitted by jajacobs on Thu, 2013-04-04 17:59.
David Rosenthal gave another fascinating talk about the state of the web and whether or not we can expect to preserve it by harvesting it. This talk was at the 2013 Spring CNI Membership Meeting in San Antonio, TX. David presents an edited text of his talk with links to the sources on his blog:
David and co-presenter Kris Carpenter Negulescu note, among other things, that the days of a document-centered web are long over and that today, what most web pages do "is download and run programs in the current Web's primary language, Javascript. Javascript is a programming language, not a document description language. Your browser is only incidentally a document rendering engine, its primary function is as a virtual machine."
This presents problems for those wishing to preserve information. Among these problems:
- Database driven features & functions
- Complex/variable URI formats & inconsistent/variable link implementations
- Dynamically generated, ever changing, URIs
- Rich Media
- Scripted, incremental display & page loading mechanisms
- Scripted, HTML forms
- Multi-sourced, embedded material
- Dynamic login/auth services: captchas, cross-site/social authentication, & user-sensitive embeds
- Alternate display based on user agent or other parameters
- Exclusions by convention
- Exclusions by design
- Server side scripts & remote procedure calls
- HTML5 "web sockets"
- Mobile publishing
For more about these problems, see also: IIPC Future of the Web Workshop -- Introduction & Overview, International Internet Preservation Consortium (May 17, 2012).
Read David's complete post for a rich discussion of the issues.
Submitted by jajacobs on Wed, 2013-04-03 17:55.
We love our gov-docs, don't we? Enjoy:
- Twenty Awesome Covers From The US Space Program, Space Kinja.

The upcoming 2013 April 18 Space Exploration Signature Auction by Heritage Auctions brought us these fine document covers. Manuals, guidebooks, press kits, reports, brochures - all with cool artworks and typography. Enjoy!
|
Recent comments
1 week 2 days ago
1 week 2 days ago
1 week 5 days ago
2 weeks 7 hours ago
2 weeks 23 hours ago
2 weeks 4 days ago
3 weeks 2 days ago
3 weeks 2 days ago
4 weeks 9 hours ago
4 weeks 2 days ago