Home » Posts tagged 'data preservation' (Page 2)

Tag Archives: data preservation

Our mission

Free Government Information (FGI) is a place for initiating dialogue and building consensus among the various players (libraries, government agencies, non-profit organizations, researchers, journalists, etc.) who have a stake in the preservation of and perpetual free access to government information. FGI promotes free government information through collaboration, education, advocacy and research.

FGI accidental docs librarian webinar: “Saving govt data: a conversation with the future”

Please tune in next Wednesday, March 29, 2017 from 9am – 10am Pacific / 12:00 – 1:00pm Eastern for the next Help! I’m an Accidental Government Information Librarian Webinar “Saving government data: A conversation with the future.” You’ll need to in order to get the link to the WebEx live session. “See” you there!

Help! I’m an Accidental Government Information Librarian presents … Saving government data: A conversation with the future, on Wednesday, March 29, 2017 from 12:00 – 1:00 p.m. (Eastern).

In recent months, the DataRefuge project has collaborated with hundreds of volunteers around the United States to collect, describe, and store federal data that support climate and environmental research and advocacy. This project, and others like it, works in conjunction with the End of Term Web Archive to capture and make available federal web content during administrative transitions.

Our discussion will explore the fragility of digital information, and expand on ideas about what data is. We’ll talk about current projects and efforts, and explore the future of this work. Finally, we’ll address the concept of sustainability, and propose a paradigm of empowered experimentation that aligns with our values and roles within libraries.

We will meet together for Session #69, online on Wednesday, March 29, 2017 from 12:00 – 1:00 p.m. (Eastern). Please RSVP for the session using this link:  http://bit.ly/GRS-Session69

We will use WebEx for the live session. Information on testing and accessing the session will be made available when you register.

The session will be recorded and available after the live session, linked from the NCLA GRS web page (http://www.nclaonline.org/government-resources).

Presenters:
Laurie Allen is the Assistant Director for Digital Scholarship in the Penn Libraries, where she leads a group working to expand the capacity of researchers at Penn to create and share scholarship in new forms. The group engages in digital project development, data management and curation, mapping, experimentations with emerging research methods, and open access publishing. In late 2016, Allen was part of the group that started Data Refuge, and has been involved in bringing together a group of collaborators to form a network of libraries, open data activists and open government efforts.

James A. Jacobs is Data Services Librarian Emeritus, University of California San Diego. He has more than 25 years experience working with digital information, digital services, and digital library collections. He is a technical consultant and advisor to the Center for Research Libraries in the auditing and certification of digital repositories using the Trusted Repository Audit Checklist (TRAC) and related CRL criteria. He served as Data Services Librarian at the University of California San Diego and co-taught the ICPSR summer workshop, “Providing Social Science Data Services: Strategies for Design and Operation”. He is a co-founder of Free Government Information.

James R. Jacobs is the US Government Information Librarian at Stanford University Libraries where he works on both collection development as well as digital projects like LOCKSS-USDOCS. He is a member of ALA’s Government Documents Roundtable (GODORT) and served a 3-year term on Depository Library Council to the Public Printer, including serving as DLC Chair. He is a co-founder of Free Government Information (freegovinfo.info) and Radical Reference (radicalreference.info) and is on the board of Question Copyright, a 501(c)(3) non-profit organization that promotes a better public understanding of the  effects of copyright, and encourages the development of alternatives to information monopolies.

Shari Laster is the Government Information Librarian and Data Services Librarian at the University of California, Santa Barbara. She currently serves as Assistant Chair/Chair-Elect for the Government Documents Round Table of the American Library Association, and is a past chair of the Depository Library Council, the advisory body for the Federal Depository Library Program.

via Help! I’m an Accidental Government Information Librarian Webinars | North Carolina Library Association.

DoE’s Carbon Dioxide Information Analysis Center (CDIAC) shut down without comment. Data in preservation danger

This is terrible. The US Department of Energy (DOE) has summarily shut down the Carbon Dioxide Information Analysis Center (CDIAC), located at the Oak Ridge National Laboratory (ORNL) as of 10/1/2016. CDIAC is the primary climate change data and information analysis center for DOE. CDIAC is supported by DOE’s Climate and Environmental Sciences Division within the Office of Biological and Environmental Research (BER).

A friend reports that CDIAC has limited funding and is trying to save its data in the NASA Distributed Active Archive Center (DAAC). There has been no outside comment and neither DOE nor ORNL have yet to issue a press release.

I just checked and the CDIAC site is in the Internet Archive Wayback Machine. However, the CDIAC’s entire data catalog is served out over FTP which is not captured by IA’s heritrix Web crawler.

NOTICE: CDIAC as currently configured and hosted by ORNL will cease operations on September 30, 2017. Data will continue to be available through this portal until that time. Data transition plans are being developed with DOE to ensure preservation and availability beyond 2017.

via Carbon Dioxide Information Analysis Center (CDIAC).

Research data lost to the sands of time

Here’s an interesting article, not on link rot (a topic FGI has been tracking for some time), but on *data rot*. In a recent article in Current Biology, researchers examined the availability of data from 516 studies between 2 and 22 years old. They found the following:

  • that the odds of a data set being reported as extant fell by 17% per year;
  • Broken e-mails and obsolete storage devices were the main obstacles to data sharing
  • Policies mandating data archiving at publication are clearly needed

Librarians have known of this issue for years — the Inter-university Consortium for Political and Social Research (ICPSR) was set up in 1962 to tackle this — but it does put the issue in focus. And finally the federal government — via efforts like the NSF’s data management plan and OSTP’s new directive to improve the management of and access to scientific collections — is beginning to get behind the effort to improve on data rot. And many libraries — not to mention scientists and researchers — are beginning to struggle with the issue of data preservation. The issue is too big for just government information librarians to handle obviously. But this is fertile space in which govt information librarians, data librarians, research communities, and federal agencies can come together. The Federal policy stating the importance of data preservation is there, it’ll just take effort by multiple stakeholders to make sure it actually happens. It’s a positive that the writers of Dragonfly, the blog of the National Network of Libraries of Medicine Pacific Northwest Region — where I came across the article — point out that academic institutions can and should play a leading role in data preservation. I wholeheartedly agree!

Vines, Timothy H., Arianne YK Albert, Rose L. Andrew, Florence Débarre, Dan G. Bock, Michelle T. Franklin, Kimberly J. Gilbert, Jean-Sébastien Moore, Sébastien Renaut, and Diana J. Rennison. “The availability of research data declines rapidly with article age.” Current Biology 24, no. 1 (2014): 94-97.
http://dx.doi.org/10.1016/j.cub.2013.11.014

The researchers found that for every year that had passed since the paper’s publication date, the odds of finding an email address that led to contact with a study author decreased by 7% and that the odds of turning up the data reduced by 17% per year.  The authors report that while some of the data sets were truly lost others fell more into the category of “unavailable,” since they existed, but solely on inaccessible media (think Jaz disk).  These findings will not come as a shock to those who have worked in a research lab.  This publication does put some tangible numbers behind the underlying message of NYU Health Sciences Library’s excellent dramatic portrayal of an instance of inaccessible data.  The authors conclude by suggesting that a solution to this problem moving forward can be found in more journals requiring the deposit of data into a public archive upon publication.  I would also suggest that academic institutions can take a role by establishing policies supporting research data preservation alongside providing a data repository.

Canadian libricide: Harper government dumps centuries of irreplaceable environmental archives

This is absolutely tragic. In 2012, when Canada’s Harper government announced that it would close down national archive sites around the country, they promised that anything that was discarded or sold would be digitized first. However, reporting coming out of Canada now is finding that, along with the closure of some of the world’s finest fishery, ocean and environmental libraries, a significant amount of irreplaceable collections and data are simply being thrown out or burned. As David Rosenthal noted in his blog post “Threat Model for Archives”, this should be a giant warning to anyone who thinks that “single, government-funded archives are a solution to anything.”

Hutchings said none of the closures has anything to do with saving money, due to the small cost of maintaining the collections. He, like many scientists, concludes that Harper’s political convictions are driving the unprecedented consolidation.

“It must be about ideology. Nothing else fits,” said Hutchings. “What that ideology is, is not clear. Does it reflect that part of the Harper government that doesn’t think government should be involved in the very things that affect our lives? Or is it that the role of government is not to collect books or fund science? Or is it the idea that a good government is stripped down government? ”

Hutchings saw the library closures fitting a larger pattern of “fear and insecurity” within the Harper government, “about how to deal with science and knowledge.”

That pattern includes the gutting of the Fisheries Act, the muzzling of scientists, the abandonment of climate change research and the dismantling of countless research programs, including the world famous Experimental Lakes Area. All these examples indicate that the Harper government strongly regards environmental science as a threat to unfettered resource exploitation.

“There is a group of people who don’t know how to deal with science and evidence. They see it as a problem and the best way to deal with it is to cut it off at the knees and make it ineffective,” explained Hutchings.

via The Tyee – What’s Driving Chaotic Dismantling of Canada’s Science Libraries?.

[HT BoingBoing!]

Geospatial Data Preservation Resource Center

Geospatial Data Preservation Resource Center

A new Web site, the Geospatial Data Preservation Resource Center, aims to help those responsible for producing and managing geospatial data learn about the latest approaches and tools available to facilitate long-term geospatial data preservation and access. The Web site provides descriptions and links for a variety of relevant resources, including education and training modules, useful tools and software, information on policies and standards for preserving geospatial data, and examples of successful preservation and associated benefits. This first release of the Web site, which CIESIN will be enhancing over the next year, was developed as an element of the National Digital Information Infrastructure and Preservation Program (NDIIPP) of the Library of Congress.

The Geospatial Data Preservation Resource Center is accessible at http://geopreservation.org/

CIESIN, the Center for International Earth Science Information Network, is a unit of the Earth Institute at Columbia University, based at the Lamont campus in Palisades, New York.

Archives