Home » Search results for 'America's Data Index'

Search Results for: America's Data Index

Our mission

Free Government Information (FGI) is a place for initiating dialogue and building consensus among the various players (libraries, government agencies, non-profit organizations, researchers, journalists, etc.) who have a stake in the preservation of and perpetual free access to government information. FGI promotes free government information through collaboration, education, advocacy and research.

Newest data tracking effort: dataindex.us

For those looking to keep abreast on all of the changes and deletions to federal government information and data, now there’s dataindex.us. DataIndex is a new collaborative project dedicated to monitoring the federal data infrastructure including dataset availability, new releases, and both planned and unplanned changes to data collections. The platform is developing tools to […]

Continue Reading →

Fear, uncertainty, or doubt? Why the Census and ACS are critical to a well-functioning democracy

[UPDATE 7/10/12: We have received a few flame comments against the ACS which have been deleted without being published. Only comments -- pro or con -- that are actually reasoned with cited references in support of their argument will be published. Flames do nothing to forward the conversation and will not be tolerated. FGI editors]

Recently we posted about a petition to save the American Community Survey. We received the following comment from reader "David":

What a joke this article is. Social infrastructure and welfare distribution worked just fined [sic] without the ACS, which by the way only officially began in 2000. If there is a 'demographics information' demand, the market will provide that information, it is not the role of the government to provide that kind of luxury.

Besides the ACS and the long arm [sic] (its older brother) Census are not constitutionally mandated. But I guess if your [sic] into big gov, dem or repub you don't care about constitutional boundaries.

Normally we would simply delete this comment because FGI isn't a forum for attacking or defending long-established, legal, respected, government information gathering policies. Unfortunately, this comment is similar to many others we see on the web every day about the Census and similar government information programs. You can hardly find a blog post or newspaper article about the census without also finding a comment very much like this one. They share factual inaccuracies, inflammatory accusations, unsubstantiated assertions, falacious slippery slope arguments, and a fervent distrust of government. In short, these arguments display classic characteristics that marketers and propagandists use to spread Fear, Uncertainty, and Doubt (FUD).

It is, of course, impossible to know the motives behind any given individual or comment, but it is probably reasonable to assume that professional politicians and organizations are aware of what they are doing and some individuals are victims of FUD and are repeating what cynical politicians have told them. Regardless of the intentions behind this comment and those like it, it is surely true that they are at best misguided and at worst intentionally misleading.

So, we decided to take the time to respond to these kinds of arguments against the ACS and the Census. We think it might be helpful to document why we believe ACS is both legal and good.

Some Context

Although there have always been some people who express doubts and fears about the Census (Callahan), recently some conservative and libertarian politicians are pushing hard to turn these doubts and fears into policy (Canadian Press). It is reported that Britain will drop its census (UPI) and Canada has already scrapped its own long-form Census (Roman) -- prompting the head of Statistics Canada to resign in protest (Proudfoot). In the U.S., in addition to the bill before Congress (H.R. 5326) that would kill the ACS, the Republican National Committee has described the Census Bureau as behaving like "a scam artist" and recommended (RNC) that the American Community Survey be eliminated (which is what H.R. 5326 does), dumbed-down to make it less accurate (which is what Canada has done to its long-form census), or make the ACS no longer mandatory (Groves) .

What this means is that, although censuses have long been an accepted part of modern governments world-wide and most countries conduct regular censuses of their populations (ESRC), there are those, not just in the U.S., who are actively trying to shut down (Seife) or dumb-down (Milligan) national censuses.

When I told a friend I was working on this post, she said, "Oh, no! You're not trying to correct the Internet, are you?" (Munroe). No, this isn't about silly internet debates. This is about an actual, current political attack on something that, until very recently, almost everyone took as a noble and important part of any functioning government.

Getting the Facts Right

If we are to discuss the efficacy of the census, the first thing we have to do is get our facts right. Let's look at some of the claims that David and others like him make about the Census and the ACS.

Claims about ACS

David gets the facts wrong in a very specific way that -- if he were accurate -- would strengthen his case. But he is not correct when he says, "Social infrastructure and welfare distribution worked just fined without the ACS, which by the way only officially began in 2000."

The Facts. What is actually true is that the ACS replaced the "long-form" questionnaire that had been a part of the decennial census since 1940. The Bureau has conducted longer surveys to gather more detailed information than the decennial census since 1850 (Yglesias). The ACS is part of a long history of the government gathering social and demographic information. The ACS is not a new program that suddenly sprang up in 2000. In fact, the Bureau began testing the ACS methodology in 1996 and the first ACS was in 2005, (not 2000) (GAO).

Claims about Constitutionality and legality

Those who fight to cancel or reduce the accuracy or scope of the Census often claim that it is unconstitutional or otherwise illegal. The RNC makes this claim and says the Census excedes its Constitutionally mandated "scope" (RNC). David writes that "the ACS and the long arm [sic] (its older brother) Census are not constitutionally mandated."

The Facts. The legality of the Census has been repeatedly affirmed by the courts. The gathering of information is done in accordance with the Constitution under the authority of Congress and is regulated by other laws and regulations. It is essential to understand just how important this is. Neither the Census nor the ACS is a wild and crazy abuse of power. It is implemented in a careful, public way under the Constitution, the U.S. Code, and the Code of Federal Regulations -- all with the approval of the courts.

As the Bureau notes, putting the Census in the Constitution (Article I, Section 2) was a significant act of empowering the people over unruly governments:

Enshrining this invention in our Constitution marked a turning point in world history. Previously censuses had been used mainly to tax or confiscate property or to conscript youth into military service. The genius of the Founders was taking a tool of government and making it a tool of political empowerment for the governed over their government. (Census Bureau "Constitution")
The Constitution is, of course, just the starting point of our system of checks and balances designed to prevent abuse. It authorizes our elected representatives to create laws to carry out what the Constitution enables. Congress enacted 13 U.S.C. § 141 which requires a decennial census of population and authorizes The Secretary of Commerce "to obtain such other census information as necessary." The law goes further with its checks and balances by requiring the Bureau to notify Congress of what subjects it will address:
Title 13, U.S. Code, does not specify which subjects or questions are to be included in the decennial census. However, it does require the Census Bureau to notify Congress of general census subjects to be addressed 3 years before the decennial census and the actual questions to be asked 2 years before the decennial census. (Census Bureau "Constitution")

The Congressional law is further implemented by regulations which are created by professionals under the supervision of Presidential appointees, announced to the public for comment in the Federal Register, and codified in the Code of Federal Regulations. In the case of the Census, regulations appear in four CRF Titles: 15, 19, 22, 39 (Cornell).

The Government Accountability Office has determined that The Bureau has the authority under 13 U.S.C. § 141 and 13 U.S.C. § 193 to conduct the ACS (GAO). The information gathering also conforms to the Paperwork Reduction Act, the Confidential Information Protection and Statistical Efficiency Act, and the Privacy Act.

All these laws and regulations are subject to further checks and balances through reviews by the courts. The Bureau notes that the courts have repeatedly confirmed the legality of the Census:

On numerous occasions, the courts have said the Constitution gives Congress the authority to collect statistics in the census. As early as 1870, the Supreme Court characterized as unquestionable the power of Congress to require both an enumeration and the collection of statistics in the census. The Legal Tender Cases, Tex.1870; 12 Wall., U.S., 457, 536, 20 L.Ed. 287. In 1901, a District Court said the Constitution's census clause (Art. 1, Sec. 2, Clause 3) is not limited to a headcount of the population and "does not prohibit the gathering of other statistics, if 'necessary and proper,' for the intelligent exercise of other powers enumerated in the constitution, and in such case there could be no objection to acquiring this information through the same machinery by which the population is enumerated." United States v. Moriarity, 106 F. 886, 891 (S.D.N.Y.1901). (Census Bureau "Constitution")

In short, those who claim that the Census or the ACS are not legal are wrong.

Claims about proper role of government

David says, "If there is a 'demographics information' demand, the market will provide that information, it is not the role of the government to provide that kind of luxury."

The Facts. Is collecting demographic information a "luxury" and an improper role of government? This is, of course, a matter of opinion, but the facts suggest that having official statistics makes government more efficient and effective and provides information useful to the private sector. Making government effective and efficient shouldn't be considered a luxury by anyone, should it? (We'll examine separately below whether or not "the market" could provide this information.)

The gathering of census information is widely recognized as an essential role of government. The private sector supports this activity (Jacobs). The Wall Street Journal calls conducting the ACS a "useful government purpose." The Economist notes that the ACS "helps target government spending on those who need it most." Business Week points out that the Chamber of Commerce, the National Association of Business Economists, and even conservative think tanks like the Cato Institute, the American Enterprise Institute, and the Heritage Foundation support the public collection of data (Philips). Non-profits and social organizations agree (Baker).

The Decennial Census and American Community Survey are critical not only for "demographics" and Congressional apportionment, but for public policy analysis at the Federal, state and local levels. Policy makers can't know how their policies are affecting citizens without data -- and citizens can't know how they are being affected by their elected officials' policies without data! Even Republican mayors and governors say that Census data help them run their states and cities more efficiently (Loth). Elimination of the ACS would not only make government less efficient, but it would eliminate a critical tool for profit-driven businesses (Hogue). The National Retail Federation told Congress that they wanted a mandatory ACS (Shearman). The ACS is even more important now because the long form of the decennial census was discontinued for the 2010 census. (The long form, by the way, used to be required of one household in six; the ACS questionnaire goes to just one in 46.) The ACS and the Census, rather than being "big gov" (with the implication that "big" is always "bad"), actually do much to keep costs down and assure that governments are spending in a wise and targeted fashion.

So, The Census and the ACS are critical tools for a well-functioning country. Which leads us to our next question: Should we leave it to "the market" to provide us those tools?

Claims about The Market

David says, "If there is a 'demographics information' demand, the market will provide that information."

The Facts: Although there are lots of private sector companies that repackage government-provided information and combine it with privately gathered information, there is no evidence that the private sector wants to or can gather the kind of information that the government does. In contrast, see above all the business groups and businesses and business press that are supporters of the ACS and the Census.

But this argument also goes against common sense. Imagine if the government stopped collecting the demographic information that it uses to monitor and run government programs and that businesses use to make business decisions. There would be only two alternative scenarios. First, some or all of that data would go uncollected. That would clearly be bad for everyone. Second, the private sector would collect at least some of the data. This raises a host of questions, though: What data would 'the market' value and collect and what data would be deemed by the market as inessential? Would 'market' data be collected, but data that help governments operate efficiently and effectively be left uncollected? What price would 'the market' charge for the data and who would have access to it or be denied access because of costs? Would privacy of respondents be respected? Would there be any transparency or public accountability for market-gathered information the way there is for government-gathered data? Would respondents be more likely to trust business surveys than government surveys and what effect would that have on the accuracy of those surveys?

In short, claiming that 'the market' can adequately serve the very real needs of government and businesses while providing accurate, affordable information is a claim without support and a claim that is contradicted by the business community itself and by common sense.

Claims about privacy

A favorite claim of those who oppose the ACS and the Census in general is that these surveys are an invasion of citizens' privacy. (David doesn't bring up this point, but it commonly made, so we want to address it here.) The RNC says, for example, that "ACS is an invasion of privacy that demands detailed personal information that the government has no business seeking, knowing, or compiling" and that the Census Bureau is "conducting a dangerous invasion of privacy" and that the Bureau is "violat[ing] the rights and invad[ing] the personal privacy of United States Citizens." These claims are often linked to descriptions of possible, not actual, abuses of power. For example, Ron Paul asks us to "imagine the countless malevolent ways our federal bureaucrats could use this information," and others ask us to imagine "the new American police state" (Loth). David B. Kopel, an associate policy analyst at the Cato Institute, piled on the hypotheticals as far back as 1990, saying the Bureau would "probably" help the Department of Justice and that it "may not even keep its word." Kopel also repeats the story of the federal government's using census data to locate citizens of Japanese descent and imprisoning them in concentration camps.

The Facts. The Census Bureau is bound by Title 13 of the US Code to protect the confidentiality of the information it gathers and it has always done so. Disclosure of confidential Census information is punishable by prison and fines (Census Bureau "Principles"). It cannot share confidential information even with other agencies.

It is against the law for any Census Bureau employee to disclose or publish any private information that identifies an individual or business. This is true even for inter-agency communication: the FBI and other government entities do not have legal right to access census information. In fact, these protections have been challenged, Title 13's privacy and confidentiality guarantees have been upheld. In addition, other federal laws, including the Confidential Information Protection and Statistical Efficiency Act and the Privacy Act, reinforce these protections. (Census Bureau "Privacy")

The purpose of the Census is to create aggregate information, not personal information. In other words, it profiles states, cities, and communities, not individuals, by aggregating (summarizing) the information it gathers from individuals. The personal, individual information is confidential and is not disclosed. The summaries are what we all use every day when we want to know the population of a city of the change in population over time and so forth. The Bureau uses sophisticated statistical procedures to ensure that information about an individual cannot be gleaned from the summary information about cities and communities that it publishes (Census Bureau "Safeguards").

The story of the federal government using census data to locate citizens of Japanese descent in 1942 is at least partially true, but the Census critics oversimplify the facts to the point of making misleading arguments and draw, what I believe to be, the wrong conclusions from it. Until recently, the Bureau has maintained that it did not divulge any confidential information to aid in internment. The Bureau maintained that it did not provide information on individuals, but that it did produce public, summary reports that identified neighborhoods where Japanese-Americans lived (Holmes). The Bureau officially acknowledged and apologized for this activity in 2000 (Watanabe). But a paper published in 2007 (Seltzer and Anderson) says that there is now evidence that the Bureau did, in 1943, provide to the Secret Service information on 79 individuals in the Washington D.C. area. The disclosures were legal at the time (Watanabe). Seltzer and Anderson also say this:

"However, over time, and certainly in retrospect, Census Bureau came to realize that the provision of information gathered under a pledge of statistical confidentiality to law enforcement and intelligence agencies was antithetical to the Bureau's statistical mission.... In the post World War II period, the Census Bureau vigorously sought to end the confidentiality violations sanctioned by the Second War Powers Act."

What lessons can we draw from this experience? I personally believe that the Bureau should not have provided special and personal information during World War II, and am heartened by the fact that the Bureau worked to close confidentiality loopholes in the law. But I am disheartened by the fact that, as recently as 2004, the Bureau provided specially tabulated population statistics on Arab Americans to the Department of Homeland Security (Watanabe). Although the disclosures were legal and only included summary statistics using already-public data, this strikes me as an example of how there are still those who will use "national security" to justify the use of Census data to try to identify individuals.

In short, misuse of Census data has been extremely rare (are there any others than these two?). Misuse of Census data is not a common or recurring or systemic problem. Because the Bureau has a long history of systematic and professional protection of the confidentiality of Census records (see legal and procedural safeguards above and, for example, Benson), hypothesizing worst-case scenarios is more misleading than it is useful or informative.

Summary

I draw two conclusions from all this: First, the benefits of gathering the information with the Census and the ACS far outweigh the possible, but unlikely, abuses of that information. Second, I would rather have this kind of information gathered (as it is being done) under the Constitution, the U.S. Code, and the Code of Federal Regulations by elected officials and public servants under transparent conditions, than by un-accountable private-sector companies.


Endnotes

Continue reading

Continue Reading →

Guide of the Week: U.S. efforts in Iraq, Afghanistan and Pakistan

The Government Accountability Office recently identified our efforts in Iraq, Afghanistan and Pakistan as one of 13 urgent issues facing the next President and Congress. Today on Guide of the Week, we'll talk about some librarian produced guides from the ALA GODORT Exchange Wiki that can help inform citizens, Congress and President-Elect Obama on these countries. Your first stop should be these three excellent country guides produced by the University of Colorado at Boulder (UCB):

Each of the country guides above is divided into the following sections: Government Information, Country Profiles, Articles & Databases, Diplomatic Relations, Health, Peacekeeping & Military Information, Resources in the UCB Catalog and Related Topics. The "Government Information" section contains links to information direct from that country's government. This can be useful if you are looking for an unfiltered view of a country. One thing I really like about the "Country Profiles" section of UCB's country guides is that they don't stop with the CIA World Factbook or the State Department's Background Notes. These sources, while good, are just one perspective. The UCB guides adds profiles from a number of countries plus international organizations. So have a look at how Afghanistan, Iraq and Pakistan are viewed by Australia, Human Rights Watch, the European Commission and the BBC, among others. Once you're done with the country-specific guides, check out the guides below if you need more background (historical, social, etc) on these regions of the world. The microform collection at Berkeley provides historical background on America's activities in the Middle East going back into the 19th Century. This information is not online, but knowing about these collections might help you order materials through Interlibrary Loan. Bert Chapman's guide on Intelligence contains some links to reports on Iraq as well as information about the intelligence communities role in our current wars. Grace York's guide reminds us that the wars in Iraq, Afghanistan and Pakistan are not the only conflicts going on in the world and provides materials on the various conflicts going on in the Middle East. Finally, since both Afghanistan and Iraq were portrayed as being part of the "War on Terror", it seemed appropriate to include Karen Hogenboom's guide to terrorism resources from Federal Depository Libraries. Karen's guide has a link to Kathy Amen's War in Iraq/Afghanistan page that I think you will also find helpful. Since I'm not covering each guide listed here in my usual "Guide of the Week" depth, you know that there is lots more in each of the guides. So go check them out. If you find them helpful, send the guide links to your Senators and Representatives. It's their transition too! And if you're a docs librarian with a guide to some of the urgent issues listed above, then please QUICKLY post your guide to the Handout Exchange. Next week I'll be dealing with librarian produced guides relating to "protecting the homeland." So if you have any guides relating to that topic, please try and post them to the Handout Exchange this week. Continue reading

Continue Reading →

Two new LOCKSS news items

If you're trying to understand how LOCKSS works and why anyone would want their own copies of government data when GPO/Google/[Your Third Party Here] will keep it safe and free forever, check out these two recent news items from LOCKSS:

  • LOCKSS Team featured by Library of Congress (04/24/07) Pioneers of Digital Preservation on the Library of Congress' web site features an overview of the LOCKSS program.
  • Presentation at CNI (04/17/07) Vicky Reich and David Rosenthal talked at the CNI meeting in Phoenix, AZ. Vicky gave an overview of the status of the CLOCKSS program, and David talked on Can We Afford To Preserve Large Databases?.
The LC page not only demonstrates that LOCKSS can be a trusted and TESTED partner in digital preservation, but also explains an excellent plain English explanation of how the system works. David Rosenthal's CNI powerpoint touches on the non-technology reasons why information solely in the hands of the government is at risk, especially his slide 18:
Example: Insider Attack ● Political interference (Hansen 2007): – 2006 Earth Science budget retroactively reduced 20% – ''One way to avoid bad news: stop the measurements!'' – Suppose the data itself turned out to be ''inconvenient'' ... ● Remove it (e.g. EPA pollution database) ● Alter it? ● Independent replicas essential – Independently administered in different jurisdictions – Mutually audited so they're tamper evident
The rest of the presentation is a good though slightly technical primer on performance requirements for digital preservation and the need for further research. Also has some scary things to say about RAID. More evidence that no ONE system, not even a Future Digital one, is enough to safeguard America's government information. No system is safe from its parent - particularly when that parent is so reluctant to fully fund information access and preservation. Continue reading

Continue Reading →

Nevada Library Assn presentation: Preservation

Note: While these remarks are based on the outline I used to prepare my digital preservation talk at the Nevada Library Association, they are not identical to my talk. I work from talking points instead of a script and as a result my remarks will vary each time I give a talk using the same outline. In 1999, the National Endowment for the Humanities (NEH) established an ambitious web-based family history project called "My History is America's History." "My History" aimed to assemble a history of America through the eyes of its citizens from all walks of life. At its height, the site contained hundreds of contributed stories and an extensive guide to collecting and preserving family history. Without warning or explanation, NEH shut down the site in 2002. To this day, no one knows why. To some people at the time, this wasn't a problem. They knew about the Internet Archive, a private project that aimed to collect and preserve the entire Internet. Someone else had obviously taken care of the problem. If only. The well-intentioned Internet Archive spider has limitations. One of these is that it cannot retrieve content behind a search box. So, when you go to an Internet Archive page for "My History is America's History", you get the FOUR family stories that happened to be featured on the front page. Nearly everything else is lost. This is part of the price we pay when we expect "someone else" to solve a problem for us. It's not just family history that is fading away. According to the MIT Technology Review article The Fading Memory of the State, millions of government records and even bits of military history have either disappeared or are on the verge of disappearing. I can say with confidence that we know more about the inner workings of our government in 1945 than future researchers will ever know about the inner workings of government in 2005. And that's assuming total access to intact materials. In today's fifteen minute discussion of Digital Preservation, I wish to address the unique challenges represented by digital materials, cover a few proposed solutions, and end with a few actions you can take today to keep the past from fading away. Challenges While preservation is a concern regardless of format, digital materials offer unique challenges that librarians and other stakeholders must be aware of. Most importantly, no one knows what will work. We know how to preserve paper and have made that last centuries. We're pretty sure how to preserve microforms and some has already survived decades. With the right conditions, we think we can make microforms last centuries too. And as long as humans still have magnifying glasses and light sources, microforms can be read. For both formats, there are established best practices. Not so with digital materials. There are some promising approaches, but no one is claiming they know what will work. This is part of the reason that we at Free Government Information are so insistent on the library community retaining multiple copies of electronic documents. With many documents, many approaches can be tried, and it is likely that at least one approach will work. If we only have one or even ten copies, it seems likely that all will fail. Another unique challenge is media fragility. Paper can last a century or so without special treatment if you're not in the tropics. By contrast, most magnetic media (tapes, floppy disks, hard drives) begin to have major problems after ten years. Optical media (CDs, DVDs) may last 50 years or more IF kept under PERFECT conditions. Keeping optical media under less than ideal conditions may shorten their life to less than a decade. A worse problem than fragile digital media is standards obsolescence. Who here remembers WordStar? Who still has Wang word processors? Popular software such as Microsoft Office changes quickly and older versions often get left behind. Unless there is a market for it. Some formats endure longer than others. ASCII text has been around for decades and HTML is starting its second decade. But these formats don't convey rich information like graphics or fancy formatting. Related to the problem of standards obsolescence is that of closed, proprietary standards. A proprietary standard is one that is owned by a private party and subject to licensing. A closed standard is one where the full file specification is not published. An example of a closed proprietary standard is Microsoft Word. Microsoft does not share the full standard with others. This leads to other word processors (like WordPerfect) to showing a Word document in a different format (for me, pink strike through) than the document showed in word. Closed and proprietary are NOT the same thing. The Adobe PDF and MP3 are two files that are open and proprietary. This means that the full file specification has been published, but the owning company decides who can write software that creates or modifies the files. What does this mean to digital preservation? It means that if you are writing a program to convert older files to a newer format, you must get a license from the owning company, and if the standard is closed, your program will fail to fully reproduce the look and feel of the original document. That's assuming you can find the owning company. Closed proprietary standards of bankrupt companies may be permanently unreadable. Another problem unique to digital materials is a technology called digital rights management or DRM. DRM are technology imposed restrictions on a file's ability to be printed, copied, or even viewed. DRM software can be configured to automatically delete files after a period determined by the publisher. DRM is not copyright aware, so it knows nothing of fair use. It has no timer to put it in the public domain. DRM technology can be used to "protect" public domain materials. A file that is locked down through DRM cannot be successfully migrated to a new format because the DRM software will interpret that as an effort to make an "illegal" copy even though that is a right under copyright law. A final unique challenge presented to us by digital materials is that of access versus ownership. Everyone in the world in 2000 had access to "My History is America's History" In a growing number of states, including Alaska, citizens have access to a large collection of electronic journals. This is great and its a fact that people have more access now than any other time in history -- but what happens if the vendor goes under or a state can't pay its database bill? People are left with nothing. What if the vendor is the government and takes down documents that are inconvenient to the current administration? Citizens lose access. You cannot preserve what you do not possess. II. Possible Solutions So what can we do with digital documents that we've acquired? If the file is sufficiently "document-like", you can make a tangible backup (i.e. Print it out or have it microfilmed). This is what we do in Alaska for "born-digital" publications. We print them off and put them into a non-circulating collection for preservation. The Census Bureau has taken the tangible backup approach as well. They are required to make individual household questionnaires available to the public after 72 years. So how do they intend to fulfill their responsibility to make the individual questionnaires available to the public in 2072? The Bureau is microfilming them. They've also created a digital file that they HOPE will be readable in 2072, but they KNOW the microfilm will be there for future genealogists. A number of large corporations write their important digital files to microfilm using Computer Output Microfilm (COM) technology. The "tangible backup" approach has its limitations. It can only be applied to digital objects that have an analog equivalent. Thus it cannot be used for databases and multimedia like movies and audio recordings. Even for files with an analog equivalent, a tangible backup loses functionality such as in-document searching. The most basic thing that can be done for digital files is refreshing, which is copying files onto a new medium. This will guard against media failure, but does nothing to protect you from standards obsolescence Migration is one of the promising approaches to digital preservation. Migrating a file means moving it from one format to another (i.e. WordPerfect to Word, or Word 95 to Word 2002). For maximum success, it should be done as soon as a new format looks like it is taking market share away from existing formats. If a file format has an open specification and is non-proprietary, like XML or PDF/A, it is more likely to have a successful migration path than a closed proprietary standard. If a file is in a closed proprietary format, or has been copy disabled through DRM, migration may be impossible. Emulation, the practice of using current software and hardware to simulate a different (often older) environment is a promising digital practice and may offer the best chance of retaining the look and feel of older digital materials. Unlike tangible backups, emulation could be used to preserve databases and multimedia. There are examples of working emulators today, most notably for old video games and SoftWindows, which allows a Mac computer to mostly act like a Windows box. Emulation has its problems too. It depends on open standards in operating systems and file formats to be able to write software that is faithful to the original look and feel. SoftWindows isn't Windows, as both Mac and Windows users can tell you. Unless we can specify the original operating system, the original application that created a file, and the original file specification, emulation isn't going to work. Even when we have all these items, emulation can be difficult. What all of the above solutions have in common is that they cannot be implemented in the presence of strong Digital Rights Management (DRM). If DRM doesn't allow printing, you cannot make a tangible backup. If it doesn't allow copying, you cannot refresh the file to a new media and you likely cannot upgrade to a newer format. DRM software is invariably closed and proprietary, so you will not be able to recreate the original DRM scheme in an emulator. The “digital rights” in DRM is for the publisher, not you. That is why we at Free Government Information have repeatedly called on the Government Printing Office to publicly commit to rejecting DRM and are disappointed they have not done so. III. What you can do today to save the past But let's not end on a downer. If we have the will and interest, there are things that we can do to assure the preservation of important "born digital" documents:

  • Start by taking the Cornell University Digital Preservation Tutorial. No matter your level or lack of digital expertise, the people at Cornell will teach you something with their free tutorial. They even have a brief placement test.
  • Keep local copies of e-pubs that are important to you, even if you don't know what to do with them. Maybe you can find a partner who does!
  • Think about joining the LOCKSS (Lots of Copies Keep Stuff Safe) alliance that has been preserving e-journals for several years and may expand to federal materials.
  • Advocate for open data and operating system standards (i.e. Use or suggest open source whenever possible). This isn't a pipe dream – it is already a reality in Massachusetts and several foreign countries.
  • Finally, if you run a Federal Depository Library, make plans today to store and serve federal e-content on local servers, whether or not those federal publications are in the Federal Depository Library Program. Post those plans to govdoc-l.
Although I could talk about digital preservation for hours, like my partners I was alloted 15 minutes. So I hope that I have given you a taste of what's at stake, what are the unique challenges associated with digital materials, what some of the solutions MIGHT be, and hopefully encouraged and empowered some of you to start taking charge of our “born digital” heritage. It will be difficult and costly, but as the only institutions with a specific charge and interest in preserving our historical, cultural, and scientific heritage, libraries must lead as best they can. We are a rare "yesterday and tomorrow" institution in a "only today matters" society. We know that today will be unworkable if people cannot find and use what was done yesterday to plan for tomorrow. Thank you all for being here today. Especially the two-thirds of you that are not documents librarians. It just thrills me to see the wider community caring about the huge amount of federal information that is at stake. Thank you! Continue reading

Continue Reading →

Latest Posts

Latest Comments

Blogroll

Archives

Meta

Archives

Powered by WordPress / Academica WordPress Theme by WPZOOM