Vint Cerf, Google’s “Internet Evangelist,” speaking at the Computer World Honors awards program on Monday, warned about the dangers and difficulties of long-term digital preservation. He said what’s needed is a “digital vellum” to do long-term digital preservation in the same way as physical media has been preserved. Perhaps he needs to talk to libraries 😉
Cerf warned that digital things created today — spreadsheets, documents, presentations as well as mountains of scientific data — won’t be readable in the years and centuries ahead.
Cerf illustrated the problem in a simple way. He runs Microsoft Office 2011 on Macintosh, but it cannot read a 1997 PowerPoint file. “It doesn’t know what it is,” he said.
“I’m not blaming Microsoft,” said Cerf, who is Google’s vice president and chief Internet evangelist. “What I’m saying is that backward compatibility is very hard to preserve over very long periods of time.”
The data objects are only meaningful if the application software is available to interpret them, Cerf said. “We won’t lose the disk, but we may lose the ability to understand the disk.”
It’s not just PowerPoint slides either, he said. The scientific community collects large amounts of data from simulations and instrument readings. But unless the metadata survives, which will tell under what conditions the data was collected, how the instruments were calibrated, and the correct interpretation of units, the information may be lost.