Digital library - Preservation and Long Term Management
Understand digital preservation goals, key strategies (migration, emulation, bit‑stream), and standards like OAIS for long‑term management.
Summary
Read Summary
Flashcards
Save Flashcards
Quiz
Take Quiz
Quick Practice
What is the primary goal of digital preservation?
1 of 10
Summary
Digital Preservation
Introduction
Digital materials face a unique challenge that printed books do not: technological obsolescence. A document saved on a floppy disk in 1990 cannot be easily read today because we lack the hardware and software to access it. Digital preservation addresses this problem by developing strategies to keep digital materials accessible and interpretable indefinitely into the future.
The core challenge of digital preservation is that digital information depends on working technology. Unlike physical books that remain readable for centuries if stored properly, digital files are vulnerable to format obsolescence, media degradation, and loss of the software or operating systems needed to interpret them. Without deliberate preservation strategies, valuable digital materials risk becoming inaccessible within just decades.
Preservation Goals and Core Strategies
Digital preservation seeks to maintain the interpretability of digital materials—meaning they remain readable and usable even as technology changes around them. To achieve this, preservation professionals use three main approaches:
Migration involves converting digital content from older formats to newer, currently-supported formats. For example, converting a document from an outdated word processor format to PDF or modern formats ensures it can be opened by contemporary software. Migration is straightforward but can introduce subtle changes during conversion, and you must repeat the process each time a new format becomes standard.
Emulation preserves the original digital object by recreating its original operating environment as a virtual machine. Instead of changing the file format, emulation runs the original software on a simulated computer that mimics the hardware and operating system from when the file was created. This approach maintains authenticity—the digital object itself never changes—but requires maintaining detailed technical specifications of historical systems.
Bit-stream preservation stores the raw, unchanged bits of the original digital file indefinitely. While this sounds simple, it actually requires significant infrastructure. The preserved bits remain meaningless without someone knowing how to interpret them later, so bit-stream preservation alone must be paired with careful documentation about what the file format is and how to render it.
Preservation Standards and Frameworks
The field of digital preservation has developed important standards and frameworks to guide institutions in building sustainable preservation systems.
The Open Archival Information System (OAIS) model provides a reference architecture that defines how digital materials should be organized, managed, and preserved. OAIS distinguishes between different types of information packages: the original materials (Submission Information Package), the internal preservation format (Archival Information Package), and the format delivered to users (Dissemination Information Package). This structured approach helps institutions understand and implement preservation at scale.
Trusted Digital Repository (TDR) standards establish criteria for what makes a digital repository trustworthy. A trusted repository must demonstrate reliability (it actually stores and maintains materials as promised), authenticity (it can verify that materials haven't been altered), and sustainability (it has the long-term organizational and financial capacity to continue preservation operations). These standards matter because institutions and individuals need confidence that their digital materials will actually be preserved if they deposit them in a repository.
Risk Management and Sustainability
Successful digital preservation requires ongoing attention to multiple risks. Media degradation is a physical threat—optical discs can degrade over time, and magnetic tape can become unreadable. Software dependencies create obsolescence risks when specialized applications become unsupported. Funding volatility threatens the organizational ability to maintain preservation systems over decades.
Addressing these risks requires proactive planning. Libraries and archives assess which materials face the highest risk and prioritize those for preservation. Long-term sustainability plans include regular technology refresh cycles, where preservation systems are deliberately updated to current technology before the old systems become completely obsolete. Community partnerships also strengthen sustainability by distributing responsibility across multiple organizations—if one organization fails, others maintain copies.
Preservation Tools and Services
Digital preservation relies on practical tools and automated processes to function at scale.
Fixity checking uses checksums—special numerical codes computed from file data—to verify that digital files haven't become corrupted over time. When a file is first preserved, its checksum is calculated and stored. Periodically, the checksum is recalculated; if it differs, this indicates the file data has changed, and administrators can take corrective action before the corruption spreads. This automated monitoring allows institutions to catch problems before materials become permanently damaged.
Preservation services also perform batch conversions of legacy formats into contemporary, accessible standards. For example, a library might convert thousands of documents from proprietary formats into widely-supported formats like PDF/A (an archival-friendly PDF variant), ensuring accessibility for researchers and the public.
<extrainfo>
Example: Google Books
Google's book-scanning project exemplifies digital preservation at massive scale. By digitizing millions of books, Google increased access to published knowledge while creating digital backups that preserve these materials against physical deterioration. This project demonstrates how digital preservation can serve the dual goals of long-term archival safety and immediate public access.
</extrainfo>
Flashcards
What is the primary goal of digital preservation?
To keep digital media interpretable indefinitely.
Which three primary methods are used to ensure digital media remains interpretable?
Migration
Emulation
Bit-stream preservation
How does migration help preserve digital content?
By moving content to newer formats to prevent obsolescence.
In the context of digital preservation, what is the purpose of emulation?
To reproduce original operating environments as virtual machines.
What does bit-stream preservation ensure regarding a digital object?
That the exact raw file bits remain unchanged.
What is the purpose of Google’s book-scanning project?
To digitize millions of books for broad access.
What three aspects must a holistic approach to digital preservation planning consider?
Technical aspects
Organizational aspects
Financial aspects
What model serves as the reference architecture for preserving digital information?
The Open Archival Information System (OAIS) model.
Which criteria are defined by Trusted Digital Repository standards?
Reliability
Authenticity
Sustainability
How do automated preservation tools monitor file fixity?
By using checksums to detect and alert administrators to corruption.
Quiz
Digital library - Preservation and Long Term Management Quiz Question 1: Which method involves converting files to newer formats?
- Migration (correct)
- Emulation
- Bit‑stream preservation
- Physical duplication
Digital library - Preservation and Long Term Management Quiz Question 2: What does bit‑stream preservation aim to preserve?
- The exact raw file bits (correct)
- The visual appearance only
- The software functionality only
- The metadata summary
Digital library - Preservation and Long Term Management Quiz Question 3: Which project digitizes millions of books for broad access?
- Google’s book‑scanning project (correct)
- eGranary offline digital library
- Library of Congress paper archiving
- Project Gutenberg audio collection
Digital library - Preservation and Long Term Management Quiz Question 4: Which of the following is NOT typically part of a holistic digital preservation approach?
- Marketing strategies (correct)
- Technical considerations
- Organizational considerations
- Financial considerations
Digital library - Preservation and Long Term Management Quiz Question 5: Which of the following is a risk that libraries assess in digital preservation?
- Media degradation (correct)
- Increased user demand
- Improved software performance
- Expanded storage capacity
Digital library - Preservation and Long Term Management Quiz Question 6: Which method is used by automated tools to monitor file integrity?
- Checksums (correct)
- File size comparison
- Timestamp analysis
- Color histogram
Digital library - Preservation and Long Term Management Quiz Question 7: What service might preservation teams provide to handle outdated file formats?
- Batch conversion of legacy formats (correct)
- Physical printing of digital files
- Deleting old files
- Encrypting all files
Digital library - Preservation and Long Term Management Quiz Question 8: Which preservation technique maintains the original software environment to enable future access to digital objects?
- Emulation (correct)
- Migration
- Bit‑stream preservation
- Physical backup
Digital library - Preservation and Long Term Management Quiz Question 9: In the context of digital preservation, what does the acronym OAIS stand for?
- Open Archival Information System (correct)
- Operational Archive Integration Service
- Organized Access and Indexing Scheme
- Online Access and Information Store
Which method involves converting files to newer formats?
1 of 9
Key Concepts
Digital Preservation Techniques
Digital preservation
Migration (digital preservation)
Emulation (digital preservation)
Bit‑stream preservation
Digital Preservation Frameworks
Open Archival Information System (OAIS)
Trusted Digital Repository
Risk management in digital preservation
Long‑term sustainability planning
Preservation tools (checksum/fixity)
Definitions
Digital preservation
The practice of maintaining digital media so it remains accessible and interpretable over the long term.
Migration (digital preservation)
The process of moving digital content to newer file formats or storage media to avoid obsolescence.
Emulation (digital preservation)
The technique of recreating original hardware or software environments virtually to render legacy digital objects.
Bit‑stream preservation
The method of preserving the exact binary representation of a digital object without alteration.
Open Archival Information System (OAIS)
An ISO‑standard reference model that defines the functions of an archival system for preserving and providing access to digital information.
Trusted Digital Repository
A digital repository that meets defined criteria for reliability, authenticity, and long‑term sustainability of its holdings.
Risk management in digital preservation
The systematic identification, assessment, and mitigation of threats such as media degradation, software dependencies, and funding volatility.
Preservation tools (checksum/fixity)
Automated software that monitors digital files for integrity using cryptographic checksums and alerts administrators to corruption.
Long‑term sustainability planning
Strategies that ensure ongoing resources, technology refresh cycles, and community partnerships to support digital preservation efforts.