Archiving research data refers to the long-term storage and preservation of data. This preservation of research data ensures data are accessible while retaining their integrity and authenticity over time. The location and process of archiving data should be considered in data management planning including maintaining data integrity, preserving data provenance, allowing data sharing and reuse, and complying with institutional and funder regulations. Many of these preservation actions overlap with the guidance in the Publishing and Sharing section, and similarly, a sharing platform such as a repository may also offer preservation and archiving services. Here are a few key components of data preservation:
Shared through the LTER Network DataBits Stories, by John Porter & An Nguyen with input from the LTER IM Committee
The Purdue University Research Repository (PURR) is Purdue's institutional data repository that supports long-term preservation and access. Looking at PURR's digital preservation policies can give researchers a glimpse into how this process works and what can be expected when preparing to preserve your data. PURR will provide preservation support for as many formats as possible but the system considers three levels of support for archiving data:
Here are a few examples of sustainable, supported, and unsustainable formats:
| File Type | Sustainable | Supported | Unsustainable |
| Word Processing | PDF/A, OpenDocument Text | PDF/B, Microsoft Word, Microsoft Open XML, Rich Text Format | CorelWordPerfect, Lotus WordPro, PDF |
| Plain Text | Plain Text, Comma-separated file, Tab-delimited file | ||
| Structural Markup | SGML w/DTD, XML w/DTD | SGML w/o DTD, XML w/o DTD | |
| Spreadsheets | Comma-separated file, Tab-delimited file, PDF/A | Microsoft Excel, Microsoft Excel Open XML | |
| Databases | Delimited Flat File w/DDL | Microsoft Access, dBase Format | |
| Audio | WAVE | AIFF (uncompressed), Standard MIDI, MPEG, MP2AAC | Audio CD, DVD-Audio, RealAudio, Shorten, RIFF-RMID, Extended MIDI |
| Video | AVI, MPEG-1, MPEG-2, MPEG-4, Quicktime | Windows Media Video | |
| Images | TIFF, JPG 2000 | JPEG, PNG, PDF/A, GIF | RAW, Adobe Photoshop, PDF |
When choosing a location to archive your research data, it is important to find out the level of support provided for preservation through policies. As an example, PURR offers a preservation support policy following the sustainable, supported, and unsustainable specifications described above. The following show which preservation actions are supported by PURR and at what level:
Bit-level preservation
Limited Preservation
Full Preservation