z-logo
open-access-imgOpen Access
Selecting Efficient and Reliable Preservation Strategies
Author(s) -
Micah Altman,
Richard L. Landau
Publication year - 2020
Publication title -
international journal of digital curation
Language(s) - English
Resource type - Journals
ISSN - 1746-8256
DOI - 10.2218/ijdc.v15i1.727
Subject(s) - computer science , flexibility (engineering) , set (abstract data type) , risk analysis (engineering) , event (particle physics) , reliability (semiconductor) , range (aeronautics) , scenario analysis , function (biology) , operations research , medicine , power (physics) , statistics , physics , quantum mechanics , evolutionary biology , engineering , composite material , biology , programming language , materials science , mathematics
This article addresses the problem of formulating efficient and reliable operational preservation policies that ensure bit-level information integrity over long periods, and in the presence of a diverse range of real-world technical, legal, organizational, and economic threats. We develop a systematic, quantitative prediction framework that combines formal modeling, discrete-event-based simulation, hierarchical modeling, And then use empirically calibrated sensitivity analysis to identify effective strategies. Specifically, the framework formally defines an objective function for preservation that maps a set of preservation policies and a risk profile to a set of preservation costs, and an expected collection loss distribution. In this framework, a curator’s objective is to select optimal policies that minimize expected loss subject to their budget constraint. To estimate preservation loss under different policy conditions optimal policies, we develop a statistical hierarchical risk model – that statistically four sources of risk: the storage hardware; the physical environment; the curating institution; and the global environment. We then employ a general discrete event-based simulation framework to evaluate the expected loss and cost of employing varying preservation strategies under specific parameterization of risks. The framework offers flexibility for the modeling of a wide range of preservation policies and threats. Since this framework is open source, and easily deployed in a cloud computing environment, it can be used to produce analysis based on independent estimates of scenario-specific costs, reliability, and risks. We present results summarizing hundreds of thousands of simulations using this framework. This analysis points to a number of robust and broadly applicable preservation strategies, provides novel insights into specific preservation tactics, and provides evidence that challenges received wisdom.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here