z-logo
open-access-imgOpen Access
Towards the Development of a Test Corpus of Digital Objects for the Evaluation of File Format Identification Tools and Signatures
Author(s) -
Andrew Fetherston,
Tim Gollins
Publication year - 2012
Publication title -
international journal of digital curation
Language(s) - English
Resource type - Journals
ISSN - 1746-8256
DOI - 10.2218/ijdc.v7i1.211
Subject(s) - computer science , identification (biology) , transparency (behavior) , set (abstract data type) , data science , test (biology) , world wide web , information retrieval , digital preservation , point (geometry) , software engineering , human–computer interaction , computer security , programming language , paleontology , botany , biology , geometry , mathematics
The digital preservation community currently utilises a number of tools and automated processes to identify and validate digital objects. The identification of digital objects is a vital first step in their long-term preservation, but the results returned by tools used for this purpose are lacking in transparency, and are not easily tested or verified. This paper suggests that a test corpus of digital objects is one way of providing this verification and validation, ultimately improving trust in the tools, and providing further stimulus to their development. Issues to be considered are outlined, and attention is drawn to particular examples of existing digital corpora which could conceivably provide a useable framework or starting point for our own communities needs. This paper does not seek to answer all questions in this area, but merely attempts to set out areas for consideration in any next step that is taken

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom