A Short Story about XML Schemas, Digital Preservation and Format Libraries
Author(s) -
Steve Knight
Publication year - 2012
Publication title -
international journal of digital curation
Language(s) - English
Resource type - Journals
ISSN - 1746-8256
DOI - 10.2218/ijdc.v7i1.215
Subject(s) - computer science , xml , xml schema editor , world wide web , document structure description , efficient xml interchange , schema (genetic algorithms) , xml schema (w3c) , xml validation , process (computing) , streaming xml , digital library , information retrieval , xml signature , programming language , art , literature , poetry
One morning we came in to work to find that one of our servers had made 1.5 million attempts to contact an external server in the preceding hour. It turned out that the calls were being generated by the Library’s digital preservation system (Rosetta) while attempting to validate XML Schema Definition (XSD) declarations included in the XML files of the Library’s online newspaper application Papers Past, which we were in the process of loading into Rosetta. This paper describes our response to this situation and outlines some of the issues that needed to be canvassed before we were able to arrive at a suitable solution, including the digital preservation status of these XSDs; their impact on validation tools, such as JHOVE; and where these objects should reside if they are considered material to the digital preservation process
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom