z-logo
open-access-imgOpen Access
Quantifying the dark data in museum fossil collections as palaeontology undergoes a second digital revolution
Author(s) -
Charles R. Marshall,
Seth Finnegan,
Erica C. Clites,
Patricia A. Holroyd,
Nicole Bonuso,
Célia Martins Cortez,
Edward Davis,
Gregory P. Dietl,
Patrick S. Druckenmiller,
Ronald Eng,
Christine Garcia,
Kathryn Estes-Smargiassi,
Austin Hendy,
Kathy Hollis,
Holly Little,
Elizabeth A. Nesbitt,
Peter D. Roopnarine,
L. Skibinski,
Jann E. Vendetti,
Lisa D. White
Publication year - 2018
Publication title -
biology letters
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.596
H-Index - 110
eISSN - 1744-957X
pISSN - 1744-9561
DOI - 10.1098/rsbl.2018.0431
Subject(s) - digitization , fossil record , paleontology , paleobiology , biology , ecology , archaeology , geography , computer science , computer vision
Large-scale analysis of the fossil record requires aggregation of palaeontological data from individual fossil localities. Prior to computers, these synoptic datasets were compiled by hand, a laborious undertaking that took years of effort and forced palaeontologists to make difficult choices about what types of data to tabulate. The advent of desktop computers ushered in palaeontology's first digital revolution-online literature-based databases, such as the Paleobiology Database (PBDB). However, the published literature represents only a small proportion of the palaeontological data housed in museum collections. Although this issue has long been appreciated, the magnitude, and thus potential significance, of these so-called 'dark data' has been difficult to determine. Here, in the early phases of a second digital revolution in palaeontology--the digitization of museum collections-we provide an estimate of the magnitude of palaeontology's dark data. Digitization of our nine institutions' holdings of Cenozoic marine invertebrate collections from California, Oregon and Washington in the USA reveals that they represent 23 times the number of unique localities than are currently available in the PBDB. These data, and the vast quantity of similarly untapped dark data in other museum collections, will, when digitally mobilized, enhance palaeontologists' ability to make inferences about the patterns and processes of past evolutionary and ecological changes.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom