Open Access
The phylogeny of a dataset
Author(s) -
Thomer Andrea K.,
Weber Nicholas M.
Publication year - 2014
Publication title -
proceedings of the american society for information science and technology
Language(s) - English
Resource type - Journals
eISSN - 1550-8390
pISSN - 0044-7870
DOI - 10.1002/meet.2014.14505101064
Subject(s) - phylogenetic tree , data science , cluster analysis , computer science , phylogenetics , field (mathematics) , spawn (biology) , evolutionary biology , biology , artificial intelligence , ecology , mathematics , biochemistry , gene , pure mathematics
ABSTRACT The field of evolutionary biology offers many approaches to study the changes that occur between and within generations of species; these methods have recently been adopted by cultural anthropologists, linguists and archaeologists to study the evolution of physical artifacts. In this paper, we further extend these approaches by using phylogenetic methods to model and visualize the evolution of a long‐standing, widely used digital dataset in climate science. Our case study shows that clustering algorithms developed specifically for phylogenetic studies in evolutionary biology can be successfully adapted to the study of digital objects, and their known offspring. Although we note a number of limitations with our initial effort, we argue that a quantitative approach to studying how digital objects evolve, are reused, and spawn new digital objects represents an important direction for the future of Information Science.