Data Practices and Curation Vocabulary (DPCVocab): An empirically derived framework of scientific data practices and curatorial processes
Author(s) -
Chao Tiffany C.,
Cragin Melissa H.,
Palmer Carole L.
Publication year - 2015
Publication title -
Journal of the Association for Information Science and Technology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.903
H-Index - 145
eISSN - 2330-1643
pISSN - 2330-1635
DOI - 10.1002/asi.23184
Subject(s) - terminology, data curation, vocabulary, computer science, digital curation, knowledge management, data science, world wide web, linguistics, philosophy
Conceptual frameworks and taxonomies are an important part of the emerging base of knowledge on the curation of research data. We present the Data Practices and Curation Vocabulary (DPCVocab), a functional vocabulary created for specifying relationships among data practices in research, types of data produced and used, and curation roles and activities. The vocabulary consists of 3 categories (Research Data Practices, Data, and Curation) with 187 terms validated through empirical studies of scientific data practices in the Earth and life sciences. The present article covers the DPCVocab development process and examines applications for mapping relationships across the 3 categories, identifying factors for projecting curation costs and important differences in curation requirements across disciplines. As a tool for curators, the vocabulary provides a framework for charting curation options and guiding systematic administration of curation services. It can serve as a shared terminology or lingua franca to support interactions and collaboration among curators, data producers, system developers, and other stakeholders in data infrastructure and services. The DPCVocab as a whole supports both the technical and the human aspects of professional curation work essential to the modern research system.