
rBEF data: documenting data exchange and analysis for a collaborative data management platform
Author(s) -
Pfaff ClaasThido,
KönigRies Birgitta,
Lang Anne C.,
Ratcliffe Sophia,
Wirth Christian,
Man Xingxing,
Nadrowski Karin
Publication year - 2015
Publication title -
ecology and evolution
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.17
H-Index - 63
ISSN - 2045-7758
DOI - 10.1002/ece3.1547
Subject(s) - metadata , computer science , upload , scripting language , documentation , data management , download , information retrieval , metadata repository , data exchange , data science , data mining , world wide web , database , programming language , operating system
We are witnessing a growing gap separating primary research data from derived data products presented as knowledge in publications. Although journals today more often require the underlying data products used to derive the results as a prerequisite for a publication, the important link to the primary data is lost. However, documenting the postprocessing steps of data linking, the primary data with derived data products has the potential to increase the accuracy and the reproducibility of scientific findings significantly. Here, we introduce the rBEF data R package as companion to the collaborative data management platform BEF data. The R package provides programmatic access to features of the platform. It allows to search for data and integrates the search with external thesauri to improve the data discovery. It allows to download and import data and metadata into R for analysis. A batched download is available as well which works along a paper proposal mechanism implemented by BEF data. This feature of BEF data allows to group primary data and metadata and streamlines discussions and collaborations revolving around a certain research idea. The upload functionality of the R package in combination with the paper proposal mechanism of the portal allows to attach derived data products and scripts directly from R, thus addressing major aspects of documenting data postprocessing. We present the core features of the rBEF data R package along an ecological analysis example and further discuss the potential of postprocessing documentation for data, linking primary data with derived data products and knowledge.