Premium
Data libraries – the missing element for modeling biological systems
Author(s) -
Baryshnikova Anastasia
Publication year - 2020
Publication title -
the febs journal
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.981
H-Index - 204
eISSN - 1742-4658
pISSN - 1742-464X
DOI - 10.1111/febs.15261
Subject(s) - data curation , bottleneck , computer science , data science , reusability , process (computing) , visibility , data integration , data management , world wide web , information retrieval , database , physics , optics , software , programming language , embedded system , operating system
The primary bottleneck in understanding and modeling biological systems is shifting from data collection to data analysis and integration. This process critically depends on data being available in an organized form, so that they can be accessed, understood, and reused by a broad community of scientists. A proven solution for organizing data is literature curation, which extracts, aggregates, and distributes findings from publications. Here, I describe the benefits of extending curation practices to datasets, especially those that are not deposited in centralized databases. I argue that dataset curation (or ‘data librarianship’ as I suggest we call it) will overcome many barriers in data visibility and reusability and make a unique contribution to integration and modeling.