
Microbial natural product databases: moving forward in the multi-omics era
Author(s) -
Jeffrey A. van Santen,
Satria A. Kautsar,
Marnix H. Medema,
Roger G. Linington
Publication year - 2021
Publication title -
Natural Product Reports
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.703
H-Index - 177
eISSN - 1460-4752
pISSN - 0265-0568
DOI - 10.1039/d0np00053a
Subject(s) - interoperability, computer science, data science, field (mathematics), key (lock), inference, database, data curation, knowledge extraction, world wide web, data mining, artificial intelligence, mathematics, computer security, pure mathematics
Covering: 2010-2020
The digital revolution is driving significant changes in how people store, distribute, and use information. With the advent of new technologies around linked data, machine learning and large-scale network inference, the natural products research field is beginning to embrace real-time sharing and large-scale analysis of digitized experimental data. Databases play a key role in this, as they allow systematic annotation and storage of data for both basic and advanced applications. The quality of the content, structure, and accessibility of these databases all contribute to their usefulness for the scientific community in practice. This review covers the development of databases relevant for microbial natural product discovery during the past decade (2010-2020), including repositories of chemical structures/properties, metabolomics, and genomic data (biosynthetic gene clusters). It provides an overview of the most important databases and their functionalities, highlights some early meta-analyses using such databases, and discusses basic principles to enable widespread interoperability between databases. Furthermore, it points out conceptual and practical challenges in the curation and usage of natural products databases. Finally, the review closes with a discussion of key action points required for the field moving forward, not only for database developers but for any scientist active in the field.