
Machado: Open source genomics data integration framework
Author(s) - Maurício de Alvarenga Mudadu, Adhemar Zerlotini
Publication year - 2020
Publication title - GigaScience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 2.947
H-Index - 54
ISSN - 2047-217X
DOI - 10.1093/gigascience/giaa097
Subject(s) - computer science, python (programming language), software, visualization, data integration, world wide web, annotation, data science, relational database, open source, genomics, information retrieval, biological database, database schema, database, data mining, database design, bioinformatics, genome, programming language, biology, gene, biochemistry, chemistry, artificial intelligence
Genome projects and multiomics experiments generate huge volumes of data that must be stored, mined, and transformed into useful knowledge. All of this information should be accessible and, where possible, browsable afterwards. Computational biologists have been dealing with this scenario for more than a decade and have implemented software and databases to meet the challenge. Chado, the biological relational database schema of GMOD (Generic Model Organism Database), is one of the few successful open source initiatives; it is widely adopted, and many software packages are able to connect to it.