Methods for biological data integration: perspectives and challenges
Author(s) -
Vladimir Gligorijević,
Nataša Pržulj
Publication year - 2015
Publication title -
journal of the royal society interface
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.655
H-Index - 139
eISSN - 1742-5689
pISSN - 1742-5662
DOI - 10.1098/rsif.2015.0571
Subject(s) - computer science , biological data , data integration , biological network , data science , data type , data mining , bioinformatics , biology , programming language
Rapid technological advances have led to the production of different types of biological data and enabled construction of complex networks with various types of interactions between diverse biological entities. Standard network data analysis methods were shown to be limited in dealing with such heterogeneous networked data and consequently, new methods for integrative data analyses have been proposed. The integrative methods can collectively mine multiple types of biological data and produce more holistic, systems-level biological insights. We survey recent methods for collective mining (integration) of various types of networked biological data. We compare different state-of-the-art methods for data integration and highlight their advantages and disadvantages in addressing important biological problems. We identify the important computational challenges of these methods and provide a general guideline for which methods are suited for specific biological problems, or specific data types. Moreover, we propose that recent non-negative matrix factorization-based approaches may become the integration methodology of choice, as they are well suited and accurate in dealing with heterogeneous data and have many opportunities for further development.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom