Premium
Inferring Association between Compound and Pathway with an Improved Ensemble Learning Method
Author(s) -
Song Meiyue,
Jiang Zhenran
Publication year - 2015
Publication title -
molecular informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.481
H-Index - 68
eISSN - 1868-1751
pISSN - 1868-1743
DOI - 10.1002/minf.201500033
Subject(s) - inference , computer science , artificial intelligence , cheminformatics , machine learning , drug discovery , classifier (uml) , data mining , chemical space , flexibility (engineering) , ensemble learning , bioinformatics , mathematics , biology , statistics
Emergence of compound molecular data coupled to pathway information offers the possibility of using machine learning methods for compound‐pathway associations’ inference. To provide insights into the global relationship between compounds and their affected pathways, a improved Rotation Forest ensemble learning method called RGRF (Relief & GBSSL – Rotation Forest) was proposed to predict their potential associations. The main characteristic of the RGRF lies in using the Relief algorithm for feature extraction and regarding the Graph‐Based Semi‐Supervised Learning method as classifier. By incorporating the chemical structure information, drug mode of action information and genomic space information, our method can achieve a better precision and flexibility on compound‐pathway prediction. Moreover, several new compound‐pathway associations that having the potential for further clinical investigation have been identified by database searching. In the end, a prediction tool was developed using RGRF algorithm, which can predict the interactions between pathways and all of the compounds in cMap database.