Projection-Based Neighborhood Non-Negative Matrix Factorization for lncRNA-Protein Interaction Prediction
Author(s) -
Yingjun Ma,
Tingting He,
Xingpeng Jiang
Publication year - 2019
Publication title -
frontiers in genetics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.413
H-Index - 81
ISSN - 1664-8021
DOI - 10.3389/fgene.2019.01148
Subject(s) - computer science , projection (relational algebra) , matrix decomposition , benchmark (surveying) , computational biology , feature (linguistics) , interaction network , similarity (geometry) , protein–protein interaction , non negative matrix factorization , data mining , artificial intelligence , machine learning , algorithm , biology , genetics , gene , image (mathematics) , linguistics , eigenvalues and eigenvectors , physics , philosophy , geodesy , quantum mechanics , geography
Many long ncRNAs (lncRNA) make their effort by interacting with the corresponding RNA-binding proteins, and identifying the interactions between lncRNAs and proteins is important to understand the functions of lncRNA. Compared with the time-consuming and laborious experimental methods, more and more computational models are proposed to predict lncRNA-protein interactions. However, few models can effectively utilize the biological network topology of lncRNA (protein) and combine its sequence structure features, and most models cannot effectively predict new proteins (lncRNA) that do not interact with any lncRNA (proteins). In this study, we proposed a projection-based neighborhood non-negative matrix decomposition model (PMKDN) to predict potential lncRNA-protein interactions by integrating multiple biological features of lncRNAs (proteins). First, according to lncRNA (protein) sequences and lncRNA expression profile data, we extracted multiple features of lncRNA (protein). Second, based on protein GO ontology annotation, lncRNA sequences, lncRNA(protein) feature information, and modified lncRNA-protein interaction network, we calculated multiple similarities of lncRNA (protein), and fused them to obtain a more accurate lncRNA(protein) similarity network. Finally, combining the similarity and various feature information of lncRNA (protein), as well as the modified interaction network, we proposed a projection-based neighborhood non-negative matrix decomposition algorithm to predict the potential lncRNA-protein interactions. On two benchmark datasets, PMKDN showed better performance than other state-of-the-art methods for the prediction of new lncRNA-protein interactions, new lncRNAs, and new proteins. Case study further indicates that PMKDN can be used as an effective tool for lncRNA-protein interaction prediction.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom