z-logo
open-access-imgOpen Access
Sequence Representations and Their Utility for Predicting Protein-Protein Interactions
Author(s) -
Dhananjay Kimothi,
Pravesh Biyani,
James M. Hogan,
Melissa J. Davis
Publication year - 2021
Publication title -
ieee/acm transactions on computational biology and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.745
H-Index - 71
eISSN - 1557-9964
pISSN - 1545-5963
DOI - 10.1109/tcbb.2021.3137325
Subject(s) - bioengineering , computing and processing
Protein-Protein Interactions (PPIs) are a crucial mechanism underpinning the function of the cell. So far, a wide range of machine-learning based methods have been proposed for predicting these relationships. Their success is heavily dependent on the construction of the underlying feature vectors, with most using a set of physico-chemical properties derived from the sequence. Few work directly with the sequence itself. In this paper, we explore the utility of sequence embeddings for predicting protein-protein interactions. We construct a protein pair feature vector by concatenating the embeddings of their constituent sequence. These feature vectors are then used as input to a binary classifier to make predictions. To learn sequence embeddings, we use two established Word2Vec based methods - Seq2Vec and BioVec - and we also introduce a novel feature construction method called SuperVecNW . The embeddings generated through SuperVecNW capture some network information in addition to the contextual information present in the sequences. We test the efficacy of our proposed approach on human and yeast PPI datasets and on three well-known networks: CD9, the Ras-Raf-Mek-Erk-Elk-Srf pathway, and a Wnt-related network. We demonstrate that low dimensional sequence embeddings provide better results than most alternative representations based on physico-chemical properties while offering a far simple approach to feature vector construction.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here