Premium
Improving protein structural class prediction using novel combined sequence information and predicted secondary structural features
Author(s) -
Dai Qi,
Wu Li,
Li Lihua
Publication year - 2011
Publication title -
journal of computational chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.907
H-Index - 188
eISSN - 1096-987X
pISSN - 0192-8651
DOI - 10.1002/jcc.21918
Subject(s) - sequence (biology) , benchmark (surveying) , computer science , protein structure prediction , protein sequencing , protein secondary structure , computational biology , threading (protein sequence) , sequence alignment , class (philosophy) , artificial intelligence , peptide sequence , pattern recognition (psychology) , protein structure , biology , genetics , biochemistry , gene , geodesy , geography
Protein structural class prediction solely from protein sequences is a challenging problem in bioinformatics. Numerous efficient methods have been proposed for protein structural class prediction, but challenges remain. Using novel combined sequence information coupled with predicted secondary structural features (PSSF), we proposed a novel scheme to improve prediction of protein structural classes. Given an amino acid sequence, we first transformed it into a reduced amino acid sequence and calculated its word frequencies and word position features to combine novel sequence information. Then we added the PSSF to the combine sequence information to predict protein structural classes. The proposed method was tested on four benchmark datasets in low homology and achieved the overall prediction accuracies of 83.1%, 87.0%, 94.5%, and 85.2%, respectively. The comparison with existing methods demonstrates that the overall improvements range from 2.3% to 27.5%, which indicates that the proposed method is more efficient, especially for low‐homology amino acid sequences. © 2011 Wiley Periodicals, Inc. J Comput Chem, 2011