Premium
An information‐theoretic approach to the prediction of protein structural class
Author(s) -
Zheng Xiaoqi,
Li Chun,
Wang Jun
Publication year - 2009
Publication title -
journal of computational chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.907
H-Index - 188
eISSN - 1096-987X
pISSN - 0192-8651
DOI - 10.1002/jcc.21406
Subject(s) - jackknife resampling , computer science , sequence (biology) , class (philosophy) , cluster analysis , data mining , fuzzy logic , k nearest neighbors algorithm , set (abstract data type) , fuzzy set , pattern recognition (psychology) , decomposition , artificial intelligence , process (computing) , algorithm , mathematics , statistics , ecology , genetics , estimator , biology , programming language , operating system
An information‐theoretical approach, which combines a sequence decomposition technique and a fuzzy clustering algorithm, is proposed for prediction of protein structural class. This approach could bypass the process of selecting and comparing sequence features as done previously. First, distances between each pair of protein sequences are estimated using a conditional decomposition technique in information theory. Then, the fuzzy k ‐nearest neighbor algorithm is used to identify the structural class of a protein given as set of sample sequences. To verify the strength of our method, we choose three widely used datasets constructed by Chou and Zhou. It is shown by the Jackknife test that our approach represents an improvement in the prediction of accuracy over existing methods. © 2009 Wiley Periodicals, Inc. J Comput Chem, 2010