Premium
Exploiting heterogeneous sequence properties improves prediction of protein disorder
Author(s) -
Obradovic Zoran,
Peng Kang,
Vucetic Slobodan,
Radivojac Predrag,
Dunker A. Keith
Publication year - 2005
Publication title -
proteins: structure, function, and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.699
H-Index - 191
eISSN - 1097-0134
pISSN - 0887-3585
DOI - 10.1002/prot.20735
Subject(s) - intrinsically disordered proteins , sequence (biology) , protein structure prediction , computer science , statistics , mathematics , biology , protein structure , genetics , biophysics , biochemistry
During the past few years we have investigated methods to improve predictors of intrinsically disordered regions longer than 30 consecutive residues. Experimental evidence, however, showed that these predictors were less successful on short disordered regions, as observed two years ago during the fifth Critical Assessment of Techniques for Protein Structure Prediction (CASP5). To address this shortcoming, we developed a two‐level model called VSL1 (CASP6 id: 193‐1). At the first level, VSL1 consists of two specialized predictors, one of which was optimized for long disordered regions (>30 residues) and the other for short disordered regions (≤30 residues). At the second level, a meta‐predictor was built to assign weights for combining the two first‐level predictors. As the results of the CASP6 experiment showed, this new predictor has achieved the highest accuracy yet and significantly improved performance on short disordered regions, while maintaining high performance on long disordered regions. Proteins 2005;Suppl 7:176–182. © 2005 Wiley‐Liss, Inc.