z-logo
Premium
Statistical potentials for fold assessment
Author(s) -
Melo Francisco,
Sánchez Roberto,
Sali Andrej
Publication year - 2002
Publication title -
protein science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.353
H-Index - 175
eISSN - 1469-896X
pISSN - 0961-8368
DOI - 10.1002/pro.110430
Subject(s) - dihedral angle , statistical model , statistical potential , fold (higher order function) , statistical physics , function (biology) , biological system , mathematics , protein structure prediction , algorithm , physics , statistics , computer science , protein structure , nuclear magnetic resonance , molecule , biology , hydrogen bond , quantum mechanics , evolutionary biology , programming language
A protein structure model generally needs to be evaluated to assess whether or not it has the correct fold. To improve fold assessment, four types of a residue‐level statistical potential were optimized, including distance‐dependent, contact, ϕ/Ψ dihedral angle, and accessible surface statistical potentials. Approximately 10,000 test models with the correct and incorrect folds were built by automated comparative modeling of protein sequences of known structure. The criterion used to discriminate between the correct and incorrect models was the Z‐score of the model energy. The performance of a Z‐score was determined as a function of many variables in the derivation and use of the corresponding statistical potential. The performance was measured by the fractions of the correctly and incorrectly assessed test models. The most discriminating combination of any one of the four tested potentials is the sum of the normalized distance‐dependent and accessible surface potentials. The distance‐dependent potential that is optimal for assessing models of all sizes uses both C α and C β atoms as interaction centers, distinguishes between all 20 standard residue types, has the distance range of 30 Å, and is derived and used by taking into account the sequence separation of the interacting atom pairs. The terms for the sequentially local interactions are significantly less informative than those for the sequentially nonlocal interactions. The accessible surface potential that is optimal for assessing models of all sizes uses C β atoms as interaction centers and distinguishes between all 20 standard residue types. The performance of the tested statistical potentials is not likely to improve significantly with an increase in the number of known protein structures used in their derivation. The parameters of fold assessment whose optimal values vary significantly with model size include the size of the known protein structures used to derive the potential and the distance range of the accessible surface potential. Fold assessment by statistical potentials is most difficult for the very small models. This difficulty presents a challenge to fold assessment in large‐scale comparative modeling, which produces many small and incomplete models. The results described in this study provide a basis for an optimal use of statistical potentials in fold assessment.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here