Premium
Identifying native‐like protein structures using physics‐based potentials
Author(s) -
Dominy Brian N.,
Brooks, Charles L.
Publication year - 2001
Publication title -
journal of computational chemistry
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.907
H-Index - 188
eISSN - 1096-987X
pISSN - 0192-8651
DOI - 10.1002/jcc.10018
Subject(s) - solvation , implicit solvation , force field (fiction) , intramolecular force , folding (dsp implementation) , protein folding , ionic bonding , chemistry , physics , chemical physics , crystallography , statistical physics , computer science , computational chemistry , ion , quantum mechanics , stereochemistry , nuclear magnetic resonance , electrical engineering , engineering
Abstract As the field of structural genomics matures, new methods will be required that can accurately and rapidly distinguish reliable structure predictions from those that are more dubious. We present a method based on the CHARMM gas phase implicit hydrogen force field in conjunction with a generalized Born implicit solvation term that allows one to make such discrimination. We begin by analyzing pairs of threaded structures from the EMBL database, and find that it is possible to identify the misfolded structures with over 90% accuracy. Further, we find that misfolded states are generally favored by the solvation term due to the mispairing of favorable intramolecular ionic contacts. We also examine 29 sets of 29 misfolded globin sequences from Levitt's “Decoys ‘R’ Us” database generated using a sequence homology‐based method. Again, we find that discrimination is possible with approximately 90% accuracy. Also, even in these less distorted structures, mispairing of ionic contacts results in a more favorable solvation energy for misfolded states. This is also found to be the case for collapsed, partially folded conformations of CspA and protein G taken from folding free energy calculations. We also find that the inclusion of the generalized Born solvation term, in postprocess energy evaluation, improves the correlation between structural similarity and energy in the globin database. This significantly improves the reliability of the hypothesis that more energetically favorable structures are also more similar to the native conformation. Additionally, we examine seven extensive collections of misfolded structures created by Park and Levitt using a four‐state reduced model also contained in the “Decoys ‘R’ Us” database. Results from these large databases confirm those obtained in the EMBL and misfolded globin databases concerning predictive accuracy, the energetic advantage of misfolded proteins regarding the solvation component, and the improved correlation between energy and structural similarity due to implicit solvation. Z‐scores computed for these databases are improved by including the generalized Born implicit solvation term, and are found to be comparable to trained and knowledge‐based scoring functions. Finally, we briefly explore the dynamic behavior of a misfolded protein relative to properly folded conformations. We demonstrate that the misfolded conformation diverges quickly from its initial structure while the properly folded states remain stable. Proteins in this study are shown to be more stable than their misfolded counterparts and readily identified based on energetic as well as dynamic criteria. In summary, we demonstrate the utility of physics‐based force fields in identifying native‐like conformations in a variety of preconstructed structural databases. The details of this discrimination are shown to be dependent on the construction of the structural database. © 2002 Wiley Periodicals, Inc. J Comput Chem 23: 147–160, 2002