Environmental features are important in determining protein secondary structure
Author(s) - Macdonald J. Randy, Johnson W. Curtis
Publication year - 2001
Publication title - Protein Science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.353
H-Index - 175
eISSN - 1469-896X
pISSN - 0961-8368
DOI - 10.1110/ps.420101
Subject(s) - pairwise comparison , side chain , residue (chemistry) , protein secondary structure , biological system , amino acid residue , chemistry , solvent , protein structure , computer science , mathematics , peptide sequence , biochemistry , biology , artificial intelligence , organic chemistry , polymer , gene
We have investigated amino acid features that determine secondary structure: (1) the solvent accessibility of each side chain, and (2) the interaction of each side chain with others one to four residues apart. Solvent accessibility is a simple model that distinguishes residue environment. The pairwise interactions represent a simple model of local side chain to side chain interactions. To test the importance of these features we developed an algorithm to separate α‐helices, β‐strands, and “other” structure. Single residue and pairwise probabilities were determined for 25,141 samples from proteins with <30% homology. Combining the features of solvent accessibility with pairwise probabilities allows us to distinguish the three structures after cross validation at the 82.0% level. We gain 1.4% to 2.0% accuracy by optimizing the propensities, demonstrating that probabilities do not necessarily reflect propensities. Optimization of residue exposures, weights of all probabilities, and propensities increased accuracy to 84.0%.
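The idea of combining per-residue solvent accessibility with pairwise probabilities for residues one to four positions apart can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a two-state buried/exposed encoding ('b'/'e'), add-one smoothing, and a simple sum of log probabilities with equal weights, and it omits the optimization of exposures, weights, and propensities that the abstract reports. The function names `train` and `predict` and the toy data are hypothetical.

```python
import math
from collections import defaultdict

CLASSES = ("H", "E", "O")  # alpha-helix, beta-strand, "other"


def train(samples, max_sep=4):
    """Count class labels for single residues and residue pairs.

    samples: iterable of (sequence, exposure, labels) strings of equal length.
    exposure uses an assumed two-state code: 'b' buried, 'e' exposed.
    """
    single = defaultdict(lambda: defaultdict(float))  # (aa, exposure) -> class -> count
    pair = defaultdict(lambda: defaultdict(float))    # (aa_i, aa_j, offset) -> class -> count
    for seq, exp, lab in samples:
        n = len(seq)
        for i in range(n):
            single[(seq[i], exp[i])][lab[i]] += 1
            for off in range(-max_sep, max_sep + 1):
                j = i + off
                if off != 0 and 0 <= j < n:
                    pair[(seq[i], seq[j], off)][lab[i]] += 1
    return single, pair


def _log_prob(counts, cls, pseudo):
    """Smoothed log P(class | feature) from a class -> count mapping."""
    total = sum(counts.values())
    return math.log((counts.get(cls, 0.0) + pseudo) / (total + pseudo * len(CLASSES)))


def predict(seq, exp, single, pair, max_sep=4, pseudo=1.0):
    """Assign each residue the class with the highest summed log probability."""
    n = len(seq)
    out = []
    for i in range(n):
        scores = {}
        for cls in CLASSES:
            # Single-residue term, conditioned on the residue's exposure state.
            score = _log_prob(single.get((seq[i], exp[i]), {}), cls, pseudo)
            # Pairwise terms for neighbours one to max_sep positions away.
            for off in range(-max_sep, max_sep + 1):
                j = i + off
                if off != 0 and 0 <= j < n:
                    score += _log_prob(pair.get((seq[i], seq[j], off), {}), cls, pseudo)
            scores[cls] = score
        out.append(max(CLASSES, key=scores.get))
    return "".join(out)


# Toy, made-up training data purely to exercise the code.
toy = [
    ("AAAAAA", "bbbbbb", "HHHHHH"),
    ("VVVVVV", "eeeeee", "EEEEEE"),
    ("GGGGGG", "eeeeee", "OOOOOO"),
]
single_counts, pair_counts = train(toy)
print(predict("AAAA", "bbbb", single_counts, pair_counts))
```

In this sketch every term carries equal weight; the abstract indicates that tuning the weights and replacing raw probabilities with optimized propensities is what raised the reported accuracy from 82.0% to 84.0%.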