z-logo
open-access-imgOpen Access
Characterization of Non-Trivial Neighborhood Fold Constraints from Protein Sequences using Generalized Topohydrophobicity.
Author(s) -
Guillaume Fourty,
Isabelle Callebaut,
JeanPaul Mor
Publication year - 2008
Publication title -
bioinformatics and biology insights
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.556
H-Index - 23
ISSN - 1177-9322
DOI - 10.4137/bbi.s426
Subject(s) - tetrahedron , fold (higher order function) , protein structure prediction , sequence (biology) , ab initio , protein structure , protein folding , computer science , structural alignment , computational biology , crystallography , mathematics , combinatorics , chemistry , sequence alignment , bioinformatics , peptide sequence , biology , genetics , biochemistry , organic chemistry , gene , programming language
Prediction of key features of protein structures, such as secondary structure, solvent accessibility and number of contacts between residues, provides useful structural constraints for comparative modeling, fold recognition, ab-initio fold prediction and detection of remote relationships. In this study, we aim at characterizing the number of non-trivial close neighbors, or long-range contacts of a residue, as a function of its "topohydrophobic" index deduced from multiple sequence alignments and of the secondary structure in which it is embedded. The "topohydrophobic" index is calculated using a two-class distribution of amino acids, based on their mean atom depths. From a large set of structural alignments processed from the FSSP database, we selected 1485 structural sub-families including at least 8 members, with accurate alignments and limited redundancy. We show that residues within helices, even when deeply buried, have few non-trivial neighbors (0-2), whereas beta-strand residues clearly exhibit a multimodal behavior, dominated by the local geometry of the tetrahedron (3 non-trivial close neighbors associated with one tetrahedron; 6 with two tetrahedra). This observed behavior allows the distinction, from sequence profiles, between edge and central beta-strands within beta-sheets. Useful topological constraints on the immediate neighborhood of an amino acid, but also on its correlated solvent accessibility, can thus be derived using this approach, from the simple knowledge of multiple sequence alignments.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here
Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom