Premium
Thirty‐plus functional families from a single motif
Author(s) -
Yu Lihua,
Smith Temple F.,
Gaitatzes Chrysanthe
Publication year - 2000
Publication title -
protein science
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 3.353
H-Index - 175
eISSN - 1469-896X
pISSN - 0961-8368
DOI - 10.1110/ps.9.12.2470
Subject(s) - biology , caenorhabditis elegans , computational biology , genetics , genome , context (archaeology) , structural motif , saccharomyces cerevisiae , protein family , evolutionary biology , gene , biochemistry , paleontology
It is now possible to identify over 30 functional subfamilies among the WD‐repeat‐containing proteins found in the completed genomes. The majority of these subfamilies have at least one member for which experimental data allow assignment to a cellular pathway or process. Half of the 63 WD‐repeat‐containing proteins in Saccharomyces cerevisiae , half of the 70 in Caenorhabditis elegans , and a third of the 100 plus predicted in Drosophila can be assigned to 23 of these functional subfamilies. Perhaps indicative of the future, 33 WD‐repeat‐containing proteins from the partial genome of Arabidopsis thaliana can now be assigned to 18 of these subfamilies. These assignments have been made possible by combining traditional sequence similarity with an implied common beta propeller structural context to obtain measures of protein‐protein surface similarity. The beta propeller structural context is represented in the form of a Hidden Markov Model. The procedure is completely automated.