z-logo
Premium
Estimating protein function from combinatorial sequence data using decision algorithms and neural networks
Author(s) -
Goldman Ellen R.,
Füllen Georg,
Youvan Douglas C.
Publication year - 1994
Publication title -
drug development research
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.582
H-Index - 60
eISSN - 1098-2299
pISSN - 0272-4391
DOI - 10.1002/ddr.430330209
Subject(s) - artificial neural network , perceptron , algorithm , computer science , artificial intelligence , sequence (biology) , sorting , machine learning , set (abstract data type) , computational biology , biology , genetics , programming language
Correlations between protein sequences and phenotypes were explored using databases of combinatorial cassette mutants of pigment‐protein complexes. Heuristically formulated decision algorithms and computer implemented neural networks were compared to determine their accuracy in classification of phenotypic categories. For the databases examined, decision algorithms employing very simple rules were able to properly classify mutants 80–84% of the time, based only on the amino acid sequence of the mutagenized region. Such decision algorithms did not require the formulation of any rules that involved site‐to‐site interactions, but rather, performed well based on the stringency of specific critical sites in the protein that accept only a restricted set of amino acids. In some cases, neural networks scored almost 10% higher than decision algorithms on the same databases (i.e., 94%). However, the success of the primitive decision algorithms and perceptrons at sorting sequences into categories suggests that linear effects predominate in the classification of a mutant's phenotype. Such methods should be generally applicable to the broad spectrum of databases that are currently being generated in combinatorial chemistry and biology experiments. © 1994 Wiley‐Less, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here