z-logo
Premium
Discrimination of outer membrane proteins using machine learning algorithms
Author(s) -
Gromiha M. Michael,
Suwa Makiko
Publication year - 2006
Publication title -
proteins: structure, function, and bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.699
H-Index - 191
eISSN - 1097-0134
pISSN - 0887-3585
DOI - 10.1002/prot.20929
Subject(s) - globular protein , support vector machine , membrane protein , pattern recognition (psychology) , artificial intelligence , transmembrane protein , computational biology , bacterial outer membrane , bayes' theorem , folding (dsp implementation) , artificial neural network , naive bayes classifier , transmembrane domain , biology , computer science , biochemistry , membrane , gene , engineering , bayesian probability , receptor , escherichia coli , electrical engineering
Discriminating outer membrane proteins (OMPs) from other folding types of globular and membrane proteins is an important task both for identifying OMPs from genomic sequences and for the successful prediction of their secondary and tertiary structures. In this work, we have analyzed the performance of different methods, based on Bayes rules, logistic functions, neural networks, support vector machines, decision trees, etc. for discriminating OMPs. We found that most of the machine learning techniques discriminate OMPs with similar accuracy. The neural network‐based method could discriminate the OMPs from other proteins [globular/transmembrane helical (TMH)] at the fivefold cross‐validation accuracy of 91.0% in a dataset of 1,088 proteins. The accuracy of discriminating globular proteins is 88.8% and that of TMH proteins is 93.7%. Further, the neural network method is tested with globular proteins belonging to 30 different folding types and it could successfully exclude 95% of the considered proteins. The proteins with SAM domain such as knottins, rubredoxin, and thioredoxin folds are eliminated with 100% accuracy. These accuracy levels are comparable to or better than other methods in the literature. We suggest that this method could be effectively used to discriminate OMPs and for detecting OMPs in genomic sequences. Proteins 2006. © 2006 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here