Discrimination of outer membrane proteins using machine learning algorithms | Zendy

Gromiha M. Michael | Zendy; Suwa Makiko | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Discrimination of outer membrane proteins using machine learning algorithms

Author(s) -

Gromiha M. Michael,

Suwa Makiko

Publication year - 2006

Publication title -

proteins: structure, function, and bioinformatics

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 1.699

H-Index - 191

eISSN - 1097-0134

pISSN - 0887-3585

DOI - 10.1002/prot.20929

Subject(s) - globular protein , support vector machine , membrane protein , pattern recognition (psychology) , artificial intelligence , transmembrane protein , computational biology , bacterial outer membrane , bayes' theorem , folding (dsp implementation) , artificial neural network , naive bayes classifier , transmembrane domain , biology , computer science , biochemistry , membrane , gene , engineering , bayesian probability , receptor , escherichia coli , electrical engineering

Discriminating outer membrane proteins (OMPs) from other folding types of globular and membrane proteins is an important task both for identifying OMPs from genomic sequences and for the successful prediction of their secondary and tertiary structures. In this work, we have analyzed the performance of different methods, based on Bayes rules, logistic functions, neural networks, support vector machines, decision trees, etc. for discriminating OMPs. We found that most of the machine learning techniques discriminate OMPs with similar accuracy. The neural network‐based method could discriminate the OMPs from other proteins [globular/transmembrane helical (TMH)] at the fivefold cross‐validation accuracy of 91.0% in a dataset of 1,088 proteins. The accuracy of discriminating globular proteins is 88.8% and that of TMH proteins is 93.7%. Further, the neural network method is tested with globular proteins belonging to 30 different folding types and it could successfully exclude 95% of the considered proteins. The proteins with SAM domain such as knottins, rubredoxin, and thioredoxin folds are eliminated with 100% accuracy. These accuracy levels are comparable to or better than other methods in the literature. We suggest that this method could be effectively used to discriminate OMPs and for detecting OMPs in genomic sequences. Proteins 2006. © 2006 Wiley‐Liss, Inc.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research