
Improved Machine Learning using Adaptive Boosting algorithm in Membrane Protein Prediction
Author(s) -
Anjna Jayant Deen*,
Manasi Gyanchandani
Publication year - 2019
Publication title -
international journal of innovative technology and exploring engineering
Language(s) - English
Resource type - Journals
ISSN - 2278-3075
DOI - 10.35940/ijitee.k2207.1081219
Subject(s) - adaboost , boosting (machine learning) , artificial intelligence , random forest , pseudo amino acid composition , gradient boosting , computer science , pattern recognition (psychology) , machine learning , feature extraction , ensemble learning , algorithm , membrane protein , mathematics , membrane , amino acid , support vector machine , chemistry , biochemistry , dipeptide
Membrane protein are very important and play significantly in the field of biology and medicine. The main purpose is to find suitable features of a membrane protein. Various features extraction methods are use to find membrane protein and their types. PseAAC (Pseudo Amino Acid Composition) is a one of the feature extraction method which was used to find the localization of the protein, which helps in the detection of membrane types. Therefore, in this study, a novel feature extraction method which is an integration of the pseudo amino acid composition integer values mapped in discrete sequence numbers in a matrix. The proposed scheme avoids biasing among the different membrane proteins and their types. Decision making for predicting the identification of membrane protein types was performed using an algorithm framework to improve the learning accuracy, by putting the training samples weights in the learning process of AdaBoost. The performance of different ensemble classifiers such as Random Forest, AdaBoost, is analyzed. The best accuracy achieved is 91.50% for with the Matthews correlation coefficient is 83.0%, and Cohen’s Kappa value is 82.7%