A Hybrid Dimensionality Reduction Model for Classification of Microarray Dataset
Author(s) -
Micheal O. Arowol,
Sulaiman Olaniyi Abdulsalam,
Rafiu Mope Isiaka,
Kazeem Alagbe Gbolagade
Publication year - 2017
Publication title -
international journal of information technology and computer science
Language(s) - English
Resource type - Journals
eISSN - 2074-9015
pISSN - 2074-9007
DOI - 10.5815/ijitcs.2017.11.06
Subject(s) - dimensionality reduction , computer science , principal component analysis , feature selection , pattern recognition (psychology) , support vector machine , artificial intelligence , curse of dimensionality , data mining , feature extraction , set (abstract data type) , feature vector , reduction (mathematics) , mathematics , geometry , programming language
In this paper, a combination of dimensionality reduction technique, to address the problems of highly correlated data and selection of significant variables out of set of features, by assessing important and significant dimensionality reduction techniques contributing to efficient classification of genes is proposed. One-WayANOVA is employed for feature selection to obtain an optimal number of genes, Principal Component Analysis (PCA) as well as Partial Least Squares (PLS) are employed as feature extraction methods separately, to reduce the selected features from microarray dataset. An experimental result on colon cancer dataset uses Support Vector Machine (SVM) as a classification method. Combining feature selection and feature extraction into a generalized model, a robust and efficient dimensional space is obtained. In this approach, redundant and irrelevant features are removed at each step; classification presents an efficient performance of accuracy of about 98% over the state of art.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom