z-logo
open-access-imgOpen Access
A hybrid gene selection algorithm based on interaction information for microarray-based cancer classification
Author(s) -
Songyot Nakariyakul
Publication year - 2019
Publication title -
plos one
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.99
H-Index - 332
ISSN - 1932-6203
DOI - 10.1371/journal.pone.0212333
Subject(s) - gene selection , selection (genetic algorithm) , microarray analysis techniques , computer science , gene , dna microarray , feature selection , data mining , computational biology , artificial intelligence , biology , gene expression , genetics
We address gene selection and machine learning methods for cancer classification using microarray gene expression data. Due to the high dimensionality of microarray data, traditional gene selection algorithms are filter-based, focusing on intrinsic properties of the data such as distance, dependency, and correlation. These methods are fast but select far too many genes to use for the classification task. In this work, we present a new hybrid filter-wrapper gene subset selection algorithm that is an improved modification of our prior algorithm. Our proposed method employs interaction information to rank candidate genes to add into a gene subset. It then conditionally adds one gene at a time into the current subset and verifies whether the resultant subset improves the classification performance significantly. Only significant genes are selected, and the candidate gene list is updated every time a gene is added to the subset. Thus, our gene selection algorithm is very dynamic. Experimental results on ten public cancer microarray data sets show that our method consistently outperforms prior gene selection algorithms in terms of classification accuracy, while requiring a small number of selected genes.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here