
Gene expression feature selection for prostate cancer diagnosis using a two‐phase heuristic–deterministic search strategy
Author(s) -
Shahbeig Saleh,
Rahideh Akbar,
Helfroush Mohammad Sadegh,
Kazemi Kamran
Publication year - 2018
Publication title -
iet systems biology
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.367
H-Index - 50
eISSN - 1751-8857
pISSN - 1751-8849
DOI - 10.1049/iet-syb.2017.0044
Subject(s) - feature selection , particle swarm optimization , computer science , selection (genetic algorithm) , heuristic , genetic algorithm , pattern recognition (psychology) , data set , set (abstract data type) , algorithm , artificial intelligence , data mining , machine learning , programming language
Here, a two‐phase search strategy is proposed to identify the biomarkers in gene expression data set for the prostate cancer diagnosis. A statistical filtering method is initially employed to remove the noisiest data. In the first phase of the search strategy, a multi‐objective optimisation based on the binary particle swarm optimisation algorithm tuned by a chaotic method is proposed to select the optimal subset of genes with the minimum number of genes and the maximum classification accuracy . Finally, in the second phase of the search strategy, the cache‐based modification of the sequential forward floating selection algorithm is used to find the most discriminant genes from the optimal subset of genes selected in the first phase. The results of applying the proposed algorithm on the available challenging prostate cancer data set demonstrate that the proposed algorithm can perfectly identify the informative genes such that the classification accuracy , sensitivity, and specificity of 100% are achieved with only nine biomarkers.