z-logo
open-access-imgOpen Access
Isolation Forest Wrapper Approach for Feature Selection in Software Defect Prediction
Author(s) -
Zhiguo Ding
Publication year - 2021
Publication title -
iop conference series. materials science and engineering
Language(s) - English
Resource type - Journals
eISSN - 1757-899X
pISSN - 1757-8981
DOI - 10.1088/1757-899x/1043/3/032030
Subject(s) - feature selection , curse of dimensionality , dimensionality reduction , computer science , software , data mining , feature (linguistics) , machine learning , artificial intelligence , selection (genetic algorithm) , pattern recognition (psychology) , linguistics , programming language , philosophy
Software defect prediction is one of the hot research topics in the software engineering application. The performance of predictor largely depends on the quality of dataset used for learning the predictor. High dimensionality is a noteworthy characteristic of software defect dataset, which has some side-effect on the predictor building using data mining or machine learning algorithm. Feature selection, being an effective measure of dimensionality reduction, uses the optimal feature subset to represent the entire feature space and alleviate the dimensionality curse problem. In this paper, a wrapper feature selection approach applying genetic algorithm as a search strategy to find the optimal feature subset is firstly introduced. Secondly, an improved isolation forest based defect prediction method is proposed. The exploring experiments on 5 real NASA software defect datasets demonstrate the proposed method can improve the defect prediction performance to some extent and proves the positive effect of feature selection in SDP application.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here