
Feature Selection in Cross-Project Software Defect Prediction
Author(s) -
Aries Saifudin,
Agung Trisetyarso,
Wawan Suparta,
Chuanze Kang,
Bahtiar Saleh Abbas,
Yaya Heryadi
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1569/2/022001
Subject(s) - computer science , feature selection , naive bayes classifier , software bug , software , software metric , software sizing , data mining , software regression , machine learning , feature (linguistics) , software development , software construction , software engineering , selection (genetic algorithm) , verification and validation , artificial intelligence , schedule , support vector machine , engineering , operating system , linguistics , philosophy , operations management
Advances in technology have increased the use and complexity of software. The complexity of the software can increase the possibility of defects. Defective software can cause high losses. Fixing defective software requires a high cost because it can spend up 50% of the project schedule. Most software developers don’t document their work properly so that making it difficult to analyse software development history data. Software metrics which use in cross-project software defects prediction have many features. Software metrics usually consist of various measurement techniques, so there are possibilities for their features to be similar. It is possible that these features are similar or irrelevant so that they can cause a decrease in the performance of classifiers. In this study, several feature selection techniques were proposed to select the relevant features. The classification algorithm used is Naive Bayes. Based on the analysis using ANOVA, the SBS and SBFS models can significantly improve the performance of the Naïve Bayes model.