Open Access

Application of Two-Stage Data Pre-processing Approach for Software Fault Prediction

Author(s) -

Prediction,

Perinthalmanna Nellikkunn-Vengoor,

Malappuram

Publication year - 2017

Publication title - International Journal of Science and Research (IJSR)

Language(s) - English

Resource type - Journals

ISSN - 2319-7064

DOI - 10.21275/art20175357

Subject(s) - computer science, software, stage (stratigraphy), fault (geology), data processing, data mining, reliability engineering, engineering, database, operating system, geology, seismology, paleontology

Software fault prediction is a valuable exercise in software quality assurance because it helps allocate limited testing resources. Classification is one of the effective strategies for predicting software faults. The classification models are trained on datasets obtained by mining historical software repositories. In this project, a new two-stage data preprocessing approach is applied with classification models such as Naive Bayes, Decision Tree, k-nearest neighbors (KNN), and SVM to improve the prediction accuracy of each model. The two-stage approach incorporates both feature selection and instance reduction. Specifically, the feature selection stage first performs relevance analysis and then applies a proposed threshold-based clustering method, termed the novel threshold-based clustering algorithm, to control redundancy. In the instance reduction stage, random sampling is applied to maintain the balance between defective and non-defective instances. To demonstrate the approach, the project uses real-world software project datasets, such as Eclipse.
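The two stages described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the relevance measure or the clustering procedure, so everything below is an assumption made for illustration: relevance is taken as the absolute Pearson correlation between a feature and the fault label, the threshold-based clustering is approximated by a greedy pass that drops any feature whose correlation with an already kept feature reaches a threshold, and random sampling is implemented as undersampling of the majority class. The function names, the threshold value of 0.9, and the toy data are hypothetical and do not come from the paper.

```python
import random
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 when either input is constant."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / (vx * vy) ** 0.5


def select_features(rows, labels, threshold=0.9):
    """Stage 1 (assumed form): relevance ranking, then redundancy control."""
    n_features = len(rows[0])
    columns = [[row[j] for row in rows] for j in range(n_features)]
    # Relevance analysis: score each feature against the fault label.
    relevance = [abs(pearson(col, labels)) for col in columns]
    ranked = sorted(range(n_features), key=lambda j: relevance[j], reverse=True)
    kept = []
    for j in ranked:
        if relevance[j] == 0.0:
            continue  # feature carries no linear signal about the label
        # Greedy threshold clustering: a feature that is strongly correlated
        # with an already kept feature joins that cluster and is dropped.
        redundant = any(
            abs(pearson(columns[j], columns[r])) >= threshold for r in kept
        )
        if not redundant:
            kept.append(j)
    return sorted(kept)


def balance_by_undersampling(rows, labels, seed=0):
    """Stage 2 (assumed form): randomly undersample the majority class."""
    rng = random.Random(seed)
    defective = [i for i, y in enumerate(labels) if y == 1]
    clean = [i for i, y in enumerate(labels) if y == 0]
    minority, majority = sorted((defective, clean), key=len)
    chosen = sorted(minority + rng.sample(majority, len(minority)))
    return [rows[i] for i in chosen], [labels[i] for i in chosen]


def two_stage_preprocess(rows, labels, threshold=0.9, seed=0):
    """Run feature selection, then instance reduction."""
    keep = select_features(rows, labels, threshold)
    reduced = [[row[j] for j in keep] for row in rows]
    X, y = balance_by_undersampling(reduced, labels, seed)
    return X, y, keep


# Hypothetical toy data: feature 1 duplicates feature 0 (scaled by 2),
# feature 2 is constant; label 1 marks a defective module.
rows = [[10, 20, 5], [12, 24, 5], [50, 100, 5],
        [55, 110, 5], [11, 22, 5], [9, 18, 5]]
labels = [0, 0, 1, 1, 0, 0]
X, y, keep = two_stage_preprocess(rows, labels)
```

On the toy data, the constant feature is discarded as irrelevant, only one of the two perfectly correlated features survives, and the returned labels contain equal numbers of defective and non-defective instances. The balanced output could then be passed to any of the classifiers named in the abstract.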
