Bug Severity Prediction using Class Imbalance Problem
Author(s) - Shubhra Goyal Jindal, Amandeep Kaur
Publication year - 2019
Publication title - International Journal of Recent Technology and Engineering
Language(s) - English
Resource type - Journals
ISSN - 2277-3878
DOI - 10.35940/ijrte.d7297.118419
Subject(s) - oversampling , computer science , machine learning , class (philosophy) , artificial intelligence , software bug , component (thermodynamics) , software , data mining , computer network , physics , bandwidth (computing) , thermodynamics , programming language
The class imbalance problem arises when instances of the majority class far outnumber instances of the minority class. Imbalanced data severely degrades the performance of machine learning techniques in many fields: with a skewed distribution, a classifier either predicts the majority class with a high error rate or fails to predict the minority class at all. To address imbalance in software bug data, the Synthetic Minority Oversampling Technique (SMOTE) is used to balance datasets from Apache projects. SMOTE is applied to bug summaries to balance each dataset, and severity is then predicted at the system and component levels. Several machine learning techniques are applied to both the imbalanced and the balanced datasets to predict the severity of software bugs from their textual descriptions. Test outcomes and statistical analysis show that, in terms of the G-mean and balance metrics, results on the balanced datasets are better than those of the same techniques applied to the imbalanced data. G-mean improves by 34% and balance by 11% at the system level, and by 42% and 62%, respectively, at the component level. These results indicate that addressing class imbalance in textual data helps improve prediction performance.
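
The abstract does not give formulas for the two metrics or for the oversampling step, so the following is only a minimal sketch under stated assumptions. It assumes a binary setting, G-mean as the geometric mean of recall and specificity, and balance as one minus the normalized distance from the ideal point (false alarm rate 0, recall 1), a definition commonly used in defect prediction and not confirmed by the source. The `smote_sample` helper is hypothetical and only illustrates the interpolation idea behind SMOTE on numeric feature vectors (for example, vectorized bug summaries); it is not the authors' implementation.

```python
import math
import random


def g_mean(tp, fn, tn, fp):
    """Geometric mean of recall (true positive rate) and specificity."""
    recall = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return math.sqrt(recall * specificity)


def balance(tp, fn, tn, fp):
    """Assumed definition: 1 minus the distance from the ideal point
    (pf = 0, pd = 1), normalized by sqrt(2) so the value lies in [0, 1]."""
    pd = tp / (tp + fn)   # probability of detection (recall)
    pf = fp / (fp + tn)   # probability of false alarm
    return 1 - math.sqrt((0 - pf) ** 2 + (1 - pd) ** 2) / math.sqrt(2)


def smote_sample(minority, k=2, rng=random):
    """Hypothetical helper: build one synthetic minority point by
    interpolating between a random minority point and one of its
    k nearest minority neighbours (Euclidean distance)."""
    base = rng.choice(minority)
    neighbours = sorted(
        (p for p in minority if p is not base),
        key=lambda p: math.dist(base, p),
    )[:k]
    other = rng.choice(neighbours)
    gap = rng.random()  # position along the segment from base to neighbour
    return [b + gap * (o - b) for b, o in zip(base, other)]


if __name__ == "__main__":
    # Illustrative confusion-matrix counts, not taken from the paper.
    print(g_mean(tp=8, fn=2, tn=90, fp=10))
    print(balance(tp=8, fn=2, tn=90, fp=10))

    # Toy minority-class feature vectors, not taken from the paper.
    minority = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0]]
    print(smote_sample(minority, k=2, rng=random.Random(0)))
```

Both metrics reward a classifier only when it does well on the minority and majority classes together, which is why they are more informative than plain accuracy on skewed data. For multi-class severity labels, one common extension is to take the geometric mean of the per-class recalls; whether the paper does so is not stated in the abstract.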
