z-logo
open-access-imgOpen Access
Systematic literature review of preprocessing techniques for imbalanced data
Author(s) -
Felix Ebubeogu Amarachukwu,
Lee Sai Peck
Publication year - 2019
Publication title -
iet software
Language(s) - English
Resource type - Journals
ISSN - 1751-8814
DOI - 10.1049/iet-sen.2018.5193
Subject(s) - preprocessor , data pre processing , computer science , systematic review , machine learning , data mining , artificial intelligence , data collection , data quality , quality (philosophy) , data science , engineering , medline , metric (unit) , philosophy , statistics , operations management , mathematics , epistemology , political science , law
Data preprocessing remains an important step in machine learning studies. This is because proper preprocessing of imbalanced data can enable researchers to reduce defects as much as possible, which, in turn, may lead to the elimination of defects in existing data sets. Despite the remarkable achievements that have been accomplished in machine learning studies, systematic literature reviews of imbalanced data preprocessing techniques are lacking. Consequently, there are a limited number of systematic literature review studies on imbalanced data preprocessing. In this study, the authors assess the existing literature to identify the key issues related to data quality and handling and to provide a convenient collection of the techniques used to address these issues when performing data preprocessing. They applied a systematic literature review method involving a manual search to select articles published from January 2010 to September 2018 for review. The qualities of the existing studies were assessed using certain quality assessment criteria. Of the 118 relevant studies found, only 2% were identified as having been conducted following systematic literature review guidelines. This study, therefore, calls for more systematic literature review studies on data preprocessing to improve the quality of the data applied in machine learning studies.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here