z-logo
open-access-imgOpen Access
Boosting with crossover for improving imbalanced medical datasets classification
Author(s) -
Abeer S. Desuky,
Asmaa Hekal Omar,
Naglaa Mostafa
Publication year - 2021
Publication title -
bulletin of electrical engineering and informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.251
H-Index - 12
ISSN - 2302-9285
DOI - 10.11591/eei.v10i5.3121
Subject(s) - boosting (machine learning) , preprocessor , crossover , computer science , artificial intelligence , machine learning , data pre processing , data mining , field (mathematics) , health care , class (philosophy) , statistical classification , mathematics , pure mathematics , economics , economic growth
Due to the common use of electronic health databases in many healthcare services, healthcare data are available for researchers in the classification field to make diseases’ diagnosis more efficient. However, healthcare-medical data classification is most challenging because it is often imbalanced data. Most proposed algorithms are susceptible to classify the samples into the majority class, resulting in the insufficient prediction of the minority class. In this paper, a novel preprocessing method is proposed, using boosting and crossover to optimize the ratio of the two classes by progressively rebuilding the training dataset. This approach is shown to give better performance than other state-of-the-art ensemble methods, which is demonstrated by experiments on seven real-world medical datasets with different imbalance ratios and various distributions.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.
Having issues? You can contact us here