A novel cost‐sensitive algorithm and new evaluation strategies for regression in imbalanced domains | Zendy

Sadouk Lamyaa | Zendy; Gadi Taoufiq | Zendy; Essoufi El Hassan | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

A novel cost‐sensitive algorithm and new evaluation strategies for regression in imbalanced domains

Author(s) -

Sadouk Lamyaa,

Gadi Taoufiq,

Essoufi El Hassan

Publication year - 2021

Publication title -

expert systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.365

H-Index - 38

eISSN - 1468-0394

pISSN - 0266-4720

DOI - 10.1111/exsy.12680

Subject(s) - computer science , regression , machine learning , artificial neural network , convergence (economics) , artificial intelligence , minification , structural risk minimization , mean squared error , function (biology) , data mining , algorithm , statistics , mathematics , evolutionary biology , biology , economics , programming language , economic growth

Many real‐world data mining applications involve obtaining predictive models using imbalanced datasets. Frequently, the least common target variables present within datasets are associated with events that are highly relevant for end users. When these variables are nominal, we have a class‐imbalance problem which has been thoroughly studied within machine learning. As for regression tasks where target variables are continuous, few predictive models and evaluation techniques exist. This paper proposes a solution to these challenges. First, we introduce a cost‐sensitive learning algorithm based on a neural network trained on the minimization of a biased loss function. Results show a higher or comparable performance and convergence speed to existent techniques. Second, we develop new approaches for performance assessment of regression tasks within imbalanced domains by proposing new scalar measures, namely Geometric Mean Error ( GME ) and Class‐Weighted Error ( CWE ), as well as new graphical‐based measures, namely REC TPR , REC TNR , REC G − Mean and REC CWA curves. Unlike standard measures, our evaluation strategies are shown to be more robust to data imbalance as they reflect the performance of both rare and frequent events.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research