Resampling with neighbourhood bias on imbalanced domains | Zendy

Branco Paula | Zendy; Torgo Luis | Zendy; Ribeiro Rita P. | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Resampling with neighbourhood bias on imbalanced domains

Author(s) -

Branco Paula,

Torgo Luis,

Ribeiro Rita P.

Publication year - 2018

Publication title -

expert systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.365

H-Index - 38

eISSN - 1468-0394

pISSN - 0266-4720

DOI - 10.1111/exsy.12311

Subject(s) - resampling , computer science , neighbourhood (mathematics) , machine learning , artificial intelligence , regression , data mining , variable (mathematics) , set (abstract data type) , statistics , mathematics , mathematical analysis , programming language

Imbalanced domains are an important problem that arises in predictive tasks causing a loss in the performance on the most relevant cases for the user. This problem has been extensively studied for classification problems, where the target variable is nominal. Recently, it was recognized that imbalanced domains occur in several other contexts and for multiple tasks, such as regression tasks, where the target variable is continuous. This paper focuses on imbalanced domains in both classification and regression tasks. Resampling strategies are among the most successful approaches to address imbalanced domains. In this work, we propose variants of existing resampling strategies that are able to take into account the information regarding the neighbourhood of the examples. Instead of performing sampling uniformly, our proposals bias the strategies to reinforce some regions of the data sets. With an extensive set of experiments, we provide evidence of the advantage of introducing a neighbourhood bias in the resampling strategies for both classification and regression tasks with imbalanced data sets.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research