Credit scoring for a microcredit data set using the synthetic minority oversampling technique and ensemble classifiers | Zendy

Gicić Adaleta | Zendy; Subasi Abdulhamit | Zendy

AI Assistant Blog Pricing

Home ZAIA Blog

Premium

Credit scoring for a microcredit data set using the synthetic minority oversampling technique and ensemble classifiers

Author(s) -

Gicić Adaleta,

Subasi Abdulhamit

Publication year - 2019

Publication title -

expert systems

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.365

H-Index - 38

eISSN - 1468-0394

pISSN - 0266-4720

DOI - 10.1111/exsy.12363

Subject(s) - oversampling , microfinance , computer science , machine learning , classifier (uml) , artificial intelligence , preprocessor , boosting (machine learning) , data mining , context (archaeology) , data pre processing , credit risk , data set , finance , business , bandwidth (computing) , computer network , paleontology , economics , biology , economic growth

Although microfinance organizations play an important role in developing economies, decision support models for microfinance credit scoring have not been sufficiently covered in the literature, particularly for microcredit enterprises. The aim of this paper is to create a three‐class model that can improve credit risk assessment in the microfinance context. The real‐world microcredit data set used in this study includes data from retail, micro, and small enterprises. To the best of the authors' knowledge, existing research on microfinance credit scoring has been limited to regression and genetic algorithms, thereby excluding novel machine learning algorithms. The aim of this research is to close this gap. The proposed models predict default events by analysing different ensemble classification methods that empower the effects of the synthetic minority oversampling technique (SMOTE) used in the preprocessing of the imbalanced microcredit data set. Initial results have shown improvement in the prediction results for certain classes when the oversampling technique with homogeneous and heterogeneous ensemble classifier methods was applied. A prediction improvement for all classes was achieved via application of SMOTE and the Consolidated Trees Construction algorithm together with Rotation Forest. To obtain a complete view of all aspects, an additional set of metrics is used in the evaluation of performance.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here

Accelerating Research