Application Analysis of the Machine Learning Fusion Model in Building a Financial Fraud Prediction Model | Zendy

Hongsheng Xu | Zendy; Ganglong Fan | Zendy; Yanping Song | Zendy

AI Assistant Blog Pricing

Open Access

Application Analysis of the Machine Learning Fusion Model in Building a Financial Fraud Prediction Model

Author(s) -

Hongsheng Xu,

Ganglong Fan,

Yanping Song

Publication year - 2022

Publication title -

security and communication networks

Language(s) - English

Resource type - Journals

SCImago Journal Rank - 0.446

H-Index - 43

eISSN - 1939-0122

pISSN - 1939-0114

DOI - 10.1155/2022/8402329

Subject(s) - computer science , machine learning , artificial intelligence , hyperparameter optimization , random forest , hyperparameter , oversampling , decision tree , support vector machine , set (abstract data type) , predictive modelling , finance , data set , key (lock) , data mining , bandwidth (computing) , economics , computer network , computer security , programming language

Financial data fraud by listed companies has brought an extremely bad impact on the market and society. Predicting the financial data fraud of listed companies in advance may reduce losses. Therefore, the key to solving the problem is to build a financial fraud prediction model. This paper analyzes the prediction and identification models of financial fraud at home and abroad in detail, and finds the problems existing in these prediction models. In view of these shortcomings, this paper proposes to build a financial fraud prediction model based on a machine learning fusion model. The first is the unbalanced processing of data samples. The oversampling method is used to improve the model prediction effect by setting a reasonable sampling ratio. Then, four machine learning models (GBDT, random forest, support vector machine, and decision tree) are selected suitable for financial data. The training set is used to optimize the hyperparameters of the four machine learning models separately. This paper proposes integrating the random search and grid search mechanisms to adjust the parameters to the optimum. Finally, a financial fraud prediction model is constructed based on the multimodel fusion of the integrated learning framework. First, the base learner integrates the predicted results of the four models and performs five-fold crossvalidation on the training set. The meta-learner then uses the GBDT model to train integrated data from the first layer, resulting in a fusion model. The experimental results show that the AUC value of the fusion model is significantly higher than that of the single model. Therefore, the fusion model proposed in this paper can effectively improve the prediction effect.

The content you want is available to Zendy users.

Already have an account? Click here to sign in.

Having issues? You can contact us here

Accelerating Research

Address

John Eccles House
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom

About

About Careers Publisher Partners Contact Us Our institutional solutions Get Organisational Trial or Quote

Learn

FAQs Blog Terms of Use Privacy Policy

Download the Zendy App

Discover

Explore

Home ZAIA Blog