
An Optimized LightGBM Model for Fraud Detection
Author(s) -
Kezhen Huang
Publication year - 2020
Publication title -
journal of physics. conference series
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.21
H-Index - 85
eISSN - 1742-6596
pISSN - 1742-6588
DOI - 10.1088/1742-6596/1651/1/012111
Subject(s) - computer science , random forest , credit card fraud , feature selection , hyperparameter , support vector machine , feature engineering , artificial intelligence , feature (linguistics) , machine learning , credit card , data mining , deep learning , payment , linguistics , philosophy , world wide web
The rapid development of e-commerce and the growing popularity of credit cards have made online transactions smooth and convenient. However, large numbers of online transactions are also the targets of online credit card fraud, which aggregate to enormous losses annually. In response to this trend, many machine learning and deep learning methods have been proposed to solve this problem. Unfortunately, most models have been developed on small datasets and require tedious fine-tuning processes. In this paper, a LightGBM-based method for fraud detection is proposed. The dataset used for this study is the IEEE-CIS Fraud Detection dataset provided by Vesta Corporation, which includes over 1 million samples. Experiments have shown that the LightGBM-based method outperforms most classical methods based on Support Vector Machine, XGBoost, or Random Forest. Besides, effective feature engineering methods for feature selection and Bayesian fine-tuning for automatic hyperparameter searching are also proposed.