A Case Study in Credit Fraud Detection With SMOTE and XGBoost

Cuizhu Meng,Li Zhou,Bisong Liu
DOI: https://doi.org/10.1088/1742-6596/1601/5/052016
2020-08-01
Journal of Physics: Conference Series
Abstract:Abstract Credit fraud observations are minority in the sample set, variables tend to be seriously unbalanced, and the prediction results tend to be biased towards more observed classes. Common resolution usually constructs 1:1 data, either cutting off part of more classes (undersampling) or reducing classes for bootstrap sampling (oversampling). XGBoost is an efficient system implementation of Gradient Boosting, and also GB algorithm based on CART. Based on the real online transaction data of an Internet financial institution, this paper studies the performance of XGBoost algorithm on the original data set, the undersampling and SMOTE data sets respectively.
English Else
What problem does this paper attempt to address?