Abstract: Fraudulent debt has recently become a major problem for Internet financial institutions, which incur heavy losses from fraudulent activity. Hence, there is a need for a method of analyzing and detecting fraudulent transactions and separating them from genuine ones. Supervised learning approaches dominate fraud detection because they learn from labelled fraudulent cases identified in past transactions. Although these models are interpretable, their prediction accuracy remains limited, and they perform poorly when customer behaviour changes. Moreover, data imbalance makes abnormal transactions hard to identify. Hence, this work presents a semi-supervised, outlier score-based anti-fraud model that classifies a loan applicant as a genuine or fraudulent debtor. The proposed method has three stages: pre-processing, data augmentation, and classification. After pre-processing, outlier models such as the Z-score and Isolation Forest (IF) are applied to generate additional data. An unsupervised K-Means Clustering (KMC) granularity-based outlier scoring method is then proposed to augment the datasets with multiple scores: the clustering module groups loan applicants by credit history, and the Z-score and IF are applied within each cluster to augment the original dataset with the resulting scores. The normalized data are then fed to an XGBoost-bidirectional gated recurrent unit (BiGRU) self-attention network (SAN), XGB-BiGRU-SAN, which is used to capture dynamic information more effectively. Further, the Arithmetic Optimization Algorithm (AOA) is used to optimize the network weights. The performance of the proposed XGB-BiGRU-SAN Internet loan fraud detection model is evaluated on two benchmark datasets, Lending Club and Bank Loan Status.
On the Lending Club dataset, the proposed XGB-BiGRU-SAN achieved classification accuracy, precision and recall of 99.05%, 99.11% and 99.34%, respectively. On the Bank Loan Status dataset, the corresponding accuracy, precision and recall are 98.67%, 98.82% and 98.62%.
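The cluster-wise outlier scoring described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it uses only the Python standard library, covers the K-means and per-cluster Z-score steps, and leaves out the Isolation Forest score (in practice one might compute that with scikit-learn's `IsolationForest`). The function names `kmeans`, `cluster_zscores` and `augment`, the toy feature columns, and the choice of "largest absolute per-feature Z-score" as the row score are all assumptions made for illustration.

```python
import math
import random
from statistics import mean, pstdev


def kmeans(rows, k, iters=50, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per row."""
    rng = random.Random(seed)
    centroids = rng.sample(rows, k)  # initial centroids: k random rows
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assign each row to its nearest centroid (Euclidean distance).
        labels = [
            min(range(k), key=lambda c: math.dist(row, centroids[c]))
            for row in rows
        ]
        # Recompute each centroid as the mean of its current members.
        new_centroids = []
        for c in range(k):
            members = [r for r, lab in zip(rows, labels) if lab == c]
            if members:
                new_centroids.append([mean(col) for col in zip(*members)])
            else:
                new_centroids.append(centroids[c])  # empty cluster: keep old centroid
        if new_centroids == centroids:
            break
        centroids = new_centroids
    return labels


def cluster_zscores(rows, labels):
    """Score each row by its largest absolute per-feature Z-score,
    computed against the mean and std of the row's own cluster.
    (This aggregation is an illustrative assumption.)"""
    stats = {}
    for c in set(labels):
        members = [r for r, lab in zip(rows, labels) if lab == c]
        cols = list(zip(*members))
        stats[c] = ([mean(col) for col in cols], [pstdev(col) for col in cols])
    scores = []
    for row, c in zip(rows, labels):
        mus, sigmas = stats[c]
        zs = [abs(x - m) / s if s > 0 else 0.0
              for x, m, s in zip(row, mus, sigmas)]
        scores.append(max(zs))
    return scores


def augment(rows, k=2, seed=0):
    """Append the cluster label and the cluster-wise Z-score to every row."""
    labels = kmeans(rows, k, seed=seed)
    scores = cluster_zscores(rows, labels)
    return [list(row) + [lab, score]
            for row, lab, score in zip(rows, labels, scores)]


# Toy applicants with two hypothetical features: [credit_history_years, debt_ratio].
applicants = [[1.0, 0.9], [1.5, 0.8], [1.2, 0.2],
              [12.0, 0.3], [11.0, 0.35], [13.0, 0.9]]
for row in augment(applicants, k=2):
    print(row)
```

Each output row carries the original features plus two extra columns, which is the sense in which the scores augment the dataset before classification. Scoring within a cluster rather than over the whole dataset means an applicant is compared only against applicants with a similar profile.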

Semi-Supervised Anti-Fraud Models For Cash Pre-Loan In Internet Consumer Finance

Loan Fraud Users Detection in Online Lending Leveraging Multiple Data Views

Automated Feature Engineering for Fraud Prediction in Online Credit Loan Services

Deep Learning Anti-Fraud Model For Internet Loan: Where We Are Going

A semi-supervised Anti-Fraud model based on integrated XGBoost and BiGRU with self-attention network: an application to internet loan fraud detection

Detection and Analysis of Credit Card Application Fraud Using Machine Learning Algorithms

Efficient Bank Fraud Detection with Machine Learning

A Comparison Study of Credit Card Fraud Detection: Supervised versus Unsupervised

Identifying Features for Detecting Fraudulent Loan Requests on P2P Platforms

An Intelligent Financial Fraud Detection Support System Based on Three-Level Relationship Penetration

Peer-to-Peer Loan Fraud Detection: Constructing Features from Transaction Data

Real-Time Online Banking Fraud Detection Model by Unsupervised Learning Fusion

Refined Analysis and a Hierarchical Multi-Task Learning Approach for Loan Fraud Detection

K‐Fuse: Credit card fraud detection based on a classification method with a priori class partitioning and a novel feature selection strategy

LongArms: Fraud Prediction in Online Lending Services Using Sparse Knowledge Graph

FDHelper: Assist Unsupervised Fraud Detection Experts with Interactive Feature Selection and Evaluation

A Semi-supervised Graph Attentive Network for Financial Fraud Detection

Research on anti-Financial fraud Technology based on Machine learning

Credit Card Fraud Detection via Intelligent Sampling and Self-supervised Learning

A State of the Art Survey of Data Mining-Based Fraud Detection and Credit Scoring

Personalized Approach Based on SVM and ANN for Detecting Credit Card Fraud