A Graph-Powered Large-Scale Fraud Detection System
Zhao Li,Biao Wang,Jiaming Huang,Yilun Jin,Zenghui Xu,Ji Zhang,Jianliang Gao
DOI: https://doi.org/10.1007/s13042-023-01786-w
2024-01-01
International Journal of Machine Learning and Cybernetics
Abstract:Graph-powered fraud detection is a common issue in various areas, such as e-commerce, banking, insurance and social networks, where data can be naturally formulated as graph structure. Especially in e-commerce, due to its large scale and enormous amount of real-time transactions over millions of merchandises, fraud detection has become an important and serious problem. The challenges lie in three aspects: sparse fraud samples, complex features in online transactions and extra-large scale of e-commerce data. To deal with above issues, in this paper, we propose an efficient graph-powered large-scale fraud detection framework. Concretely, we first present a heterogeneous label propagation algorithm to recall more potentially fraudulent samples for further model training; then, we design a novel multi-view heterogeneous graph neural network model to obtain more accurate fraud predictions; finally, a fraud pattern analysis approach is presented to discover hidden fraud groups. In addition, in order to improve the efficiency and scalability of our proposed fraud detection framework, we present a large-scale fraud detection system deployed on a general graph computing engine. We conduct experiments on two real-world datasets. Results show that the proposed graph-powered fraud detection framework achieves high accuracy and superior scalability on large-scale graph data.