ReMEMBeR: Ranking Metric Embedding-Based Multicontextual Behavior Profiling for Online Banking Fraud Detection

Jipeng Cui,Chungang Yan,Cheng Wang
DOI: https://doi.org/10.1109/tcss.2021.3052950
2021-01-01
IEEE Transactions on Computational Social Systems
Abstract:Anomaly detection relies on individuals' behavior profiling and works by detecting any deviation from the norm. When used for online banking fraud detection, however, it mainly suffers from three disadvantages. First, for an individual, the historical behavior data are often too limited to profile his/her behavior pattern. Second, due to the heterogeneous nature of transaction data, there lacks a uniform treatment of different kinds of attribute values, which becomes a potential barrier for model development and further usage. Third, the transaction data are highly skewed, and it becomes a challenge to utilize the label information effectively. The three disadvantages result in both poor generalization and high false positive rate of anomaly detection, and we propose a ranking metric embedding based multi-contextual behavior profiling (ReMEMBeR) model to battle them effectively. We solve the original fraud detection problem as a pseudo-recommender system problem, where an individual is treated as a pseudo-user, his/her behavior as a pseudo-item, and the label as the corresponding pseudo-rating. With the idea of collaborative filtering, for an individual, information from other similar individuals can be used to establish his/her behavior profile. In order to obtain a uniform treatment of heterogeneous attributes, we turn to an embedding based method to learn both attribute embedding and individuals' behavior profiles within a common latent space simultaneously. To utilize the label information better, our model is designed to fit pseudo-users' correct preference ranking for pseudo-items. By doing so, it explicitly learns to tell the fraudulent from the legitimate. Last but not least, we propose to identify and distinguish individuals under different contexts and further generalize the behavior profiling model to be a multi-contextual one. The proposed model can, thus, integrate the multi-contextual behavior patterns and allow transactions to be examined under the different contexts. Extensive experiments on a real-world online banking transaction dataset demonstrate that our model not only outperforms benchmarks on all metrics but also can be combined with them to achieve even better performance.
What problem does this paper attempt to address?