Tax Evasion Detection With FBNE-PU Algorithm Based on PnCGCN and PU Learning

Yuda Gao, Bin Shi, Bo Dong, Yiyang Wang, Lingyun Mi, Qinghua Zheng
DOI: https://doi.org/10.1109/TKDE.2021.3090075
IF: 9.235
2023-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract: Tax evasion is an illegal activity in which individuals or entities avoid paying their true tax liabilities. Efficient detection of tax evasion has long been a crucial issue for both governments and academic researchers. Recent research has proposed using machine learning to detect tax evasion and has shown good results in some specific areas. However, two major obstacles to detecting tax evasion remain. First, it is hard to extract powerful features because of the complexity of tax data. Second, because tax auditing is a complicated process, labeled data are limited in practice. These obstacles motivate the contributions of this work. In this paper, we propose a novel tax evasion detection framework named FBNE-PU (Fusion of the basic feature and network embedding with PU learning for tax evasion detection), a multistage method for detecting tax evasion in real-life scenarios. We perform an in-depth analysis of the characteristics of the transaction network and propose a novel network embedding algorithm, PnCGCN. It significantly improves detection performance by extracting powerful features from basic features and the tax-related transaction network. Moreover, we use nnPU (positive-unlabeled learning with a non-negative risk estimator) to assign pseudo labels to unlabeled data. Finally, an MLP is trained as the decision function. Experiments on three real-world datasets demonstrate that our method significantly outperforms the comparison methods on the tax evasion detection task. The source code and experimental details are available at https://github.com/PiggyGaGa/FBNE-PU.
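To make the nnPU step mentioned in the abstract concrete, the following is a minimal sketch of a non-negative PU risk estimator in the standard form from the PU learning literature. It is not taken from the authors' repository. The function names, the choice of sigmoid loss, and the class prior value are assumptions made for illustration only.

```python
import numpy as np


def sigmoid_loss(scores, label):
    """Sigmoid surrogate loss l(z, y) = 1 / (1 + exp(y * z)) (assumed loss choice)."""
    return 1.0 / (1.0 + np.exp(label * np.asarray(scores, dtype=float)))


def nnpu_risk(pos_scores, unl_scores, prior):
    """Non-negative PU risk for classifier scores (hypothetical helper).

    pos_scores: classifier outputs on labeled positive samples
    unl_scores: classifier outputs on unlabeled samples
    prior: assumed fraction of positives in the unlabeled data
    """
    # Risk of positives classified as positive, weighted by the class prior.
    risk_pos_as_pos = prior * sigmoid_loss(pos_scores, +1).mean()
    # Risk of positives treated as negative, used to correct the unlabeled term.
    risk_pos_as_neg = prior * sigmoid_loss(pos_scores, -1).mean()
    # Risk of unlabeled samples treated as negative.
    risk_unl_as_neg = sigmoid_loss(unl_scores, -1).mean()
    # Estimated negative-class risk; it can go below zero when the model overfits.
    negative_part = risk_unl_as_neg - risk_pos_as_neg
    # The non-negative correction clamps that estimate at zero.
    return risk_pos_as_pos + max(0.0, negative_part)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pos = rng.normal(loc=1.0, size=100)   # synthetic scores, illustration only
    unl = rng.normal(loc=-0.5, size=500)  # synthetic scores, illustration only
    print(nnpu_risk(pos, unl, prior=0.3))
```

The clamp at zero is what distinguishes this estimator from the unbiased PU risk: without it, the estimated negative-class risk can become negative on a flexible model such as an MLP, which is a sign of overfitting to the labeled positives.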