A Positive-Unlabeled Learning Approach for Detecting Malicious In-app Purchases on the App Store

Bowen Hu,Ziyi Yu,Yadong Zhou,Sizhe He,Yang Liu,Ting Liu,Xiaohong Guan
DOI: https://doi.org/10.1109/case59546.2024.10711841
2024-01-01
Abstract:Malicious in-app purchases have been rampant recently and caused a tremendous financial loss for app developers. These purchases rarely leave anomalous content information and are difficult to label, so only a few labeled positive (malicious) samples can be obtained which are insufficient for supervised learning. Facing the challenge above, this paper deals with the problem from a novel perspective by modeling Positive-Unlabeled learning. Our proposed approach (PULA) first leverages the prior knowledge of in-app purchases and gets likely positive and negative examples from unlabeled ones. Then, we divide likely examples into several subsets and iteratively extract reliable positive and negative examples from the likely examples. Finally, the transaction association graph is constructed, and a belief propagation algorithm is developed to propagate existing labels to the unlabeled ones on the graph. For more effective classification, we also deliberately design features of the purchases and test their validity. The experimental results on the real data of in-app purchases show that after extracting reliable positive and negative samples from the unlabeled ones by PULA, classic classification methods can be easily used to detect malicious purchases and outperform baseline algorithms by 23.04% in AUC at least.
What problem does this paper attempt to address?