Semisupervised Game Player Categorization from Very Big Behavior Log Data

Jing Fan,Shaowen Gao,Yong Liu,Xinqiang Ma,Jiandang Yang,Changjie Fan
DOI: https://doi.org/10.1109/tsmc.2021.3066545
2022-01-01
Abstract:Extracting the specific category of the players, such as the malignant Bot, from the huge log data of the massive multiplayer online role playing games, denoted as MMORPGs, is an important basic task in game security and personal recommendation. In this article, we propose a parallel semisupervised framework to categorize specific game players with a few label-known target samples, which are denoted as bait players. Our approach first presents a feature representation model based on the players' level granularity, which can acquire aligned feature representations in the lower dimensional space from the players' original action sequences. Then, we propose a semisupervised clustering method, extended from the bisecting k-means model, to extract the specified players with the help of those bait players. Due to massive amounts of game log data, the computation complexity is an extreme challenge to implement our feature representation and semisupervised extraction approaches. We also propose a hierarchical parallelism framework, which allows the data to be computed horizontally and vertically simultaneously and enables varied parallel combinations for the steps of our semisupervised categorization approach. The comparable experiments on real-world MMORPGs' log data, containing more than 465 Gbytes and million players, are carried out to demonstrate the effectiveness and efficiency of our proposed approach compared with the state-of-the-art methods.
What problem does this paper attempt to address?