SEGCN: a subgraph encoding based graph convolutional network model for social bot detection

Feng Liu,Zhenyu Li,Chunfang Yang,Daofu Gong,Haoyu Lu,Fenlin Liu
DOI: https://doi.org/10.1038/s41598-024-54809-z
IF: 4.6
2024-02-20
Scientific Reports
Abstract:Message passing neural networks such as graph convolutional networks (GCN) can jointly consider various types of features for social bot detection. However, the expressive power of GCN is upper-bounded by the 1st-order Weisfeiler–Leman isomorphism test, which limits the detection performance for the social bots. In this paper, we propose a subgraph encoding based GCN model, SEGCN, with stronger expressive power for social bot detection. Each node representation of this model is computed as the encoding of a surrounding induced subgraph rather than encoding of immediate neighbors only. Extensive experimental results on two publicly available datasets, Twibot-20 and Twibot-22, showed that the proposed model improves the accuracy of the state-of-the-art social bot detection models by around 2.4%, 3.1%, respectively.
multidisciplinary sciences
What problem does this paper attempt to address?
The paper aims to address the problem of detecting social bots in social media. Specifically, existing Graph Convolutional Networks (GCNs) are limited by the expressive power of the 1st-order Weisfeiler–Leman isomorphism test when detecting social bots, which affects their detection performance. To this end, the paper proposes a Subgraph Encoding-based Graph Convolutional Network model (SEGCN), which enhances the model's expressive power by encoding the induced subgraph around a node rather than just encoding its direct neighbors, thereby improving the accuracy of social bot detection. The main contributions of the paper include: 1. Proposing an end-to-end social bot detection model that combines account semantic features, attribute features, and structural features. 2. Improving the expressive power of GCNs through subgraph encoding, which can capture fundamental structural information (such as cycles and triangles), making the model more suitable for detecting social bots. 3. Experimental results show that the model outperforms existing state-of-the-art social bot detection models on two public datasets (Twibot-20 and Twibot-22), with accuracy improvements of approximately 2.4% and 3.1%, respectively.