Mapping the Russian Internet Troll Network on Twitter using a Predictive Model

Sachith Dassanayaka, Ori Swed, Dimitri Volchenkov
DOI: https://doi.org/10.5890/JVTSD.2023.06.001
2024-09-12
Abstract: Russian Internet Trolls use fake personas to spread disinformation through multiple social media streams. Given the increased frequency of this threat across social media platforms, understanding those operations is paramount in combating their influence. Using Twitter content identified as part of the Russian influence network, we created a predictive model to map the network operations. We classify account types based on their authenticity function for a sub-sample of accounts by introducing logical categories and training a predictive model to identify similar behavior patterns across the network. Our model attains 88% prediction accuracy for the test set. Validation is done by comparing the similarities with the 3 million Russian troll tweets dataset. The result indicates a 90.7% similarity between the two datasets. Furthermore, we compare our model predictions on a Russian tweets dataset, and the results state that there is 90.5% correspondence between the predictions and the actual categories. The prediction and validation results suggest that our predictive model can assist with mapping the actors in such networks.
Social and Information Networks, Machine Learning
What problem does this paper attempt to address?
The paper sets out to identify and classify the activity of Russian Internet trolls on Twitter, in order to reveal how these fake accounts spread false information while posing as legitimate users. Specifically, the research aims to:

1. **Distinguish between real and fake users**: Russian Internet trolls spread false information under false identities, which makes real and fake users hard to tell apart. The central question is therefore how to identify these fake accounts by analyzing user behavior patterns.
2. **Understand the network structure**: The research builds a prediction model to map how these troll networks operate, in order to better understand their internal structure and division of functions. This helps reveal the different troll roles, how they are distributed, and how active they are in the network.
3. **Improve detection ability**: The research proposes a new method for detecting and classifying troll accounts so that their influence can be countered more effectively. It does so by introducing logical categories and training a prediction model on them.

### Research background

The Russian Internet Troll Network (ITN) spreads false information through fake accounts on social media platforms, which poses a serious threat to democratic discussion. Russia's interference through the Internet Research Agency (IRA) during the 2016 US presidential election in particular has attracted wide attention. These troll accounts hide their true identities by posing as ordinary American users, which makes it difficult for users and regulatory agencies to identify and stop the operations.

### Solutions

To address these problems, the researchers took the following steps:

1. **Dataset description**: The research used the publicly available IRA Twitter dataset, which covers about 9 million tweets in 58 languages. The analysis focused mainly on the English and Russian tweets.
2. **Feature extraction**: A variety of features were extracted from the dataset, including user ID, username, user profile, number of tweets, number of retweets, and number of followers. These features describe each user's activity patterns.
3. **Concept classification**: Based on the authenticity function of the false identities, the researchers proposed four concept categories:
   - **Fake news**: Accounts disguised as news media.
   - **Organization**: Accounts disguised as non-government organizations or enterprises.
   - **Political affiliation**: Accounts with obvious political tendencies.
   - **Individual**: Accounts disguised as ordinary individuals.
4. **Prediction model**: The researchers trained a machine-learning prediction model to identify and classify these accounts based on the extracted features. The model achieved 88% prediction accuracy on the test set and was validated by comparison with the known dataset of 3 million Russian troll tweets.

### Verification results

Validation against the 3 million Russian troll tweets dataset shows a 90.7% similarity between the two datasets. On a Russian tweets dataset, the model's predictions correspond to the actual categories in 90.5% of cases. Together, these results suggest that the prediction model can assist in identifying and classifying these fake accounts.

### Conclusion

The research maps the activities of Russian Internet trolls on Twitter by constructing a prediction model, and it provides a tool that can help distinguish between real and fake users. This matters for understanding and responding to this kind of network threat.
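The feature extraction step and the "correspondence between the predictions and the actual categories" reported above can be illustrated with a small sketch. This is not the authors' code: the record keys (`user_id`, `is_retweet`, `followers`), the category label strings, and the toy data are all assumptions made for illustration, and the paper's actual model is not reproduced here. The sketch only shows how tweet-level records might be collapsed into per-account features and how an agreement rate between predicted and known categories could be computed.

```python
from collections import Counter, defaultdict

# Label strings for the four concept categories (the spellings are assumptions).
CATEGORIES = ("fake_news", "organization", "political_affiliation", "individual")


def aggregate_account_features(tweets):
    """Collapse tweet-level records into one feature dict per account.

    Each record is a dict with hypothetical keys: user_id, is_retweet, followers.
    """
    features = defaultdict(lambda: {"tweets": 0, "retweets": 0, "followers": 0})
    for t in tweets:
        acc = features[t["user_id"]]
        acc["tweets"] += 1
        acc["retweets"] += int(t["is_retweet"])
        # Keep the largest follower count observed for the account.
        acc["followers"] = max(acc["followers"], t["followers"])
    return dict(features)


def validate_labels(labels):
    """Raise if any account carries a label outside the four categories."""
    unknown = {v for v in labels.values() if v not in CATEGORIES}
    if unknown:
        raise ValueError(f"unknown categories: {sorted(unknown)}")


def agreement(predicted, actual):
    """Fraction of accounts present in both mappings whose labels match."""
    shared = predicted.keys() & actual.keys()
    if not shared:
        raise ValueError("no accounts in common")
    return sum(predicted[a] == actual[a] for a in shared) / len(shared)


def confusion_counts(predicted, actual):
    """Count (actual, predicted) label pairs over the shared accounts."""
    shared = predicted.keys() & actual.keys()
    return Counter((actual[a], predicted[a]) for a in shared)


# Toy data, invented purely for illustration.
tweets = [
    {"user_id": "a1", "is_retweet": False, "followers": 120},
    {"user_id": "a1", "is_retweet": True, "followers": 150},
    {"user_id": "b2", "is_retweet": True, "followers": 9000},
]
account_features = aggregate_account_features(tweets)

predicted = {"a1": "individual", "b2": "fake_news",
             "c3": "organization", "d4": "individual"}
actual = {"a1": "individual", "b2": "fake_news",
          "c3": "political_affiliation", "d4": "individual"}
validate_labels(predicted)
validate_labels(actual)
score = agreement(predicted, actual)
pairs = confusion_counts(predicted, actual)
```

On the toy data, three of the four shared accounts carry matching labels, so `score` is 0.75. The `pairs` counter shows which categories are confused with each other, which is often more informative than a single agreement figure when the categories are unevenly sized.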