Analyzing large scale political discussions on Twitter: the use case of the Greek wiretapping scandal (#ypoklopes)

Ilias Dimitriadis, Dimitrios P. Giakatos, Stelios Karamanidis, Pavlos Sermpezis, Kelly Kiki, Athena Vakali
2024-09-17
Abstract: In this paper, we study the Greek wiretapping scandal, which was revealed in 2022 and attracted a lot of attention from the press and citizens. Specifically, we propose a methodology for collecting data and analyzing patterns of online public discussions on Twitter. We apply our methodology to the Greek wiretapping use case and present findings related to the evolution of the discussion over time, its polarization, and the role of the media. The methodology can be of wider use and can be replicated for other topics. Finally, we publicly provide an open dataset and online resources with the results.
Social and Information Networks
What problem does this paper attempt to address?
The paper analyzes large-scale political discussions on Twitter about the Greek wiretapping scandal. The authors propose a methodology for collecting and analyzing patterns of public discussion on Twitter and apply it to this case. Specifically, the paper addresses the following:

1. **Monitoring and Analysis of Large-scale Political Discussions**:
   - The authors design a general methodology (§3 and §4) for monitoring and analyzing large-scale political discussions on Twitter. It includes data collection, political inference, bot detection, polarization quantification, and user and content analysis.
   - The methodology is not limited to the Greek wiretapping scandal and can be extended to other online discussion topics.
2. **Specific Research on the Greek Wiretapping Scandal**:
   - The authors collected a dataset spanning more than one year, covering the entire discussion period (§3.2), and compiled auxiliary datasets on political affiliations, media accounts, and Twitter bot accounts (§3.3).
   - These datasets have been shared publicly for use by other researchers.
3. **Evolution, Polarization, and the Role of the Media in the Discussion**:
   - The authors analyzed how the discussion changed over time (§5.1) and found that the number of tweets increased significantly after major events or news releases.
   - They studied the role of the media as a major driver and influencer of the online discussion (§5.2).
   - They quantified the participation of users attributed to the "left-wing" and the "right-wing" and their degree of polarization (§5.3 and §5.4).
### Formula Representation

The paper mainly involves data analysis and social network analysis. Some formulas that may be used in such an analysis are listed here:

- **Friedkin & Johnsen (FJ) Polarization Measure**:

  \[ P = \frac{1}{n}\sum_{i=1}^{n}\left(\frac{\sum_{j=1}^{n} A_{ij} x_j}{\sum_{j=1}^{n} A_{ij}} - x_i\right)^2 \]

  where \(A\) is the adjacency matrix, \(n\) is the number of nodes, \(x_i\) is the initial opinion of node \(i\), and \(P\) is the polarization index.

- **PageRank Algorithm**:

  \[ PR(p_i) = \frac{1 - d}{N} + d\sum_{p_j \in M(p_i)} \frac{PR(p_j)}{L(p_j)} \]

  where \(d\) is the damping factor, \(N\) is the total number of nodes, \(M(p_i)\) is the set of nodes that link to \(p_i\), and \(L(p_j)\) is the number of outgoing links of \(p_j\).

Through these methods and analyses, the authors aim to reveal trends and insights in public discussions and to provide a reference for future research.
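
As a rough illustration of the two formulas in the Formula Representation section, the following minimal Python sketch evaluates them on toy inputs. It is not the authors' implementation. The function names (`pagerank`, `polarization_index`), the input formats, the default damping factor of 0.85, and the handling of nodes without outgoing links or without neighbours are all assumptions made here for the example.

```python
def pagerank(links, d=0.85, tol=1e-12, max_iter=500):
    """Iterate PR(p_i) = (1 - d)/N + d * sum_{p_j in M(p_i)} PR(p_j)/L(p_j).

    `links` maps each node to the list of nodes it points to (assumed format).
    """
    nodes = sorted(set(links) | {q for outs in links.values() for q in outs})
    n = len(nodes)
    pr = {p: 1.0 / n for p in nodes}
    for _ in range(max_iter):
        # Assumption: rank held by nodes with no outgoing links is spread
        # uniformly, so the scores keep summing to 1.
        dangling = sum(pr[p] for p in nodes if not links.get(p))
        new = {p: (1.0 - d) / n + d * dangling / n for p in nodes}
        for p_j, outs in links.items():
            for p_i in outs:
                new[p_i] += d * pr[p_j] / len(outs)
        delta = sum(abs(new[p] - pr[p]) for p in nodes)
        pr = new
        if delta < tol:
            break
    return pr


def polarization_index(adj, x):
    """Evaluate P = (1/n) * sum_i (sum_j A_ij x_j / sum_j A_ij - x_i)^2.

    `adj` is a square list-of-lists adjacency matrix, `x` a list of opinions.
    """
    n = len(x)
    total = 0.0
    for i in range(n):
        weight = sum(adj[i])
        if weight == 0:
            # Assumption: a node without neighbours contributes zero.
            continue
        neighbour_mean = sum(adj[i][j] * x[j] for j in range(n)) / weight
        total += (neighbour_mean - x[i]) ** 2
    return total / n


# Toy retweet-style graph: "a" and "b" both point to "c", and "c" points to "a".
ranks = pagerank({"a": ["c"], "b": ["c"], "c": ["a"]})
print(sorted(ranks, key=ranks.get, reverse=True))

# Two connected users holding opposite opinions (+1 and -1).
print(polarization_index([[0, 1], [1, 0]], [1.0, -1.0]))  # 4.0
```

In the toy graph, the node that receives the most links ends up with the highest score. The polarization expression is zero when every node agrees with the average of its neighbours and grows as neighbouring opinions diverge.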