Abstract:
Background: Digital spaces, and in particular social networking sites, are becoming increasingly present and influential in the functioning of our democracies. In this paper, we propose an integrated methodology for collecting, reconstructing, analyzing, and visualizing the evolution of a country's political landscape from Twitter data.
Method: The proposed method relies solely on the interactions between Twitter accounts and is independent of the characteristics of the shared content, such as the language of the tweets. We validate our methodology on a case study of the 2017 French presidential election (60 million Twitter exchanges between more than 2.4 million users) via two independent methods: (1) a comparison between our automated political categorization and a human categorization based on the evaluation of a sample of 5,000 profile descriptions, and (2) the correspondence between the reconfigurations detected in the reconstructed political landscape and key political events reported in the media. This latter validation demonstrated the ability of our approach to accurately reflect the reconfigurations at play in the off-line political scene.
Results: We built on this reconstruction to give insights into the opinion dynamics and the reconfigurations of political communities at play during a presidential election. First, we propose a quantitative description and analysis of the political engagement of members of political communities. Second, we analyze the impact of political communities on information diffusion, and in particular their role in the fake news phenomenon. We measure a differential echo chamber effect on the different types of political news (fake news, debunks, standard news) caused by the community structure, and we emphasize the importance of addressing the meso-structures of political networks in understanding the fake news phenomenon.
Conclusions: Social network data give access to an intermediate level of analysis, between sociological field surveys and large statistical studies (such as those conducted by national or international organizations). We demonstrate that, through the analysis of digital traces, such data make it possible to qualify and quantify the activity of political communities in a multi-polar political environment, as well as their temporal evolution and reconfiguration, their structure, their alliance strategies, and their semantic particularities during a presidential campaign. We conclude this paper with a comment on the political and ethical implications of the use of social network data in politics. We stress the importance of developing social macroscopes that will enable citizens to better understand how they collectively make society, and we propose as an example the "Politoscope", a macroscope that delivers some of our results in an interactive way.
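
The abstract describes a political categorization that uses only interactions between accounts and is checked against human labels. The Python sketch below illustrates that general idea under strong simplifying assumptions; it is not the authors' pipeline. It assumes that interactions are given as (retweeter, original author) pairs, that a few seed accounts carry known community labels, and that an account joins the community whose seeds it retweets most often. The function names, the `min_retweets` threshold, and the tie-handling rule are all hypothetical choices made for illustration.

```python
from collections import Counter, defaultdict


def assign_communities(retweets, seed_labels, min_retweets=2):
    """Label accounts from interactions alone (no tweet text is used).

    retweets: iterable of (retweeter, original_author) pairs.
    seed_labels: dict mapping a few known accounts to a community label
        (an assumption of this sketch, not something the abstract specifies).
    Returns a dict account -> label containing the seeds plus every account
    whose most-retweeted community is unambiguous and reaches min_retweets.
    """
    counts = defaultdict(Counter)
    for retweeter, author in retweets:
        label = seed_labels.get(author)
        # Count only retweets of labeled seeds made by unlabeled accounts.
        if label is not None and retweeter not in seed_labels:
            counts[retweeter][label] += 1

    assigned = dict(seed_labels)
    for account, label_counts in counts.items():
        (top_label, top_count), *rest = label_counts.most_common(2)
        is_tie = bool(rest) and rest[0][1] == top_count
        # Skip weak signals and ties rather than guessing a community.
        if top_count >= min_retweets and not is_tie:
            assigned[account] = top_label
    return assigned


def agreement(automated, human):
    """Fraction of human-labeled accounts whose automated label matches.

    Accounts the automated step left unlabeled are excluded from the ratio.
    """
    shared = [account for account in human if account in automated]
    if not shared:
        return 0.0
    matches = sum(automated[a] == human[a] for a in shared)
    return matches / len(shared)
```

In a validation of the kind the abstract mentions, `human` would hold the labels given by annotators to a sample of profile descriptions, and `agreement` would report how often the interaction-based label matches them on the accounts both methods cover.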

Crawling political communities in Twitter and extracting political affiliations

Crawling Twitter data through API: A technical/legal perspective

Identifying Topical Twitter Communities via User List Aggregation

Political Communities on Twitter: Case Study of the 2022 French Presidential Election

Reconstruction of the socio-semantic dynamics of political activist Twitter networks: Method and application to the 2017 French presidential election

Towards a Standard Sampling Methodology on Online Social Networks: Collecting Global Trends on Twitter

Collecting Influencers: A Comparative Study of Online Network Crawlers

A customisable pipeline for continuously harvesting socially-minded Twitter users

Party Prediction for Twitter

A General Method to Find Highly Coordinating Communities in Social Media through Inferred Interaction Links

Breaking the Communities: Characterizing community changing users using text mining and graph machine learning on Twitter

Social network analysis of Twitter interactions: a directed multilayer network approach

Identifying the Community Roles of Social Capitalists in the Twitter Network

Contributive Social Capital Extraction From Different Types of Online Data Sources

On Crawling Community-aware Online Social Network Data

Big Questions for Social Media Big Data: Representativeness, Validity and Other Methodological Pitfalls

A Method for Estimating Individual Socioeconomic Status of Twitter Users

A multilevel clustering technique for community detection

Investigating the Potential of Aggregated Tweets as Surrogate Data for Forecasting Civil Protests

Joint Inference of User Community and Interest Patterns in Social Interaction Networks

Curating Social Media Data