A Public Dataset Tracking Social Media Discourse about the 2024 U.S. Presidential Election on Twitter/X

Ashwin Balasubramanian, Vito Zou, Hitesh Narayana, Christina You, Luca Luceri, Emilio Ferrara
2024-11-01
Abstract: In this paper, we introduce the first release of a large-scale dataset capturing discourse on X (a.k.a. Twitter) related to the upcoming 2024 U.S. Presidential Election. Our dataset comprises 22 million publicly available posts on X.com, collected from May 1, 2024, to July 31, 2024, using a custom-built scraper, which we describe in detail. By employing targeted keywords linked to key political figures, events, and emerging issues, we aligned data collection with the election cycle to capture evolving public sentiment and the dynamics of political engagement on social media. This dataset offers researchers a robust foundation to investigate critical questions about the influence of social media in shaping political discourse, the propagation of election-related narratives, and the spread of misinformation. We also present a preliminary analysis that highlights prominent hashtags and keywords within the dataset, offering initial insights into the dominant themes and conversations occurring in the lead-up to the election. Our dataset is available at: https://github.com/sinking8/usc-x-24-us-election
Social and Information Networks