Val Alvern Cueco Ligo, Lam Yin Cheung, Roy Ka-Wei Lee, Koustuv Saha, Edson C. Tandoc Jr., Navin Kumar
Abstract: Social media platforms, particularly Telegram, play a pivotal role in shaping public perceptions and opinions on global and national issues. Unlike traditional news media, Telegram allows for the proliferation of user-generated content with minimal oversight, making it a significant venue for the spread of controversial and misinformative content. During the COVID-19 pandemic, Telegram's popularity surged in Singapore, a country with one of the highest rates of social media use globally. We leverage Singapore-based Telegram data to analyze information flows within groups focused on COVID-19 and climate change. Using k-means clustering, we identified distinct user archetypes, including Strategic Disruptor, Empirical Enthusiast, Inquisitive Moderate, and Critical Examiner, each contributing uniquely to the discourse. We developed a model to classify users into these clusters (Precision: Climate change: 0.99; COVID-19: 0.95).
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve
This paper examines user behavior patterns and information flow in Singapore-based Telegram discussions about COVID-19 and climate change. Specifically, it addresses the following:
1. **Identify User Archetypes**: By analyzing Telegram group data within Singapore, identify different user archetypes in discussions about COVID-19 and climate change. These archetypes include Strategic Disruptor, Empirical Enthusiast, Inquisitive Moderate, and Critical Examiner, among others.
2. **Understand Information Dissemination Dynamics**: Study how these different user archetypes contribute to the dissemination of information, particularly the pathways of information flow involving controversial and misleading content.
3. **Develop a Classification Model**: Develop a model to classify users into the identified archetypes to better understand and manage their behavior.
### Research Background
Social media platforms, especially Telegram, play an increasingly important role in shaping public perceptions and opinions on global and national issues. Unlike traditional news media, Telegram allows user-generated content with minimal regulation, making it a significant venue for the spread of controversial and misleading content. During the COVID-19 pandemic, Telegram's popularity surged in Singapore, and the platform became an important venue for discussion. Analyzing Singapore-based Telegram data therefore helps in understanding information flow and managing misleading information.
### Research Methods
1. **Data Collection**: Selected multiple Telegram groups related to COVID-19 and climate change and downloaded all chat records since the groups' creation.
2. **Feature Extraction**: Extracted two main types of features: behavioral features (e.g., the number of times links are shared, the number of replies sent) and textual features (e.g., n-grams, word embeddings, sentiment analysis).
3. **Clustering Analysis**: Applied Principal Component Analysis (PCA) and k-means clustering to the COVID-19 and climate change data separately to identify different user archetypes.
4. **Classification Model**: Used a decision tree classifier for multi-label classification to categorize users into the corresponding archetypes.
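
The feature extraction and clustering steps above can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy messages, the function names `behavioral_features` and `kmeans`, the two-feature representation, and the choice of the first `k` points as initial centroids are all assumptions made for illustration. PCA, textual features, and the decision tree classifier are omitted to keep the example dependency-free.

```python
import math
from collections import defaultdict

# Hypothetical toy messages: (user_id, text, is_reply). Not data from the paper.
MESSAGES = [
    ("u1", "see https://example.com/a", False),
    ("u1", "also https://example.com/b", False),
    ("u1", "and https://example.com/c", True),
    ("u2", "why is that?", True),
    ("u2", "can you clarify?", True),
    ("u2", "thanks", True),
    ("u3", "read https://example.com/d", False),
    ("u3", "https://example.com/e too", False),
    ("u4", "not sure I agree", True),
    ("u4", "what is the source?", True),
]


def behavioral_features(messages):
    """Return sorted user ids and per-user [links shared, replies sent]."""
    counts = defaultdict(lambda: [0, 0])
    for user, text, is_reply in messages:
        counts[user][0] += text.count("http")  # crude link detector (assumption)
        counts[user][1] += int(is_reply)
    users = sorted(counts)
    return users, [[float(v) for v in counts[u]] for u in users]


def kmeans(points, k, iterations=20):
    """Plain Lloyd's algorithm; the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


users, features = behavioral_features(MESSAGES)
labels, centroids = kmeans(features, k=2)
for user, label in zip(users, labels):
    print(user, "-> cluster", label)
```

On this toy data the link-heavy users (`u1`, `u3`) land in one cluster and the reply-heavy users (`u2`, `u4`) in the other. A real analysis would typically standardize the features and use a more robust initialization, since k-means can settle into a poor local optimum depending on the starting centroids.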
### Main Findings
1. **User Archetypes**:
- **Strategic Disruptor**: Actively engages in challenging mainstream views, frequently sharing controversial and disruptive content.
- **Empirical Enthusiast**: Actively shares data and empirical evidence to support their views on climate change.
- **Inquisitive Moderate**: Moderately participates in discussions, seeking clarification and sharing content without strong biases.
- **Critical Examiner**: Highly active, posting a large amount of data-driven and analytical content, focusing on demographics, technological impacts, and social trends.
- **Conspiratorial Amplifier**: Highly active, potentially amplifying conspiracy theories and questioning the integrity of mainstream narratives.
2. **Information Dissemination Dynamics**:
- Each user archetype contributes differently to information dissemination. For example, Strategic Disruptors tend to question mainstream media and government narratives, while Empirical Enthusiasts tend to share data and empirical evidence.
- In climate change discussions, Strategic Disruptors mainly criticize the motives and integrity of mainstream environmental narratives, while in COVID-19 discussions, Conspiratorial Amplifiers broadly question the credibility of public health narratives, government policies, and media reports.
### Model Performance
- **COVID-19**: Precision is 0.95, Recall is