Abstract:The wide spread of rumors on social media has caused a negative impact on people's daily life, leading to potential panic, fear, and mental health problems for the public. How to debunk rumors as early as possible remains a challenging problem. Existing studies mainly leverage information propagation structure to detect rumors, while very few works focus on correlation among users that they may coordinate to spread rumors in order to gain large popularity. In this paper, we propose a new detection model, that jointly learns both the representations of user correlation and information propagation to detect rumors on social media. Specifically, we leverage graph neural networks to learn the representations of user correlation from a bipartite graph that describes the correlations between users and source tweets, and the representations of information propagation with a tree structure. Then we combine the learned representations from these two modules to classify the rumors. Since malicious users intend to subvert our model after deployment, we further develop a greedy attack scheme to analyze the cost of three adversarial attacks: graph attack, comment attack, and joint attack. Evaluation results on two public datasets illustrate that the proposed MODEL outperforms the state-of-the-art rumor detection models. We also demonstrate our method performs well for early rumor detection. Moreover, the proposed detection method is more robust to adversarial attacks compared to the best existing method. Importantly, we show that it requires a high cost for attackers to subvert user correlation pattern, demonstrating the importance of considering user correlation for rumor detection.

Message Injection Attack on Rumor Detection under the Black-Box Evasion Setting Using Large Language Model

HAT4RD: Hierarchical Adversarial Training for Rumor Detection in Social Media

Detect Rumors in Microblog Posts for Low-Resource Domains via Adversarial Contrastive Learning

Target-driven Attack for Large Language Models

Can Large Language Models Detect Rumors on Social Media?

Red Teaming Language Model Detectors with Language Models

Interpretable and Effective Reinforcement Learning for Attacking against Graph-based Rumor Detection

Black-Box Opinion Manipulation Attacks to Retrieval-Augmented Generation of Large Language Models

Rumor Detection with a novel graph neural network approach

The Philosopher's Stone: Trojaning Plugins of Large Language Models

Imposter.AI: Adversarial Attacks with Hidden Intentions towards Aligned Large Language Models

What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

A Black-Box Attack Method against Machine-Learning-Based Anomaly Network Flow Detection Models

Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Neural Carrier Articles

On the Risk of Misinformation Pollution with Large Language Models

Prompt Injection Attacks in Defended Systems

Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks

Composite Backdoor Attacks Against Large Language Models

Data Stealing Attacks against Large Language Models via Backdooring

RAFT: Realistic Attacks to Fool Text Detectors

Hidden You Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Logic Chain Injection