Abstract: Microblogging websites such as Twitter have become popular platforms for sharing and disseminating information. However, they are also rife with spammers who frequently conduct social spamming. The large number of social spammers and spam messages severely degrades the user experience and hinders the healthy development of microblogging systems. Effectively detecting social spammers and spam messages is therefore of great value to both microblogging users and websites. Existing studies usually treat social spammer detection and spam message detection as two separate tasks. However, the two are inherently connected: social spammers tend to post more spam messages, and spam messages are highly likely to have been posted by social spammers. Combining social spammer detection with spam message detection thus has the potential to boost the performance of both tasks. In this paper, we propose a unified approach for the co-detection of social spammers and spam messages in microblogging. Our approach uses the posting relations between users and messages to combine social spammer detection with spam message detection. In addition, we extract the social relations between users and the connections between messages to refine the detection results. We regard these social contexts as a graph structure over the detection results and incorporate them into our approach as regularization terms. We also introduce an efficient optimization algorithm to solve the resulting model and propose an accelerated method for its most time-consuming step. Extensive experiments on a real-world microblog dataset demonstrate that our approach improves the performance of both social spammer detection and spam message detection effectively and efficiently.
