Abstract:Twitter spam has long been a critical but difficult problem to be addressed. So far, researchers have developed a series of machine learning–based methods and blacklisting techniques to detect spamming activities on Twitter. According to our investigation, current methods and techniques have achieved the accuracy of around 87%. However, because of the problems of spam drift and information fabrication, these machine learning–based methods cannot efficiently detect spam activities in real‐life scenarios. Meanwhile, the blacklisting method also cannot catch up with the variations of spamming activities, as manually inspecting suspicious URLs is extremely timeconsuming. In this paper, we proposed a novel technique based on deep‐learning technique to address the above challenges. The syntax of each tweet will be learned through WordVector and trained by deep learning. We then constructed a binary classifier to differentiate spam and regular tweets. In experiments, we collected and labeled a 10‐day real tweet dataset as ground truth to evaluate our proposed method. We first went for empirical analysis with a series of comparisons to other methods: (1) performance of different classifiers, (2) other existing text‐based methods, and (3) nontext‐based detection techniques. According to the experiment results, our proposed method largely outperformed previous methods. We further conducted principle component analysis on typical methods to theoretically justify the outperformance of our method. We extracted all kinds of features via dimensionality reduction. It was found that our features were most distinct among all the detection methods. This well demonstrated the outperformance of our method.

Spammer Detection Based on Comprehensive Features in Sina Microblog

Spammer detection on Sina Micro-Blog

Feature Importance Analysis for Spammer Detection in Sina Weibo

Detection Method of Spam Based on Multi-Features of Micro-Blog

Robust Spammer Detection in Microblogs

SpamDia: Spammer Diagnosis in Sina Weibo Microblog

Detecting Spam on Sina Weibo

Analysis and Identification of Spamming Behaviors in Sina Weibo Microblog

The Spammer Detection based on Logistic Regression

Detecting Spam in Chinese Microblogs - A Study on Sina Weibo

Robust Spammer Detection in Microblogs: Leveraging User Carefulness

Spammer Detection Based On Hidden Markov Model In Micro-Blogging

Leveraging Careful Microblog Users For Spammer Detection

Community Based Spammer Detection In Social Networks

Spammer Detection On Online Social Networks Based On Logistic Regression

Leveraging Behavior Diversity to Detect Spammers in Online Social Networks.

Sina-Weibo Spammer Detection with GBDT.

Detection of spam mutual concerns in micro-blogs based on multi-features

Detecting Spamming Activities in Twitter Based on Deep‐learning Technique

ELM-based spammer detection in social networks

Health-Related Spammer Detection on Chinese Social Media.