Abstract:As is the case with many social media websites, the Community Question Answering (CQA) portal has become a target for spammers to disseminate promotion information. Previous works mainly focus on identifying low-quality answers or detecting spam information in question-answer (QA) pairs. However, these works suffer from long delay since they all rely on the information of answers or answerers while questions have been displayed on the websites for some time and attracted certain user traffic. As a matter of fact, spammers on CQA platforms also act as questioners and involve promotion information in their questions. So if they can be detected as early as possible, the questions will not appear on the websites and affect legitimate users. In this paper, we design a framework for early detection of promotion campaigns in CQA based on only question information and questioner profile. First, we propose a novel sampling method for identifying the questions that contain promotion information, which compose the positive dataset. We also sample an unlabeled dataset of unsolved questions during a certain period of time. Then, we compare the characteristics of question information and user profiles between the two datasets, which are also used as features in the learning process. Finally, we apply and compare several PU (Positive and Unlabeled examples) learning algorithms to find positive examples in the unlabeled dataset. In our approach, no answer side information is needed, which means that it can detect spamming activities as soon as the question is posted. Experimental results based on about 0.7 million questions derived from a popular Chinese CQA portal indicate that our approach can detect questions related to promotion campaigns as effectively as but more efficiently than the state-of-the-art QA pair level detection methods.

Detecting Collusive Spamming Activities in Community Question Answering

Collusive spam detection from Chinese community question answering sites: A collective classification framework

Detecting Spammers in Community Question Answering

Detecting Promotion Campaigns in Community Question Answering.

Early Detection of Promotion Campaigns in Community Question Answering.

Toward Personalized Activity Level Prediction in Community Question Answering Websites

Analyzing and Detecting Adversarial Spam on a Large-scale Online APP Review System.

The Best Answers? Think Twice: Online Detection of Commercial Campaigns in the CQA Forums

Community-Based Question Answering Via Heterogeneous Social Network Learning

Collusion-aware detection of review spammers in location based social networks

Detecting high-quality posts in community question answering sites

A New Approach to Detect User Collusion Behavior in Online QA System

Answer Quality Analysis on Community Question Answering

Leveraging Crowdsourcing for Efficient Malicious Users Detection in Large-Scale Social Networks

Network Embedding-Based Approach for Detecting Collusive Spamming Groups on E-Commerce Platforms

Automatically Grouping Questions in Yahoo! Answers.

Question Retrieval for Community-Based Question Answering Via Heterogeneous Social Influential Network.

Malicious Crowdsourcing Worker Detection Using Privacy-Aware Group Queries.

Predicting Long-Term Impact Of Cqa Posts: A Comprehensive Viewpoint

Community-Based Question Answering Via Asymmetric Multi-Faceted Ranking Network Learning.

Camouflage is NOT Easy: Uncovering Adversarial Fraudsters in Large Online App Review Platform