Collusion-aware detection of review spammers in location based social networks

Jiuxin Cao,Rongqing Xia,Yifang Guo,Zhuo Ma
DOI: https://doi.org/10.1007/s11280-018-0614-x
2018-07-16
World Wide Web
Abstract: To ensure the quality of online reviews, more and more location-based social networks (LBSNs), such as Yelp, have established filtering systems to detect groups of review spammers. This is not an easy task: review spammers use camouflage, keeping their spam behavior at a very low density so that they blend in with normal users. Driven by profit, these camouflaged spammers are hired by stores to write fake reviews in groups, either to promote those stores or to belittle their competitors. To counter this unhealthy competition, we propose a novel detection mechanism to discern collusive review spammers, both individuals and groups. The key point of our mechanism is to identify hidden spammers through multiple anomalous relationship features, especially the collusive relation between review spammers and the business competition between locations. Based on these multi-view anomalous features, two detection models are proposed, for individual and group discovery respectively. For malicious individuals, a detection model based on a Markov Random Field (MRF) is constructed to formalize an inference problem, in which the marginal distributions of users and locations are calculated. For review spammer groups, a hierarchical agglomerative clustering algorithm is designed around a new validity index so that the collusion relation within each group is as close as possible. Experimental results show that our method detects collusive spammers and groups more accurately and comprehensively than existing approaches. Additional experiments also show the effectiveness of each anomalous feature in detecting review spammers.
What problem does this paper attempt to address?
The paper addresses the detection of collusive review spammers, both individuals and groups, in location-based social networks (LBSNs). Specifically, it focuses on identifying spammers who camouflage themselves by keeping their spam behavior at a low density. These spammers are usually driven by profit and are hired to write fake reviews for certain merchants, either to boost those merchants' reputations or to belittle their competitors. To counter this unhealthy competition, the paper proposes a new detection mechanism that uncovers hidden spammers through multiple anomalous relationship features, especially the collusive relation between review spammers and the business competition between locations.

### Main contributions:

1. **Detection of collusive review spammers, individuals and groups**: The paper proposes a detection method for both individual spammers and spammer groups. It is particularly useful when the density of the review network and of spammers is low, a setting in which traditional methods have limited effectiveness.
2. **Multi-view anomalous feature extraction**: Based on the Markov Random Field (MRF) model, multi-view anomalous features of collusive review spammers are extracted and analyzed, so that the method performs better than other methods on the Yelp dataset.
3. **Validation of the new features**: The experimental results demonstrate that the relationship features proposed in the paper (such as the collusive relation between review spammers and the business competition between locations) are effective in detecting review spammers.
4. **Hierarchical clustering algorithm**: A hierarchical clustering algorithm based on a new validity index is proposed to ensure that the collusive relation within each group is as tight as possible, and further experiments verify that the discovered spammer groups have a strong internal collusive relation.

### Method overview:

- **Individual detection model**: A detection model based on the Markov Random Field (MRF) is constructed, and detection is formalized as an inference problem that computes the marginal distributions of users and locations respectively.
- **Group detection model**: A hierarchical clustering algorithm is designed, guided by the new validity index, so that the collusive relation within each group is as tight as possible.

### Experimental results:

- The experimental results show that the method detects collusive review spammers and their groups more accurately and comprehensively, with higher precision and recall than existing methods.
- Further experiments also verify the effectiveness of each anomalous feature in detecting review spammers.

### Conclusion:

The proposed method achieves strong results in detecting collusive review spammers, especially when the density of the review network and of spammers is low. By introducing multi-view anomalous features and a new clustering algorithm, it identifies hidden review spammers more effectively and promotes fair commercial competition.
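
To make the individual detection model more concrete, the following is a minimal sketch of marginal inference on a pairwise MRF over a user-location review graph, using sum-product loopy belief propagation. It is not the paper's model: the binary states (1 = suspicious), the prior values, the single shared compatibility table `compat`, and the names `loopy_bp`, `u1`, `l1` and so on are all assumptions chosen for illustration, since the summary does not give the actual potentials or features.

```python
from collections import defaultdict


def loopy_bp(edges, priors, compat, iters=20):
    """Sum-product loopy belief propagation on a pairwise MRF, binary states.

    edges  : list of (node_a, node_b) undirected edges (user-location reviews)
    priors : dict node -> [p(state 0), p(state 1)]; state 1 = suspicious
    compat : symmetric 2x2 table, compat[a][b] = potential for neighbouring
             states a and b (hypothetical values, not taken from the paper)
    Returns a dict node -> normalized marginal [p0, p1].
    """
    neighbors = defaultdict(list)
    for a, b in edges:
        neighbors[a].append(b)
        neighbors[b].append(a)

    # messages[(i, j)] is the message from i to j, indexed by j's state.
    messages = {}
    for a, b in edges:
        messages[(a, b)] = [0.5, 0.5]
        messages[(b, a)] = [0.5, 0.5]

    for _ in range(iters):
        new_messages = {}
        for (i, j) in messages:
            out = [0.0, 0.0]
            for sj in (0, 1):
                for si in (0, 1):
                    prod = priors[i][si] * compat[si][sj]
                    # Combine messages into i from every neighbour except j.
                    for k in neighbors[i]:
                        if k != j:
                            prod *= messages[(k, i)][si]
                    out[sj] += prod
            total = out[0] + out[1]
            new_messages[(i, j)] = [out[0] / total, out[1] / total]
        messages = new_messages  # synchronous update

    beliefs = {}
    for node in priors:
        b = list(priors[node])
        for k in neighbors[node]:
            for s in (0, 1):
                b[s] *= messages[(k, node)][s]
        total = b[0] + b[1]
        beliefs[node] = [b[0] / total, b[1] / total]
    return beliefs


# Toy graph: u1 and u2 both reviewed l1; u3 reviewed l2.
edges = [("u1", "l1"), ("u2", "l1"), ("u3", "l2")]
priors = {
    "u1": [0.2, 0.8],  # assumed: anomalous features make u1 look suspicious
    "u2": [0.5, 0.5],
    "u3": [0.5, 0.5],
    "l1": [0.5, 0.5],
    "l2": [0.5, 0.5],
}
compat = [[0.9, 0.1], [0.1, 0.9]]  # assumed: neighbours tend to share a state

beliefs = loopy_bp(edges, priors, compat)
for node in sorted(beliefs):
    print(node, round(beliefs[node][1], 3))
```

In this toy graph the suspicion attached to `u1` propagates through the shared location `l1` to `u2`, while `u3`, which has no link to any suspicious node, keeps its uninformative prior. On graphs with cycles, loopy belief propagation yields approximate rather than exact marginals.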
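
The group detection model can be illustrated with a small sketch of hierarchical agglomerative clustering over pairwise collusion scores. The paper's new validity index is not specified in this summary, so the sketch substitutes a plain average-linkage threshold (`min_link`) as the stopping rule; that substitution, the similarity values, and the names `agglomerate` and `average_linkage` are assumptions for illustration only.

```python
def average_linkage(cluster_a, cluster_b, sim):
    """Mean pairwise collusion score between two clusters (missing pairs = 0)."""
    total = sum(
        sim.get(frozenset((x, y)), 0.0) for x in cluster_a for y in cluster_b
    )
    return total / (len(cluster_a) * len(cluster_b))


def agglomerate(users, sim, min_link=0.5):
    """Bottom-up clustering: repeatedly merge the two most similar clusters.

    Merging stops once the best available average linkage drops below
    min_link. This threshold is a stand-in for the paper's validity index,
    whose definition is not given here.
    """
    clusters = [[u] for u in users]
    while len(clusters) > 1:
        best_score, best_pair = -1.0, None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                score = average_linkage(clusters[i], clusters[j], sim)
                if score > best_score:
                    best_score, best_pair = score, (i, j)
        if best_score < min_link:
            break  # no remaining pair of clusters is tight enough to merge
        i, j = best_pair
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters


# Hypothetical pairwise collusion scores between four suspected users.
sim = {
    frozenset(("a", "b")): 0.9,
    frozenset(("a", "c")): 0.8,
    frozenset(("b", "c")): 0.7,
    frozenset(("c", "d")): 0.1,
}

groups = agglomerate(["a", "b", "c", "d"], sim)
print(groups)
```

Here `a`, `b` and `c` end up in one group because their mutual scores stay above the threshold, while `d` remains alone. A validity index such as the one the paper describes would replace the fixed `min_link` test with a criterion that evaluates how tight the collusive relation is inside each candidate group.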