Spotting Suspicious Behaviors in Multimodal Data: A General Metric and Algorithms

Meng Jiang,Alex Beutel,Peng Cui,Bryan Hooi,Shiqiang Yang,Christos Faloutsos
DOI: https://doi.org/10.1109/tkde.2016.2555310
IF: 9.235
2016-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:Many commercial products and academic research activities are embracing behavior analysis as a technique for improving detection of attacks of many sorts-from retweet boosting, hashtag hijacking to link advertising. Traditional approaches focus on detecting dense blocks in the adjacency matrix of graph data, and recently, the tensors of multimodal data. No method gives a principled way to score the suspiciousness of dense blocks with different numbers of modes and rank them to draw human attention accordingly. In this paper, we first give a list of axioms that any metric of suspiciousness should satisfy; we propose an intuitive, principled metric that satisfies the axioms, and is fast to compute; moreover, we propose CrossSpot, an algorithm to spot dense blocks that are worth inspecting, typically indicating fraud or some other noteworthy deviation from the usual, and sort them in the order of importance (“suspiciousness”). Finally, we apply CrossSpot to the real data, where it improves the F1 score over previous techniques by 68 percent and finds suspicious behavioral patterns in social datasets spanning 0.3 billion posts.
What problem does this paper attempt to address?