Cohesive Subgraph Mining on Attributed Graph

Fan Zhang, Ying Zhang, Lu Qin, Wenjie Zhang, Xuemin Lin
2015-01-01
Abstract: Finding cohesive subgraphs is a fundamental graph problem with a wide spectrum of applications. In this paper, we investigate this problem in the context of attributed graphs, where each vertex is associated with content (e.g., geo-locations, tags, and keywords). To properly capture the cohesiveness of the vertices in a subgraph from both the graph structure and the vertex attribute perspectives, we advocate a novel cohesive subgraph model, namely the (k,r)-core. In particular, we adopt the popular concept of k-core as the structural constraint, such that each vertex in a (k,r)-core connects to at least k other vertices. Meanwhile, a user-defined similarity threshold (i.e., the similarity constraint) ensures that the contents of the vertices in the same (k,r)-core are similar to each other. We aim to devise efficient algorithms to enumerate all maximal (k,r)-cores and to find the maximum (k,r)-core, where both problems are shown to be NP-hard. Effective and efficient pruning techniques are proposed to significantly reduce the search space of the two algorithms. A novel double k-core based upper bound computation method is also devised to enhance the performance of the maximum (k,r)-core computation algorithm. According to the different natures of the two mining algorithms, we devise effective search orders. Our comprehensive experiments on real-life data demonstrate that the maximal/maximum (k,r)-cores enable us to find interesting cohesive subgraphs, and that the performance of the two mining algorithms is significantly improved by our effective pruning techniques and smart search orders.
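The two constraints that define a (k,r)-core can be illustrated with a small checker. The following is a minimal sketch and not the paper's algorithm: it only tests whether a given vertex set satisfies the structural and similarity constraints as the abstract states them, and it does not enumerate maximal cores or apply any pruning. It reads r as the similarity threshold, which the (k,r) name suggests. Jaccard similarity over keyword sets is an assumed metric, and the function names, the adjacency-dictionary representation, and the toy graph are all hypothetical.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity of two keyword sets (an assumed metric)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def satisfies_kr_constraints(adj, attrs, vertices, k, r, sim=jaccard):
    """Check both constraints on the subgraph induced by `vertices`.

    adj   : dict mapping each vertex to the set of its neighbours
    attrs : dict mapping each vertex to its set of keywords
    k     : minimum number of neighbours inside the vertex set
    r     : minimum pairwise similarity (assumed reading of the threshold)
    """
    members = set(vertices)
    # Structural constraint: each vertex has at least k neighbours in the set.
    for v in members:
        if len(adj[v] & members) < k:
            return False
    # Similarity constraint: every pair of vertices is at least r-similar.
    for u, v in combinations(members, 2):
        if sim(attrs[u], attrs[v]) < r:
            return False
    return True


# Hypothetical toy graph: a triangle a-b-c, plus a pendant vertex d attached to c.
adj = {
    "a": {"b", "c"},
    "b": {"a", "c"},
    "c": {"a", "b", "d"},
    "d": {"c"},
}
attrs = {
    "a": {"x", "y"},
    "b": {"x", "y"},
    "c": {"x", "y", "z"},
    "d": {"w"},
}

print(satisfies_kr_constraints(adj, attrs, {"a", "b", "c"}, k=2, r=0.5))
print(satisfies_kr_constraints(adj, attrs, {"a", "b", "c", "d"}, k=2, r=0.5))
```

In this toy graph the triangle passes with k=2 and r=0.5, while adding d fails the structural constraint because d has only one neighbour in the set. The pairwise similarity check is quadratic in the size of the vertex set, which is acceptable for illustration but says nothing about how the paper's algorithms handle the similarity constraint.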