Attributed Graph Clustering with Subspace Stochastic Block Model.

Haoran Chen, Zhongjing Yu, Qinli Yang, Junming Shao
DOI: https://doi.org/10.1016/j.ins.2020.05.044
IF: 8.1
2020-01-01
Information Sciences
Abstract: Inspired by the principle of homophily, most existing graph clustering approaches assume that the formation of clusters is highly related to node attributes, and thus leverage node information to improve graph clustering performance. However, utilizing all attributes as supplemental information for graph clustering may fail on real-world attributed graphs, since only a subset of attributes is truly relevant to the formation of clusters, and the relevant attributes (i.e., attribute subspaces) for different clusters often differ substantially in real-world graphs. Therefore, in this paper, we propose a subspace stochastic block model (SSB) to explore the cluster structures in attributed graphs. The key point is to view both topological structure and attribute information as the latent factors that drive the formation of clusters in the newly proposed generative model. More specifically, relevant attributes are iteratively learned for each cluster and subsequently integrated into the stochastic block model as valuable information. To maximize the likelihood function, an expectation–maximization strategy is developed to infer all parameters efficiently, so that all clusters and their corresponding attribute subspaces are identified simultaneously. Extensive experimental results on both synthetic and real-world graphs demonstrate the effectiveness of SSB and show its superiority over many state-of-the-art approaches.
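The generative view described in the abstract can be illustrated with a toy sampler. This is a minimal sketch under stated assumptions, not the SSB model or its EM inference from the paper: the function names (sample_attributed_sbm, attribute_variances), the two-level edge probabilities p_in and p_out, the Gaussian and uniform attribute distributions, and the use of within-cluster variance as a relevance hint are all hypothetical choices made for illustration. It shows only the two ingredients the abstract names: edges drawn from a stochastic block model, and attributes that are informative only on a cluster-specific subspace.

```python
import random


def sample_attributed_sbm(sizes, p_in, p_out, subspaces, n_attrs, seed=0):
    """Sample a toy attributed graph (illustrative assumption, not the paper's SSB).

    sizes:     number of nodes in each cluster
    p_in:      edge probability for two nodes in the same cluster
    p_out:     edge probability for two nodes in different clusters
    subspaces: for each cluster, the set of attribute indices assumed relevant
    n_attrs:   total number of attributes per node
    """
    rng = random.Random(seed)
    labels = [k for k, size in enumerate(sizes) for _ in range(size)]
    n = len(labels)

    # Topology: every undirected edge is an independent Bernoulli draw whose
    # probability depends only on the cluster labels of its two endpoints.
    edges = set()
    for i in range(n):
        for j in range(i + 1, n):
            p = p_in if labels[i] == labels[j] else p_out
            if rng.random() < p:
                edges.add((i, j))

    # Attributes: relevant dimensions concentrate around a cluster-specific
    # centre; every other dimension is uniform noise on [0, 1].
    centres = [
        {d: rng.uniform(0.0, 1.0) for d in subspaces[k]}
        for k in range(len(sizes))
    ]
    attrs = []
    for i in range(n):
        k = labels[i]
        row = []
        for d in range(n_attrs):
            if d in subspaces[k]:
                row.append(rng.gauss(centres[k][d], 0.01))
            else:
                row.append(rng.uniform(0.0, 1.0))
        attrs.append(row)
    return labels, edges, attrs


def attribute_variances(attrs, labels, cluster):
    """Per-attribute variance inside one cluster; low variance hints at relevance."""
    rows = [a for a, lab in zip(attrs, labels) if lab == cluster]
    variances = []
    for d in range(len(rows[0])):
        col = [r[d] for r in rows]
        mean = sum(col) / len(col)
        variances.append(sum((x - mean) ** 2 for x in col) / len(col))
    return variances
```

For example, calling sample_attributed_sbm([30, 30], 0.5, 0.05, [{0, 1}, {2, 3}], 4) yields two clusters whose relevant attributes differ; attribute_variances for cluster 0 should then be much smaller on attributes 0 and 1 than on attributes 2 and 3, which is the kind of cluster-specific signal a subspace-aware model would need to pick up.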