Phase Transitions and Optimal Algorithms for Semisupervised Classifications on Graphs: from Belief Propagation to Graph Convolution Network

Pengfei Zhou,Tianyi Li,Pan Zhang
DOI: https://doi.org/10.1103/physrevresearch.2.033325
2019-01-01
Abstract:We perform theoretical and algorithmic studies for the problem of clusteringand semi-supervised classification on graphs with both pairwise relationalinformation and single-point feature information, upon a joint stochastic blockmodel for generating synthetic graphs with both edges and node features.Asymptotically exact analysis based on the Bayesian inference of the underlyingmodel are conducted, using the cavity method in statistical physics.Theoretically, we identify a phase transition of the generative model, whichputs fundamental limits on the ability of all possible algorithms in theclustering task of the underlying model. Algorithmically, we propose a beliefpropagation algorithm that is asymptotically optimal on the generative model,and can be further extended to a belief propagation graph convolution neuralnetwork (BPGCN) for semi-supervised classification on graphs. For the firsttime, well-controlled benchmark datasets with asymptotially exact propertiesand optimal solutions could be produced for the evaluation of graph convolutionneural networks, and for the theoretical understanding of their strengths andweaknesses. In particular, on these synthetic benchmark networks we observethat existing graph convolution neural networks are subject to an sparsityissue and an ovefitting issue in practice, both of which are successfullyovercome by our BPGCN. Moreover, when combined with classic neural networkmethods, BPGCN yields extraordinary classification performances on somereal-world datasets that have never been achieved before.
What problem does this paper attempt to address?