Every node counts: Self-ensembling graph convolutional networks for semi-supervised learning

Yawei Luo,Rongrong Ji,Tao Guan,Junqing Yu,Ping Liu,Yi Yang
DOI: https://doi.org/10.1016/j.patcog.2020.107451
IF: 8
2020-10-01
Pattern Recognition
Abstract:<p>Graph convolutional network (GCN) provides a powerful means for graph-based semi-supervised tasks. However, as a localized first-order approximation of spectral graph convolution, the classic GCN can not take full advantage of unlabeled data, especially when the unlabeled node is far from labeled ones. To capitalize on the information from unlabeled nodes to boost the training for GCN, we propose a novel framework named Self-Ensembling GCN (SEGCN), which marries GCN with Mean Teacher – a powerful self-ensemble learning mechanism for semi-supervised task. SEGCN contains a student model and a teacher model. As a student, it not only learns to correctly classify the labeled nodes, but also tries to be consistent with the teacher on unlabeled nodes in more challenging situations, such as a high dropout rate and graph corrosion. As a teacher, it averages the student model weights and generates more accurate predictions to lead the student. In such a mutual-promoting process, both labeled and unlabeled samples can be fully utilized for backpropagating effective gradients to train GCN. In a variety of semi-supervised classification benchmarks, <em>i.e.</em> Citeseer, Cora, Pubmed and NELL, we validate that the proposed method matches the state of the arts in the classification accuracy. The code is publicly available at <a href="https://github.com/RoyalVane/SEGCN">https://github.com/RoyalVane/SEGCN</a>.</p>
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to use the information of unlabeled nodes more effectively to improve classification performance in semi - supervised learning tasks of graph - structured data. Specifically, when the traditional Graph Convolutional Network (GCN) processes graph - structured data, although it can propagate label information through the local aggregation mechanism, it does not make full use of the information of unlabeled nodes far from the labeled nodes. This is mainly because a K - layer GCN can only capture the information of K - hop neighbors, and the propagation effect of information for more distant nodes is limited. In addition, a deep - layer GCN may lead to the over - smoothing problem, making nodes from different clusters difficult to distinguish. To solve the above problems, the paper proposes a new framework named Self - Ensembling Graph Convolutional Network (SEGCN). SEGCN combines the classic GCN and the Mean Teacher method. By introducing the teacher - student model mechanism, it not only utilizes the supervised information of labeled nodes but also makes full use of the information of unlabeled nodes through consistency loss. Specifically, the student model in SEGCN is trained in a high - noise environment, while the teacher model generates more accurate predictions by averaging the weights of the student model. This mutually promoting process enables unlabeled nodes to also provide effective gradient information, thereby improving the classification performance of the entire model. The main contributions of the paper include: 1. **Combining Mean Teacher and GCN**: For the first time, the Mean Teacher strategy is introduced into GCN design, emphasizing the importance of using unlabeled nodes in graph - structured data in the context of semi - supervised learning. 2. **Designing a new perturbation strategy**: According to the characteristics of graph - structured data, a new perturbation strategy is designed, including graph - structure erosion (removing edges) and model dropout, to generate inconsistent predictions. 3. **Experimental verification**: Experiments are carried out on four node - classification benchmark datasets (Citeseer, Cora, Pubmed, and NELL), and the results show that the classification accuracy of SEGCN is comparable to or better than that of the existing state - of - the - art methods. Through these innovations, SEGCN can use the information of unlabeled nodes more effectively in semi - supervised learning tasks of graph - structured data, thereby improving classification performance.