Abstract: Graphs, such as citation networks, social networks, and transportation networks, are prevalent in the real world. Graph Neural Networks (GNNs) have gained widespread attention for their strong expressiveness and exceptional performance in various graph applications. However, the efficacy of GNNs relies heavily on sufficient labeled data and complex network models: the former are hard to obtain and the latter are costly to compute. To address the scarcity of labeled data and the high complexity of GNNs, Knowledge Distillation (KD) has been introduced to enhance existing GNNs. This technique transfers the soft-label supervision of a large teacher model to a small student model while maintaining prediction performance. This survey offers a comprehensive overview of Graph-based Knowledge Distillation methods, systematically categorizing and summarizing them while discussing their limitations and future directions. The paper first introduces the background of graphs and KD. It then summarizes three types of Graph-based Knowledge Distillation methods, namely Graph-based Knowledge Distillation for deep neural networks (DKD), Graph-based Knowledge Distillation for GNNs (GKD), and Self-Knowledge Distillation based Graph-based Knowledge Distillation (SKD). Each type is further divided into knowledge distillation methods based on the output layer, middle layer, and constructed graph. Subsequently, the ideas behind the various algorithms are analyzed and compared, and the advantages and disadvantages of each algorithm are summarized with support from experimental results. In addition, applications of graph-based knowledge distillation in CV, NLP, RS, and other fields are listed. Finally, graph-based knowledge distillation is summarized and its future prospects are discussed. We have also released related resources at https://github.com/liujing1023/Graph-based-Knowledge-Distillation.
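
The soft-label transfer mentioned in the abstract can be illustrated with a minimal sketch. It assumes the commonly used formulation in which the student is trained to match the teacher's temperature-softened class distribution through a KL divergence scaled by T^2; the abstract does not specify a loss, so the function names, the temperature value, and the example logits below are all hypothetical.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def soft_label_kd_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by T^2.

    Assumption: this is the widely used soft-label distillation loss,
    not necessarily the loss of any specific method in the survey.
    """
    p = softmax(teacher_logits, temperature)  # teacher soft labels
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl


# Hypothetical logits for a single node with three classes.
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # disagrees with the teacher

loss_close = soft_label_kd_loss(teacher, close_student)
loss_far = soft_label_kd_loss(teacher, far_student)
```

In this sketch a student whose logits agree with the teacher's incurs a smaller loss than one that disagrees, and the loss is zero when the two distributions coincide. In practice this term is usually combined with a supervised loss on whatever ground-truth labels are available.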

Reliable Data Distillation on Graph Convolutional Network

GKD: Semi-supervised Graph Knowledge Distillation for Graph-Independent Inference

Distilling Knowledge from Graph Convolutional Networks

Online Adversarial Knowledge Distillation for Graph Neural Networks

Every node counts: Self-ensembling graph convolutional networks for semi-supervised learning

On Representation Knowledge Distillation for Graph Neural Networks

Attention is all you need for boosting graph convolutional neural network

A Safe Semi-supervised Graph Convolution Network

Deeper Insights Into Graph Convolutional Networks for Semi-Supervised Learning

On Self-Distilling Graph Neural Network

Online Adversarial Distillation for Graph Neural Networks

Dynamic Graph Learning Convolutional Networks for Semi-supervised Classification

Select and Calibrate the Low-confidence: Dual-Channel Consistency Based Graph Convolutional Networks

Graph-based Knowledge Distillation: A survey and experimental evaluation

Self-supervised Training of Graph Convolutional Networks

A Teacher-Free Graph Knowledge Distillation Framework with Dual Self-Distillation

Online Cross-Layer Knowledge Distillation on Graph Neural Networks with Deep Supervision

Graph Neural Diffusion Networks for Semi-supervised Learning

Expanding Label Sets for Graph Convolutional Networks

Robust graph learning with graph convolutional network

Multi-view Graph Convolutional Networks with Differentiable Node Selection