GIDN: A Lightweight Graph Inception Diffusion Network for High-efficient Link Prediction

Zixiao Wang,Yuluo Guo,Jin Zhao,Yu Zhang,Hui Yu,Xiaofei Liao,Biao Wang,Ting Yu
2024-04-02
Abstract:In this paper, we propose a Graph Inception Diffusion Networks(GIDN) model. This model generalizes graph diffusion in different feature spaces, and uses the inception module to avoid the large amount of computations caused by complex network structures. We evaluate GIDN model on Open Graph Benchmark(OGB) datasets, reached an 11% higher performance than AGDN on ogbl-collab dataset.
Machine Learning,Social and Information Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the **efficient knowledge graph link prediction problem**. Specifically, the author proposes a model named Graph Inception Diffusion Network (GIDN), aiming to improve the efficiency and performance of the link prediction task through graph diffusion and Inception modules. ### Problem Background Knowledge Graphs (KGs) play an important role in many fields, such as natural language processing, deep learning, and graph processing. However, existing large - scale knowledge graphs still have a large number of missing entity relationships (i.e., valid triples). To make up for these missing parts, researchers have proposed Knowledge Graph Embedding (KGE), also known as Link Prediction. The goal of link prediction is to encode entities and relationships in the real world into low - dimensional vectors, so that they can be stored and calculated efficiently and contain semantic - level information. ### Shortcomings of Existing Methods 1. **High computational complexity**: Traditional Graph Neural Networks (GNNs) and other methods often require a large amount of computational resources when dealing with complex network structures. 2. **Excessive parameters**: Although some shallow embedding techniques and logical rule induction methods are effective, the model has too many parameters, resulting in a significant decrease in computational speed. 3. **Poor applicability**: Some heuristic methods have poor applicability to different types of networks, and it is difficult to find suitable heuristic algorithms for different types of networks. ### Innovations of the GIDN Model 1. **Graph diffusion operation**: GIDN represents the information of each prediction target in the graph by matrices, and uses small - hop nodes and learnable generalized weighted coefficients to achieve multi - layer generalized graph diffusion in different feature spaces while ensuring moderate complexity and running time. 2. **Inception module**: The Inception module is introduced to capture rich features and avoid the increase in computational complexity caused by too - deep networks, making it more suitable for training large - scale samples. 3. **Data augmentation**: Data augmentation is carried out by methods such as random walks to expand the dimension of training data and improve the generalization ability of the model. ### Experimental Results The author evaluated the GIDN model on the Open Graph Benchmark (OGB) dataset, especially its performance on the ogbl - collab dataset, achieving a 11% performance improvement over AGDN. The specific experimental results are shown in the following table: | Method | ogbl - collab (Hits@50) | | ------ | ----------------------- | | AGDN | 0.4480 ± 0.0542 | | PLNLP | 0.7059 ± 0.0029 | | GIDN | 0.7096 ± 0.0055 | In summary, the main purpose of this paper is to solve the problems of high computational complexity, excessive parameters, and poor applicability in existing link prediction methods by proposing the GIDN model, thereby improving the efficiency and performance of the link prediction task.