Geometric instability of graph neural networks on large graphs

Emily Morris, Haotian Shen, Weiling Du, Muhammad Hamza Sajjad, Borun Shi
2023-11-28
Abstract: We analyse the geometric instability of embeddings produced by graph neural networks (GNNs). Existing methods are only applicable for small graphs and lack context in the graph domain. We propose a simple, efficient and graph-native Graph Gram Index (GGI) to measure such instability which is invariant to permutation, orthogonal transformation, translation and order of evaluation. This allows us to study the varying instability behaviour of GNN embeddings on large graphs for both node classification and link prediction.
Machine Learning, Social and Information Networks
What problem does this paper attempt to address?
This paper addresses the geometric instability of graph neural networks (GNNs) on large graphs. Specifically, the authors study how stable the embeddings generated by GNNs are across different configurations, and propose a simple, efficient and graph-native method to measure this instability.

### Core Problems of the Paper

1. **Limitations of Existing Methods**:
   - Existing stability evaluation methods are mainly applicable to small graphs and lack context in the graph domain.
   - These methods borrow stability indices directly from other fields (such as NLP and topological analysis), and the borrowed indices are not fully applicable to graph data.
2. **Research Objectives**:
   - Propose a new geometric stability index, the Graph Gram Index (GGI), to measure the instability of GNN embeddings.
   - The index should have low time complexity and low space complexity, require no hyperparameters, and be intuitive.
   - Study the geometric instability of GNN embeddings on large graphs, especially in node classification and link prediction tasks.

### Main Contributions

1. **Formal Definition of Geometric Stability Index**:
   - In the second part of the paper, the authors formalize the geometric stability index of embeddings and review the existing methods.
2. **Propose Graph Gram Index (GGI)**:
   - In the third part, the authors propose a simple, efficient and graph-native stability index, the GGI. It has the following properties:
     - It is invariant to node permutation, orthogonal transformation, translation and evaluation order.
     - It avoids solving the Procrustes alignment problem and compares Gram matrices directly.
     - It has low time and space complexity and is suitable for large graphs.
3. **Experimental Verification**:
   - In the fourth part, the authors show experimentally how the geometric stability of several popular GNN models (GCN, GraphSAGE, GAT, GIN) behaves on graphs of different scales and homogeneity levels.
   - The experimental results show that GGI can effectively measure the stability of GNN embeddings and correlates to some degree with test accuracy.

### Formula Presentation

- **Gram Matrix**:
  \[ Z_l Z_l^T \]
  where \( Z_l \in \mathbb{R}^{|V| \times d} \) is the embedding matrix of all nodes under the \( l \)-th configuration.
- **GGI Calculation Formula**:
  \[ s_l = \frac{1}{2|E|} \sum_{(i,j) \in V \times V} S_l[i, j] \]
  where \( S_l = A \circ (Z_l Z_l^T) \), \( A \) is the adjacency matrix, and \( \circ \) denotes the Hadamard product.
- **Final Stability Index**:
  \[ s = \text{std}(s_l : l \in [1, \ldots, N]) \]

Together, these definitions give a stability measure that can be computed on large graphs, where existing methods are difficult to apply, and so provide a more practical tool for evaluating the stability of GNN embeddings.
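
The formulas above can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. It assumes an undirected graph without self-loops, given as an edge list with each edge listed once, so that \( A \) is symmetric and binary and the normalised sum over \( A \circ (Z_l Z_l^T) \) reduces to the mean dot product between the embeddings of adjacent nodes. The function names `ggi_score`, `ggi_score_dense` and `ggi` are hypothetical, and the use of the population standard deviation (NumPy's default) is also an assumption.

```python
import numpy as np


def ggi_score(edges, Z):
    """Per-configuration score s_l, computed from an edge list.

    edges: integer array of shape (|E|, 2); each undirected edge appears once.
    Z:     float array of shape (|V|, d); row i is the embedding of node i.
    """
    edges = np.asarray(edges)
    src, dst = edges[:, 0], edges[:, 1]
    # Row-wise dot products z_i . z_j for every edge (i, j).
    dots = np.einsum("ij,ij->i", Z[src], Z[dst])
    # A is symmetric, so the double sum counts every edge twice;
    # dividing by 2|E| therefore yields the mean over edges.
    return float(dots.mean())


def ggi_score_dense(edges, Z):
    """Same score via the dense formula S_l = A o (Z Z^T); small graphs only."""
    edges = np.asarray(edges)
    n = Z.shape[0]
    A = np.zeros((n, n))
    A[edges[:, 0], edges[:, 1]] = 1.0
    A[edges[:, 1], edges[:, 0]] = 1.0
    S = A * (Z @ Z.T)  # Hadamard product with the Gram matrix
    return float(S.sum() / (2 * len(edges)))


def ggi(edges, embeddings):
    """Final index s: standard deviation of s_l over N configurations."""
    scores = [ggi_score(edges, Z) for Z in embeddings]
    return float(np.std(scores))  # assumption: population std (ddof=0)


# Example: embeddings from several hypothetical training runs.
rng = np.random.default_rng(0)
example_edges = np.array([[0, 1], [1, 2], [2, 3], [0, 3]])
example_runs = [rng.normal(size=(4, 8)) for _ in range(5)]
print(ggi(example_edges, example_runs))
```

The edge-list version never materialises the \( |V| \times |V| \) Gram matrix, so under these assumptions it costs \( O(|E| d) \) time per configuration. Because the score depends on the embeddings only through dot products, multiplying \( Z_l \) by an orthogonal matrix leaves \( s_l \) unchanged.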