Abstract:Graph Neural Networks (GNNs) are proposed without considering the agnostic distribution shifts between training graphs and testing graphs, inducing the degeneration of the generalization ability of GNNs in Out-Of-Distribution (OOD) settings. The fundamental reason for such degeneration is that most GNNs are developed based on the I.I.D hypothesis. In such a setting, GNNs tend to exploit subtle statistical correlations existing in the training set for predictions, even though it is a spurious correlation. This learning mechanism inherits from the common characteristics of machine learning approaches. However, such spurious correlations may change in the wild testing environments, leading to the failure of GNNs. Therefore, eliminating the impact of spurious correlations is crucial for stable GNN models. To this end, in this paper, we argue that the spurious correlation exists among subgraph-level units and analyze the degeneration of GNN in causal view. Based on the causal view analysis, we propose a general causal representation framework for stable GNN, called StableGNN. The main idea of this framework is to extract high-level representations from raw graph data first and resort to the distinguishing ability of causal inference to help the model get rid of spurious correlations. Particularly, to extract meaningful high-level representations, we exploit a differentiable graph pooling layer to extract subgraph-based representations by an end-to-end manner. Furthermore, inspired by the confounder balancing techniques from causal inference, based on the learned high-level representations, we propose a causal variable distinguishing regularizer to correct the biased training distribution by learning a set of sample weights. Hence, GNNs would concentrate more on the true connection between discriminative substructures and labels. Extensive experiments are conducted on both synthetic datasets with various distribution shift degrees and eight real-world OOD graph datasets. The results well verify that the proposed model StableGNN not only outperforms the state-of-the-arts but also provides a flexible framework to enhance existing GNNs. In addition, the interpretability experiments validate that StableGNN could leverage causal structures for predictions.

Rethinking Causal Relationships Learning in Graph Neural Networks

Towards Human-like Perception: Learning Structural Causal Model in Heterogeneous Graph

Introducing Diminutive Causal Structure into Graph Representation Learning

When Graph Neural Network Meets Causality: Opportunities, Methodologies and An Outlook

Towards Causal Classification: A Comprehensive Study on Graph Neural Networks

A Twist for Graph Classification: Optimizing Causal Information Flow in Graph Neural Networks

Relating Graph Neural Networks to Structural Causal Models

Graph Neural Network Causal Explanation via Neural Causal Models

Causality-based CTR Prediction using Graph Neural Networks

Heterophilic Graph Neural Networks Optimization with Causal Message-passing

Invariant Graph Learning for Causal Effect Estimation

Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method

Causal GNNs: A GNN-Driven Instrumental Variable Approach for Causal Inference in Networks

A graph neural network framework for causal inference in brain networks

Neural Causal Graph Collaborative Filtering

Incorporating Retrieval-based Causal Learning with Information Bottlenecks for Interpretable Graph Neural Networks

Towards Fair Graph Neural Networks via Graph Counterfactual

Causal Attention for Interpretable and Generalizable Graph Classification

Generalizing Graph Neural Networks on Out-of-Distribution Graphs

CausalGNN: Causal-Based Graph Neural Networks for Spatio-Temporal Epidemic Forecasting