Graph Neural Network Causal Explanation via Neural Causal Models

Arman Behnam,Binghui Wang
2024-07-12
Abstract:Graph neural network (GNN) explainers identify the important subgraph that ensures the prediction for a given graph. Until now, almost all GNN explainers are based on association, which is prone to spurious correlations. We propose {\name}, a GNN causal explainer via causal inference. Our explainer is based on the observation that a graph often consists of a causal underlying subgraph. {\name} includes three main steps: 1) It builds causal structure and the corresponding structural causal model (SCM) for a graph, which enables the cause-effect calculation among nodes. 2) Directly calculating the cause-effect in real-world graphs is computationally challenging. It is then enlightened by the recent neural causal model (NCM), a special type of SCM that is trainable, and design customized NCMs for GNNs. By training these GNN NCMs, the cause-effect can be easily calculated. 3) It uncovers the subgraph that causally explains the GNN predictions via the optimized GNN-NCMs. Evaluation results on multiple synthetic and real-world graphs validate that {\name} significantly outperforms existing GNN explainers in exact groundtruth explanation identification
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper attempts to address the issue of causality in Graph Neural Network (GNN) interpreters. Specifically, most existing GNN interpreters are based on association, and these methods are susceptible to spurious correlations in the training data, leading to inaccurate interpretation results. Therefore, the authors propose a new GNN causal interpreter—CXGNN, which identifies causal subgraphs in the graph through causal inference to more accurately interpret the prediction results of GNNs. ### Main Contributions 1. **Proposed the first GNN causal interpreter**: CXGNN is the first method to use causal inference to interpret GNN prediction results. 2. **Utilized neural causal models**: The authors designed a GNN neural causal model (GNN-NCM) and trained these models to identify causal explanation subgraphs. 3. **Significantly outperforms existing methods**: Experimental results show that CXGNN significantly outperforms existing association-based and causally inspired GNN interpreters on multiple synthetic and real-world graph datasets, especially in accurately identifying true explanation subgraphs. ### Method Overview 1. **Define causal structure**: First, for a given graph \( G \), define its causal structure \( G \) and the corresponding Structural Causal Model (SCM), which allows for the calculation of causal relationships between nodes through interventions. 2. **Construct and train GNN-NCM**: Due to the high computational complexity of directly calculating causal effects in real graphs, the authors borrowed from recent neural causal models (NCM) to construct a trainable GNN-NCM. By training these models, causal effects can be easily calculated. 3. **Identify causal explanation subgraphs**: Finally, based on the trained GNN-NCM, determine the subgraph \( \Gamma \) that can causally explain the GNN prediction results. ### Experimental Results The authors conducted experiments on multiple synthetic and real-world graph datasets, including six synthetic datasets (such as BA+House, BA+Grid, etc.) and two real-world datasets (such as Benzene, Fluoride carbonyl). The experimental results show: - **Explanation accuracy**: CXGNN is comparable to or slightly inferior to causally inspired methods in terms of explanation accuracy but significantly outperforms association-based methods. - **Explanation recall**: CXGNN performs well in terms of explanation recall, achieving 100% on some datasets. - **True match accuracy**: CXGNN significantly outperforms all comparison methods in terms of true match accuracy, indicating that it can more accurately identify true explanation subgraphs. ### Conclusion By introducing causal inference, CXGNN can avoid the influence of spurious correlations in GNN interpretation and more accurately reveal the causal relationship between explanation subgraphs and graph labels. This method provides a new direction for the study of GNN interpretability.