Incorporating Retrieval-based Causal Learning with Information Bottlenecks for Interpretable Graph Neural Networks

Jiahua Rao,Jiancong Xie,Hanjing Lin,Shuangjia Zheng,Zhen Wang,Yuedong Yang
2024-02-07
Abstract:Graph Neural Networks (GNNs) have gained considerable traction for their capability to effectively process topological data, yet their interpretability remains a critical concern. Current interpretation methods are dominated by post-hoc explanations to provide a transparent and intuitive understanding of GNNs. However, they have limited performance in interpreting complicated subgraphs and can't utilize the explanation to advance GNN predictions. On the other hand, transparent GNN models are proposed to capture critical subgraphs. While such methods could improve GNN predictions, they usually don't perform well on explanations. Thus, it is desired for a new strategy to better couple GNN explanation and prediction. In this study, we have developed a novel interpretable causal GNN framework that incorporates retrieval-based causal learning with Graph Information Bottleneck (GIB) theory. The framework could semi-parametrically retrieve crucial subgraphs detected by GIB and compress the explanatory subgraphs via a causal module. The framework was demonstrated to consistently outperform state-of-the-art methods, and to achieve 32.71\% higher precision on real-world explanation scenarios with diverse explanation types. More importantly, the learned explanations were shown able to also improve GNN prediction performance.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the contradiction between the interpretability and prediction performance of graph neural networks (GNNs). Specifically, current interpretation methods are mainly divided into two categories: post - hoc and transparent models. Although post - hoc interpretation methods can provide a certain degree of transparency and intuitive understanding, they perform poorly in explaining complex sub - graphs and cannot use the interpretations to improve the prediction performance of GNNs. While transparent models can improve the prediction performance of GNNs by capturing key sub - graphs, their interpretation performance is usually not satisfactory. Therefore, the goal of the paper is to develop a new framework that can better combine the interpretation and prediction of GNNs, thereby improving the performance of both simultaneously. To achieve this goal, the authors propose a novel interpretable causal GNN framework (RC - GNN), which combines retrieval - based causal learning and the graph information bottleneck (GIB) theory. By introducing two new modules - sub - graph retrieval and causal graph learning, RC - GNN can improve the prediction performance of GNNs while maintaining interpretability. The experimental results show that RC - GNN has better consistency on multiple benchmark datasets than the existing state - of - the - art methods. In particular, in real - world application scenarios, the interpretation performance has increased by 32.71% in precision, and these interpretations in turn also help to improve the performance of classification tasks.