Rethinking Causal Relationships Learning in Graph Neural Networks

Hang Gao, Chengyu Yao, Jiangmeng Li, Lingyu Si, Yifan Jin, Fengge Wu, Changwen Zheng, Huaping Liu
2023-12-15
Abstract: Graph Neural Networks (GNNs) demonstrate their significance by effectively modeling complex interrelationships within graph-structured data. To enhance the credibility and robustness of GNNs, it becomes exceptionally crucial to bolster their ability to capture causal relationships. However, despite recent advancements that have indeed strengthened GNNs from a causal learning perspective, conducting an in-depth analysis specifically targeting the causal modeling prowess of GNNs remains an unresolved issue. In order to comprehensively analyze various GNN models from a causal learning perspective, we constructed an artificially synthesized dataset with known and controllable causal relationships between data and labels. The rationality of the generated data is further ensured through theoretical foundations. Drawing insights from analyses conducted using our dataset, we introduce a lightweight and highly adaptable GNN module designed to strengthen GNNs' causal learning capabilities across a diverse range of tasks. Through a series of experiments conducted on both synthetic datasets and other real-world datasets, we empirically validate the effectiveness of the proposed module.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses a deficiency of Graph Neural Networks (GNNs): their limited ability to model causal relationships. GNNs perform well on graph-structured data, but they usually capture statistical associations rather than causal ones, which can undermine their reliability and robustness on complex graph data. Improving their causal modeling ability, so that they more accurately capture the causal connections between data and labels, has therefore become an important research direction. The paper points out that existing methods try to strengthen causal modeling in GNNs by eliminating confounding factors in graph data, yet an in-depth analysis of how well GNNs actually model causal relationships is still missing. To analyze the causal learning abilities of various GNN models comprehensively, the authors construct an artificially synthesized dataset (the CRCG dataset) in which the causal relationships between data and labels are known and controllable, and they support the rationality of the generated data with theoretical analysis. Based on their analysis of the CRCG dataset, the authors propose a lightweight and highly adaptable GNN module intended to improve the ability of GNNs to capture causal relationships across different tasks. They validate the module's effectiveness through experiments on both synthetic and real-world datasets.
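To make the idea of "known and controllable causal relationships" concrete, here is a minimal toy sketch in Python. It is not the CRCG construction, whose details are not given here; every name and parameter (`make_graph`, `has_motif`, `make_dataset`, `bias`, `n_base`, `p_edge`) is a hypothetical chosen for illustration. The sketch assumes a binary label that is fully determined by one structural motif (a closed triangle), plus a separate spurious feature whose agreement with the label is controlled by `bias`.

```python
import random

def make_graph(label, bias, rng, n_base=6, p_edge=0.3):
    """Build one toy graph; returns (edges, spurious_feature, label).

    Illustrative assumption: the label is caused only by whether the
    motif nodes a, b, c form a closed triangle.
    """
    # Non-causal background: random edges among base nodes 0..n_base-1.
    edges = {(i, j) for i in range(n_base) for j in range(i + 1, n_base)
             if rng.random() < p_edge}
    # Motif nodes, attached to the background through node 0.
    a, b, c = n_base, n_base + 1, n_base + 2
    edges |= {(0, a), (a, b), (b, c)}
    if label == 1:
        edges.add((a, c))  # closing the triangle is the causal signal
    # Spurious feature: agrees with the label with probability `bias`.
    spurious = label if rng.random() < bias else 1 - label
    return sorted(edges), spurious, label

def has_motif(edges, n_base=6):
    """True if the causal edge (a, c) that closes the triangle is present."""
    return (n_base, n_base + 2) in edges

def make_dataset(n, bias, seed=0):
    """Generate n toy graphs with uniformly random binary labels."""
    rng = random.Random(seed)
    return [make_graph(rng.randint(0, 1), bias, rng) for _ in range(n)]
```

One way to use such a generator is to train a model on data with a high `bias` and evaluate it on data with a low `bias`: a model that relies on the causal motif should be unaffected by the shift, while one that relies on the spurious feature should degrade.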