Interventional Causal Structure Discovery over Graphical Models with Convergence and Optimality Guarantees

Qiu Chengbo,Yang Kai
2024-08-09
Abstract:Learning causal structure from sampled data is a fundamental problem with applications in various fields, including healthcare, machine learning and artificial intelligence. Traditional methods predominantly rely on observational data, but there exist limits regarding the identifiability of causal structures with only observational data. Interventional data, on the other hand, helps establish a cause-and-effect relationship by breaking the influence of confounding variables. It remains to date under-explored to develop a mathematical framework that seamlessly integrates both observational and interventional data in causal structure learning. Furthermore, existing studies often focus on centralized approaches, necessitating the transfer of entire datasets to a single server, which lead to considerable communication overhead and heightened risks to privacy. To tackle these challenges, we develop a bilevel polynomial optimization (Bloom) framework. Bloom not only provides a powerful mathematical modeling framework, underpinned by theoretical support, for causal structure discovery from both interventional and observational data, but also aspires to an efficient causal discovery algorithm with convergence and optimality guarantees. We further extend Bloom to a distributed setting to reduce the communication overhead and mitigate data privacy risks. It is seen through experiments on both synthetic and real-world datasets that Bloom markedly surpasses other leading learning algorithms.
Machine Learning
What problem does this paper attempt to address?
The paper aims to address the problem of learning causal structures from sampled data, with a particular focus on how to seamlessly integrate observational data with interventional data to improve the accuracy and identifiability of causal structure recognition. Specifically, the paper attempts to solve the following key issues: 1. **Limitations of existing methods**: Traditional methods mainly rely on observational data, which has theoretical limitations in identifying the true causal graph (DAG). Additionally, existing continuous optimization methods (such as gradient descent-based methods) may get stuck in local optima or saddle points, leading to slow and unstable convergence. 2. **Integration of observational and interventional data**: Although some studies have started to combine interventional data to improve the effectiveness of causal structure learning, these methods are often overly complex or lack a unified framework to integrate these two types of data through a single optimization strategy. 3. **Lack of theoretical support**: Many existing works lack sufficient theoretical foundation, including guarantees on convergence and optimality. 4. **Problems with centralized methods**: Most existing methods adopt a centralized processing approach, which may lead to significant communication overhead and higher privacy risks. To address these issues, the authors propose a bi-level polynomial optimization framework (Bloom), which not only learns causal structures from both observational and interventional data but also provides an efficient causal discovery algorithm with guarantees on convergence and optimality. Additionally, the authors extend Bloom to a distributed setting to reduce communication overhead and mitigate data privacy risks. Experimental validation shows that Bloom significantly outperforms other leading algorithms on both synthetic and real-world datasets.