Coresets for fast causal discovery with the additive noise model

Boxiang Zhao,Shuliang Wang,Lianhua Chi,Hanning Yuan,Ye Yuan,Qi Li,Jing Geng,Shao-Liang Zhang
DOI: https://doi.org/10.1016/j.patcog.2023.110149
IF: 8
2023-11-29
Pattern Recognition
Abstract:Causal discovery reveals the true causal relationships behind data and discovering causal relationships from observed data is a particularly challenging problem, especially in large-scale datasets. The functional causal model is an effective method for causal discovery, but its time efficiency cannot be guaranteed. How to efficiently apply it to massive data still needs to be solved. In this paper, we propose a coreset construction for the additive noise model to accelerate causal discovery. According to the asymmetry characteristic of causality, samples were assigned different weights to construct the coreset. With the constructed coreset, we propose a Fast causal discovery algorithm based on the Additive Noise Model (FANM) to improve the time efficiency of the functional causal model while ensuring the result performance of causal discovery. Experiments on synthetic data and real-world data show that our proposed algorithm is much more time-efficient than the methods based on the functional causal model, and the runtime of FANM remains consistent as sample size increases while maintaining or exceeding the accuracy of the original nonlinear additive noise model.
computer science, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?