Normalizing flows for conditional independence testing

Bao Duong,Thin Nguyen
DOI: https://doi.org/10.1007/s10115-023-01964-w
IF: 2.7
2023-08-28
Knowledge and Information Systems
Abstract:Abstract Detecting conditional independencies plays a key role in several statistical and machine learning tasks, especially in causal discovery algorithms, yet it remains a highly challenging problem due to dimensionality and complex relationships presented in data. In this study, we introduce LCIT (Latent representation-based Conditional Independence Test)—a novel method for conditional independence testing based on representation learning. Our main contribution involves a hypothesis testing framework in which to test for the independence between X and Y given Z , we first learn to infer the latent representations of target variables X and Y that contain no information about the conditioning variable Z . The latent variables are then investigated for any significant remaining dependencies, which can be performed using a conventional correlation test. Moreover, LCIT can also handle discrete and mixed-type data in general by converting discrete variables into the continuous domain via variational dequantization. The empirical evaluations show that LCIT outperforms several state-of-the-art baselines consistently under different evaluation metrics, and is able to adapt really well to both nonlinear, high-dimensional, and mixed data settings on a diverse collection of synthetic and real data sets.
computer science, information systems, artificial intelligence
What problem does this paper attempt to address?