Abstract:Conditional independence (CI) testing is an important problem, especially in causal discovery. Most testing methods assume that all variables are fully observable and then test the CI among the observed data. Such an assumption is often untenable beyond applications dealing with, e.g., psychological analysis about the mental health status and medical diagnosing (researchers need to consider the existence of latent variables in these scenarios); and typically adopted latent CI test schemes mainly suffer from robust or efficient issues. Accordingly, this article investigates the problem of testing CI between latent variables. To this end, we offer an auxiliary regression-based CI (AReCI) test by taking the measured variable as the surrogate variable of the latent variables to conduct the regression over the latent variables under the linear causal models, in which each latent variable has some certain measured variables. Specifically, given a pair of latent variables LX and LY , and a corresponding latent variable set LO , [Formula: see text] holds if and only if [Formula: see text] and [Formula: see text] are statistically independent, where A' and A'' are the two disjoint subset of the measured variable for the corresponding latent variables, A'{LO} ∩A''{LO} = ∅ , and ω1 is a parameter vector characterized from the cross covariance between A{LX} and A'{LO} , and ω2 is a parameter vector characterized from the cross covariance between A{LY} and A''{LO} . We theoretically show that the AReCI test is capable of addressing both Gaussian and non-Gaussian data. In addition, we find that the well-known partial correlation test can be seen as a special case of the AReCI test. Finally, we devise a causal discovery method by using the AReCI test as the CI test. The experimental results on synthetic and real-world data illustrate the effectiveness of our method.

Normalizing flows for conditional independence testing

Learning Cluster Causal Diagrams: an Information-Theoretic Approach

Testing Conditional Independence Between Latent Variables by Independence Residuals

Causal Discovery Using Regression-Based Conditional Independence Tests

Non-parametric Conditional Independence Testing for Mixed Continuous-Categorical Variables: A Novel Method and Numerical Evaluation

Measuring Conditional Independence by Independent Residuals for Causal Discovery

A Conditional Independence Test in the Presence of Discretization

Measuring Conditional Independence By Independent Residuals: Theoretical Results And Application In Causal Discovery

Conditional Independence Test Based on Residual Similarity

Recursively Learning Causal Structures Using Regression-based Conditional Independence Test

A Distribution Free Conditional Independence Test with Applications to Causal Discovery

Effective and Scalable Causal Partitioning Based on Low-Order Conditional Independent Tests.

Testing Independence Between Linear Combinations for Causal Discovery

Causal Discovery via Conditional Independence Testing with Proxy Variables

Latent Causal Invariant Model

Kernel-based independence tests for causal structure learning on functional data

Conditional independence testing based on a nearest-neighbor estimator of conditional mutual information

Learning Causal Structures Based on Divide and Conquer

Model-Powered Conditional Independence Test

Recovering Latent Causal Factor for Generalization to Distributional Shifts.

Reinterpreting causal discovery as the task of predicting unobserved joint statistics