Abstract:Kernel embeddings have emerged as a powerful tool for representing probability measures in a variety of statistical inference problems. By mapping probability measures into a reproducing kernel Hilbert space (RKHS), kernel embeddings enable flexible representations of complex relationships between variables. They serve as a mechanism for efficiently transferring the representation of a distribution downstream to other tasks, such as hypothesis testing or causal effect estimation. In the context of causal inference, the main challenges include identifying causal associations and estimating the average treatment effect from observational data, where confounding variables may obscure direct cause-and-effect relationships. Kernel embeddings provide a robust nonparametric framework for addressing these challenges. They allow for the representations of distributions of observational data and their seamless transformation into representations of interventional distributions to estimate relevant causal quantities. We overview recent research that leverages the expressiveness of kernel embeddings in tandem with causal inference.
What problem does this paper attempt to address?
The core problem that this paper attempts to solve is **how to use the kernel embeddings method to identify causal associations and estimate the Average Treatment Effect (ATE) in causal inference**, especially in the presence of unobserved confounding variables. Specifically, the paper explores the following issues:
1. **Representation of complex dependencies**: Traditional parametric methods perform poorly when dealing with complex dependencies, while kernel embeddings provide a non - parametric and flexible framework that can better represent and manipulate probability measures.
2. **Identification of causal associations**: How to identify causal relationships from observational data, especially in the presence of confounding variables. Kernel embeddings provide powerful tools to estimate causal quantities by transforming the observational distribution into the interventional distribution.
3. **Estimation of the Average Treatment Effect**: How to estimate the Average Treatment Effect (ATE), that is, the expected difference in the outcome variable under different treatment conditions, from observational data. Kernel embeddings allow for the seamless transformation of the distribution of observational data into the interventional distribution, thereby directly estimating these causal quantities.
4. **Dealing with unobserved confounding variables**: In many practical applications, confounding variables may be unobserved, which makes causal inference more difficult. The paper explores how to use the kernel embeddings method to meet this challenge.
### Main contributions of the paper
The main contributions of the paper lie in summarizing the research progress in using kernel embeddings for causal inference in recent years, and focusing on the following aspects:
- **Basic concepts of kernel embeddings**: Including Reproducing Kernel Hilbert Space (RKHS), Maximum Mean Discrepancy (MMD), Conditional Mean Embedding (CME), etc.
- **Applications of the Conditional Mean Operator (CMO) and the De - conditional Mean Operator (DMO)**: The roles of these operators in causal inference, especially their advantages in dealing with complex dependencies and unobserved confounding variables.
- **Bayesian kernel embeddings**: How to quantify and represent the uncertainty of unknown probability measures in kernel embeddings, which is crucial for downstream tasks such as active learning or Bayesian optimization.
- **Causal graph models and do - operators**: Expressing causal relationships through graphical models (such as DAG) and do - operators, and how to combine kernel embeddings for causal inference.
- **Estimation of causal effects in different scenarios**: Including the estimation of the Distributional Treatment Effect (DTE) in scenarios such as back - door adjustment, front - door adjustment, instrumental variables, and proxy variables.
### Conclusion
Through the kernel embeddings method, researchers can more accurately identify causal relationships and estimate causal effects in complex real - world datasets. The non - parametric framework provided by kernel embeddings not only bypasses the restrictive assumptions of traditional methods but also provides strong support for dealing with complex dependencies and unobserved confounding variables.