Abstract:Kernel embeddings have emerged as a powerful tool for representing probability measures in a variety of statistical inference problems. By mapping probability measures into a reproducing kernel Hilbert space (RKHS), kernel embeddings enable flexible representations of complex relationships between variables. They serve as a mechanism for efficiently transferring the representation of a distribution downstream to other tasks, such as hypothesis testing or causal effect estimation. In the context of causal inference, the main challenges include identifying causal associations and estimating the average treatment effect from observational data, where confounding variables may obscure direct cause-and-effect relationships. Kernel embeddings provide a robust nonparametric framework for addressing these challenges. They allow for the representations of distributions of observational data and their seamless transformation into representations of interventional distributions to estimate relevant causal quantities. We overview recent research that leverages the expressiveness of kernel embeddings in tandem with causal inference.

What problem does this paper attempt to address?

The core problem that this paper attempts to solve is **how to use the kernel embeddings method to identify causal associations and estimate the Average Treatment Effect (ATE) in causal inference**, especially in the presence of unobserved confounding variables. Specifically, the paper explores the following issues: 1. **Representation of complex dependencies**: Traditional parametric methods perform poorly when dealing with complex dependencies, while kernel embeddings provide a non - parametric and flexible framework that can better represent and manipulate probability measures. 2. **Identification of causal associations**: How to identify causal relationships from observational data, especially in the presence of confounding variables. Kernel embeddings provide powerful tools to estimate causal quantities by transforming the observational distribution into the interventional distribution. 3. **Estimation of the Average Treatment Effect**: How to estimate the Average Treatment Effect (ATE), that is, the expected difference in the outcome variable under different treatment conditions, from observational data. Kernel embeddings allow for the seamless transformation of the distribution of observational data into the interventional distribution, thereby directly estimating these causal quantities. 4. **Dealing with unobserved confounding variables**: In many practical applications, confounding variables may be unobserved, which makes causal inference more difficult. The paper explores how to use the kernel embeddings method to meet this challenge. ### Main contributions of the paper The main contributions of the paper lie in summarizing the research progress in using kernel embeddings for causal inference in recent years, and focusing on the following aspects: - **Basic concepts of kernel embeddings**: Including Reproducing Kernel Hilbert Space (RKHS), Maximum Mean Discrepancy (MMD), Conditional Mean Embedding (CME), etc. - **Applications of the Conditional Mean Operator (CMO) and the De - conditional Mean Operator (DMO)**: The roles of these operators in causal inference, especially their advantages in dealing with complex dependencies and unobserved confounding variables. - **Bayesian kernel embeddings**: How to quantify and represent the uncertainty of unknown probability measures in kernel embeddings, which is crucial for downstream tasks such as active learning or Bayesian optimization. - **Causal graph models and do - operators**: Expressing causal relationships through graphical models (such as DAG) and do - operators, and how to combine kernel embeddings for causal inference. - **Estimation of causal effects in different scenarios**: Including the estimation of the Distributional Treatment Effect (DTE) in scenarios such as back - door adjustment, front - door adjustment, instrumental variables, and proxy variables. ### Conclusion Through the kernel embeddings method, researchers can more accurately identify causal relationships and estimate causal effects in complex real - world datasets. The non - parametric framework provided by kernel embeddings not only bypasses the restrictive assumptions of traditional methods but also provides strong support for dealing with complex dependencies and unobserved confounding variables.

An Overview of Causal Inference using Kernel Embeddings

Causal Discovery by Kernel Deviance Measures with Heterogeneous Transforms

Doubly Robust Kernel Statistics for Testing Distributional Treatment Effects

Kernel-based independence tests for causal structure learning on functional data

Kernel Methods for Causal Functions: Dose, Heterogeneous, and Incremental Response Curves

Causal Inference in Geosciences with Kernel Sensitivity Maps

Neural Causal Abstractions

CauseKG: A Framework Enhancing Causal Inference With Implicit Knowledge Deduced From Knowledge Graphs

Causal Inference Meets Machine Learning

CausE: Towards Causal Knowledge Graph Embedding

Addressing Dynamic and Sparse Qualitative Data: A Hilbert Space Embedding of Categorical Variables

Kernel Mean Embedding of Probability Measures and its Applications to Functional Data Analysis

Sequential Kernel Embedding for Mediated and Time-Varying Dose Response Curves

Explaining the Behavior of Black-Box Prediction Algorithms with Causal Learning

Statistical Approaches for Causal Inference

Embedding-based statistical inference on generative models

Geodesic Causal Inference

Hebbian Learning with Kernel-Based Embedding of Input Data

Towards Causal Representation Learning and Deconfounding from Indefinite Data

Hypothesis testing using pairwise distances and associated kernels (with Appendix)

Reinterpreting causal discovery as the task of predicting unobserved joint statistics