Abstract:The growing demand for personalized decision-making has led to a surge of interest in estimating the Conditional Average Treatment Effect (CATE). Various types of CATE estimators have been developed with advancements in machine learning and causal inference. However, selecting the desirable CATE estimator through a conventional model validation procedure remains impractical due to the absence of counterfactual outcomes in observational data. Existing approaches for CATE estimator selection, such as plug-in and pseudo-outcome metrics, face two challenges. First, they must determine the metric form and the underlying machine learning models for fitting nuisance parameters (e.g., outcome function, propensity function, and plug-in learner). Second, they lack a specific focus on selecting a robust CATE estimator. To address these challenges, this paper introduces a Distributionally Robust Metric (DRM) for CATE estimator selection. The proposed DRM is nuisance-free, eliminating the need to fit models for nuisance parameters, and it effectively prioritizes the selection of a distributionally robust CATE estimator. The experimental results validate the effectiveness of the DRM method in selecting CATE estimators that are robust to the distribution shift incurred by covariate shift and hidden confounders.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the challenges faced when choosing the Conditional Average Treatment Effect (CATE) estimator. Specifically, the paper points out that in practical applications, due to the lack of counterfactual outcomes in observational data, traditional model validation methods cannot effectively select the best CATE estimator. Some existing methods for choosing CATE estimators, such as plug - in metrics and pseudo - outcome metrics, although providing some help, have two main problems: 1. **How to determine the metric form and the machine - learning model used to fit the nuisance parameters**: Plug - in and pseudo - outcome metrics need to use machine - learning algorithms (such as linear models, tree - based models, etc.) to estimate nuisance parameters (such as outcome functions, propensity score functions, etc.). It is very difficult to choose the appropriate metric form and machine - learning algorithm because there is no knowledge of the true generating process, which brings the problem back to the original estimator selection problem. 2. **These metric methods are not focused on choosing robust CATE estimators**: In the potential outcomes framework, there may be a distribution shift between the factual distribution \(P_F\) and the counterfactual distribution \(P_{CF}\), and this shift will be more severe when unobserved confounding factors exist. Therefore, it becomes particularly important to choose a CATE estimator that can still maintain good performance under distribution shift. To solve these problems, the paper proposes a new metric method - the Distributionally Robust Metric (DRM). The main contributions of DRM include: 1. **The DRM method is nuisance - free**: There is no need to fit models for nuisance parameters (such as outcome functions, propensity score functions, and plug - in learners). 2. **The DRM method is designed to give priority to selecting distribution - robust CATE estimators**. 3. **Provides a finite - sample analysis**: It is proved that the distribution - robust value \(\hat{V}_t(\hat{\tau})\) converges to \(V_t(\hat{\tau})\) at a rate of \(n^{- 1/2}\). 4. **The experimental results verify the effectiveness of the DRM method**: Under the distribution shift caused by covariate shift and hidden confounding factors, the DRM method can select robust CATE estimators. Through these contributions, the paper aims to provide a more effective and robust method for choosing CATE estimators, thereby improving the quality of personalized decision - making in practical applications.

Unveiling the Potential of Robustness in Selecting Conditional Average Treatment Effect Estimators

Multi-CATE: Multi-Accurate Conditional Average Treatment Effect Estimation Robust to Unknown Covariate Shifts

Robust and Agnostic Learning of Conditional Distributional Treatment Effects

Doubly Robust Targeted Estimation of Conditional Average Treatment Effects for Time-to-event Outcomes with Competing Risks

Empirical Analysis of Model Selection for Heterogeneous Causal Effect Estimation

Doubly Robust Direct Learning for Estimating Conditional Average Treatment Effect

Double-robust and efficient methods for estimating the causal effects of a binary treatment

Flexible machine learning estimation of conditional average treatment effects: a blessing and a curse

Robust Causal Learning for the Estimation of Average Treatment Effects

Are causal effect estimations enough for optimal recommendations under multitreatment scenarios?

Do Contemporary CATE Models Capture Real-World Heterogeneity? Findings from a Large-Scale Benchmark

Multiple Robustness Estimation in Causal Inference

A nonparametric super-efficient estimator of the average treatment effect

Estimation and Validation of Ratio-based Conditional Average Treatment Effects Using Observational Data

Triple/Debiased Lasso for Statistical Inference of Conditional Average Treatment Effects

Measuring Variable Importance in Individual Treatment Effect Estimation with High Dimensional Data

Robust Orthogonal Machine Learning of Treatment Effects

CATE meets ML -- The Conditional Average Treatment Effect and Machine Learning

CATE Lasso: Conditional Average Treatment Effect Estimation with High-Dimensional Linear Regression

Causal machine learning for heterogeneous treatment effects in the presence of missing outcome data

Doubly Robust Estimation in Causal Inference with Missing Outcomes: with an Application to the Aerobics Center Longitudinal Study