Alexander W. Levis, Edward H. Kennedy, Alec McClean, Sivaraman Balakrishnan, Larry Wasserman
Abstract: Recent methodological research in causal inference has focused on effects of stochastic interventions, which assign treatment randomly, often according to subject-specific covariates. In this work, we demonstrate that the usual notion of stochastic interventions has a surprising property: when there is unmeasured confounding, bounds on their effects do not collapse when the policy approaches the observational regime. As an alternative, we propose to study generalized policies, treatment rules that can depend on covariates, the natural value of treatment, and auxiliary randomness. We show that certain generalized policy formulations can resolve the "non-collapsing" bound issue: bounds narrow to a point when the target treatment distribution approaches that in the observed data. Moreover, drawing connections to the theory of optimal transport, we characterize generalized policies that minimize worst-case bound width in various sensitivity analysis models, as well as corresponding sharp bounds on their causal effects. These optimal policies are new, and can have a more parsimonious interpretation compared to their usual stochastic policy analogues. Finally, we develop flexible, efficient, and robust estimators for the sharp nonparametric bounds that emerge from the framework.
What problem does this paper attempt to address?
The paper addresses partial identification of the effects of stochastic interventions in causal inference when there is unmeasured confounding. Specifically, the authors point out a defect of the usual stochastic interventions in this setting: as the intervention policy approaches the treatment distribution in the observed data, the bounds on its effect do not collapse to a point. The effect therefore remains only partially identified even for a policy that barely departs from the observational regime.
To address this, the authors propose generalized policies: treatment rules that can depend on covariates, the natural value of treatment, and auxiliary randomness. Drawing on optimal transport theory, they characterize the generalized policies that minimize worst-case bound width under various sensitivity analysis models and derive sharp bounds on the causal effects of these policies. They also develop flexible, efficient, and robust nonparametric estimators of these bounds.
### Core contributions of the paper
1. **Introduction of generalized policies**: The paper proposes generalized policies, treatment rules that can depend on covariates, the natural value of treatment, and auxiliary randomness. Certain formulations resolve the "non-collapsing" bound issue that arises under unmeasured confounding.
2. **Characterization of optimal generalized policies**: Using optimal transport theory, the paper characterizes the generalized policies that minimize worst-case bound width, and shows that their bounds narrow to a point when the target treatment distribution approaches that in the observed data.
3. **New framework for sensitivity analysis**: Sharp bounds on the effects of generalized policies are derived under various sensitivity analysis models, which relax the usual no-unmeasured-confounding assumption, so that informative causal inference remains possible when confounders are unobserved.
4. **Nonparametric estimation**: Flexible, efficient, and robust estimators are developed for the sharp nonparametric bounds that emerge from the framework.
### Formula summary
- **Conditional cumulative distribution function (Conditional CDF)**:
\[
\Pi(a|X) = P[A \leq a|X]
\]
- **Exponentially Tilted Target Distribution**:
\[
dQ_\delta(a|X)=\frac{e^{\delta a}d\Pi(a|X)}{E_P(e^{\delta A}|X)}
\]
- **Total Variation Distance**:
\[
TV(\Pi(\cdot|X), Q(\cdot|X))=\sup_{B\in\mathcal{B}(\mathbb{R})}|\Pi(B|X) - Q(B|X)|
\]
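- **Maximal Coupling**:
\[
d_Q^*(X, A, V)=1(V_1\leq\epsilon_p(A|X))A + 1(V_1 > \epsilon_p(A|X))E_q^{-1}(V_2|X)
\]
where \(V=(V_1,V_2)\) is auxiliary randomness with \(V_1\) and \(V_2\) independent uniform draws on \([0,1]\), \(\pi\) and \(q\) are the conditional densities of \(\Pi\) and \(Q\), \(E_q(\cdot|X)\) is the CDF with density \(e_q(\cdot|X)\), and
\[
\epsilon_p(A|X)=\min\left\{1,\frac{q(A|X)}{\pi(A|X)}\right\}
\]
\[
e_q(a|X)=\frac{q(a|X)-\min\{\pi(a|X), q(a|X)\}}{TV(\Pi(\cdot|X), Q(\cdot|X))}
\]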
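The formulas above can be checked numerically in a simplified setting. The sketch below is illustrative only and is not the authors' implementation: it assumes a discrete treatment on a small finite support, a single covariate stratum (so conditioning on X is dropped), and an observed distribution that is positive everywhere on the support. The function names `exponential_tilt`, `total_variation`, and `maximal_coupling_policy` are hypothetical. It builds an exponentially tilted target, applies the keep-or-redraw rule of a maximal coupling, and compares the simulated policy with the target.

```python
import numpy as np

def exponential_tilt(pi, support, delta):
    """Tilted pmf q_delta(a), proportional to exp(delta * a) * pi(a)."""
    w = np.exp(delta * support) * pi
    return w / w.sum()

def total_variation(pi, q):
    """Total variation distance between two pmfs on the same finite support."""
    return 0.5 * np.abs(pi - q).sum()

def maximal_coupling_policy(a_idx, pi, q, rng):
    """Map natural treatment indices to policy treatment indices.

    Keeps A with probability min(1, q(A)/pi(A)); otherwise redraws from the
    residual pmf, proportional to q - min(pi, q). Assumes pi > 0 everywhere.
    """
    keep_prob = np.minimum(1.0, q / pi)       # epsilon_p(a) for each support point
    residual = q - np.minimum(pi, q)          # unnormalized residual pmf e_q
    tv = residual.sum()                       # equals TV(pi, q) for pmfs
    d = a_idx.copy()
    if tv > 0:
        # V_1 > epsilon_p(A): these units do not keep their natural treatment
        redraw = rng.uniform(size=a_idx.size) > keep_prob[a_idx]
        d[redraw] = rng.choice(len(q), size=int(redraw.sum()), p=residual / tv)
    return d

# Toy example (all numbers are made up for illustration)
rng = np.random.default_rng(0)
support = np.array([0.0, 1.0, 2.0])
pi = np.array([0.5, 0.3, 0.2])                       # observed treatment pmf
q = exponential_tilt(pi, support, delta=0.5)         # target pmf Q_delta

a_idx = rng.choice(len(pi), size=200_000, p=pi)      # natural treatment values
d_idx = maximal_coupling_policy(a_idx, pi, q, rng)   # policy-assigned values

freq = np.bincount(d_idx, minlength=len(q)) / d_idx.size
changed = np.mean(d_idx != a_idx)
print("target pmf:       ", np.round(q, 3))
print("policy pmf (sim): ", np.round(freq, 3))
print("share changed:    ", round(float(changed), 3))
print("TV(pi, q):        ", round(float(total_variation(pi, q)), 3))
```

In this toy setting the simulated policy distribution should match the target up to Monte Carlo error, and the share of units whose treatment differs from its natural value should be close to the total variation distance. That share shrinks to zero as `delta` approaches zero, which mirrors the collapsing behavior described in the summary.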
Together, these results give a framework for studying stochastic interventions under unmeasured confounding, with bounds that narrow to a point as the target treatment distribution approaches the observed one.