Limits of Approximating the Median Treatment Effect

Raghavendra Addanki, Siddharth Bhandari
2024-03-16
Abstract: Average Treatment Effect (ATE) estimation is a well-studied problem in causal inference. However, it does not necessarily capture the heterogeneity in the data, and several approaches have been proposed to tackle the issue, including estimating the Quantile Treatment Effects. In the finite population setting containing $n$ individuals, with treatment and control values denoted by the potential outcome vectors $\mathbf{a}, \mathbf{b}$, much of the prior work focused on estimating median$(\mathbf{a}) -$ median$(\mathbf{b})$, where median($\mathbf x$) denotes the median value in the sorted ordering of all the values in vector $\mathbf x$. It is known that estimating the difference of medians is easier than the desired estimand of median$(\mathbf{a-b})$, called the Median Treatment Effect (MTE). The fundamental problem of causal inference -- for every individual $i$, we can only observe one of the potential outcome values, i.e., either the value $a_i$ or $b_i$, but not both -- makes estimating MTE particularly challenging. In this work, we argue that MTE is not estimable and detail a novel notion of approximation that relies on the sorted order of the values in $\mathbf{a-b}$. Next, we identify a quantity called variability that exactly captures the complexity of MTE estimation. By drawing connections to instance-optimality studied in theoretical computer science, we show that every algorithm for estimating the MTE obtains an approximation error that is no better than the error of an algorithm that computes variability. Finally, we provide a simple linear time algorithm for computing the variability exactly. Unlike much prior work, a particular highlight of our work is that we make no assumptions about how the potential outcome vectors are generated or how they are correlated, except that the potential outcome values are $k$-ary, i.e., take one of $k$ discrete values.
Machine Learning, Artificial Intelligence, Data Structures and Algorithms, Econometrics, Methodology
What problem does this paper attempt to address?
### Problems the paper attempts to solve

This paper addresses the problem of estimating the **Median Treatment Effect (MTE)** in causal inference. Specifically, it studies how to estimate the median of the difference vector \( \mathbf{a} - \mathbf{b} \), where \( \mathbf{a} \) and \( \mathbf{b} \) are the potential outcome vectors under treatment and control, in a finite population setting where only one of the two potential outcomes is observed for each individual.

### Background and motivation

1. **Average Treatment Effect (ATE)**:
   - ATE estimation is a classic, extensively studied problem in causal inference. However, ATE does not necessarily capture the heterogeneity in the data.
2. **Quantile Treatment Effect (QTE)**:
   - To capture how treatment effects differ across individuals, researchers have proposed estimating Quantile Treatment Effects, of which the Median Treatment Effect (MTE) is a special case.
3. **Challenges of MTE**:
   - For each individual \( i \), only one of the potential outcomes \( a_i \) or \( b_i \) can be observed, never both. This makes estimating MTE particularly difficult.
   - Much prior work estimates the difference of medians \( \text{median}(\mathbf{a}) - \text{median}(\mathbf{b}) \), which is easier to estimate but is not the same quantity as the median of the individual effects, \( \text{median}(\mathbf{a} - \mathbf{b}) \).

### Research objectives

1. **Is MTE estimable?**:
   - The authors argue that MTE is not estimable, that is, no algorithm can estimate it accurately in general.
2. **Approximate estimation of MTE**:
   - To work around this, the authors introduce a new notion of approximation that does not measure the absolute error between the estimate and the median. Instead, it measures error by the rank of the estimate in the sorted order of the values in \( \mathbf{a} - \mathbf{b} \).
   - Specifically, if an estimate \( \hat{m} \) falls within the quantile interval \( 0.5 \pm \epsilon \) of the sorted \( \mathbf{a} - \mathbf{b} \) values, it is considered an \( \epsilon \)-quantile approximation.
3. **Limits of approximate estimation**:
   - The authors define a quantity called **variability**, which exactly captures the complexity of MTE estimation.
   - Drawing on the concept of instance-optimality from theoretical computer science, the authors show that every algorithm for estimating the MTE obtains an approximation error that is no better than the error of an algorithm that computes variability.
   - Finally, the authors provide a simple linear-time algorithm that computes variability exactly.

### Main contributions

1. **Non-estimability**:
   - Argues that MTE is not estimable and details the reasons.
2. **Approximation concept**:
   - Introduces a new notion of approximation for MTE based on rank in the sorted order of \( \mathbf{a} - \mathbf{b} \).
3. **Variability**:
   - Defines variability, which exactly captures the complexity of MTE estimation.
   - Shows that no algorithm for estimating the MTE achieves an approximation error better than that of an algorithm that computes variability.
4. **Efficient algorithm**:
   - Provides a linear-time algorithm that computes variability exactly.

### Conclusion

By introducing a rank-based notion of approximation and the variability quantity, this paper characterizes the limits of MTE estimation and gives an efficient algorithm for computing variability. The only assumption on the potential outcomes is that they are \( k \)-ary, i.e., take one of \( k \) discrete values.
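
The gap between the difference of medians and the MTE, and the \( \epsilon \)-quantile approximation described above, can be illustrated with a small Python sketch. This is not the paper's algorithm: the toy vectors, the function names, and the tie-handling rule (an estimate counts as an \( \epsilon \)-quantile approximation if the range of quantile positions it can occupy overlaps \( [0.5 - \epsilon, 0.5 + \epsilon] \)) are assumptions made for illustration, and the paper's formal definition may differ.

```python
from statistics import median

def difference_of_medians(a, b):
    """median(a) - median(b): the estimand used in much prior work."""
    return median(a) - median(b)

def median_treatment_effect(a, b):
    """median(a - b): needs both potential outcomes for every individual."""
    return median([x - y for x, y in zip(a, b)])

def quantile_range(diffs, m_hat):
    """Range of quantile positions m_hat can occupy in sorted diffs.

    Assumption (illustrative): with ties, m_hat may sit anywhere between
    the fraction of values strictly below it and the fraction at or below it.
    """
    n = len(diffs)
    below = sum(d < m_hat for d in diffs)
    at_or_below = sum(d <= m_hat for d in diffs)
    return below / n, at_or_below / n

def is_eps_quantile_approx(diffs, m_hat, eps):
    """True if m_hat's quantile range overlaps [0.5 - eps, 0.5 + eps]."""
    lo, hi = quantile_range(diffs, m_hat)
    return lo <= 0.5 + eps and hi >= 0.5 - eps

# Hypothetical toy data: 3-ary outcomes. The control vectors hold the same
# values but pair them with different individuals.
a = [0, 1, 2]
b_aligned = [0, 1, 2]   # a - b = [0, 0, 0]
b_shifted = [1, 2, 0]   # a - b = [-1, -1, 2]

# The difference of medians cannot tell the two worlds apart ...
print(difference_of_medians(a, b_aligned), difference_of_medians(a, b_shifted))
# ... but the MTE differs between them.
print(median_treatment_effect(a, b_aligned), median_treatment_effect(a, b_shifted))

# In the shifted world, check how good an estimate the difference of medians is.
diffs_shifted = [x - y for x, y in zip(a, b_shifted)]
m_hat = difference_of_medians(a, b_shifted)
for eps in (0.1, 0.2):
    print(eps, is_eps_quantile_approx(diffs_shifted, m_hat, eps))
```

In this toy example both worlds share the same difference of medians while their MTEs differ, which is the intuition for why the difference of medians does not determine the MTE. The rank-based check judges an estimate by where it lands in the sorted differences rather than by its numeric distance from the true median.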