Semiparametric Efficiency Gains From Parametric Restrictions on Propensity Scores

Haruki Kono
DOI: https://doi.org/10.1093/biomet/asae034
2024-07-03
Abstract:We explore how much knowing a parametric restriction on propensity scores improves semiparametric efficiency bounds in the potential outcome framework. For stratified propensity scores, considered as a parametric model, we derive explicit formulas for the efficiency gain from knowing how the covariate space is split. Based on these, we find that the efficiency gain decreases as the partition of the stratification becomes finer. For general parametric models, where it is hard to obtain explicit representations of efficiency bounds, we propose a novel framework that enables us to see whether knowing a parametric model is valuable in terms of efficiency even when it is high-dimensional. In addition to the intuitive fact that knowing the parametric model does not help much if it is sufficiently flexible, we discover that the efficiency gain can be nearly zero even though the parametric assumption significantly restricts the space of possible propensity scores.
Econometrics,Statistics Theory
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to improve semiparametric efficiency bounds in causal inference when there are parametric restrictions on the propensity scores. Specifically, the author explores to what extent knowing the parametric restrictions on the propensity scores can improve the semiparametric efficiency bounds in the potential outcome framework. The paper mainly focuses on two aspects: 1. **Stratified Propensity Scores**: For stratified propensity scores (i.e., the propensity score is constant within each region according to the partition of the covariate space), the author derives explicit formulas for the efficiency gain obtained from knowing how the covariate space is partitioned. These formulas show that as the partition becomes finer, the efficiency gain decreases. 2. **General Parametric Models**: For general parametric models, since it is difficult to obtain an explicit representation of the efficiency bounds, the author proposes a new framework that can determine whether knowing the parametric model is still valuable in high - dimensional cases. The study finds that even if the parametric assumptions significantly restrict the space of possible propensity scores, the efficiency gain may be close to zero. ### Main Contributions - **Explicit Formulas**: For stratified propensity scores, the author obtains explicit formulas for the main efficiency gains \( V_{uk} - V_p \) and \( V_p - V_k \). These formulas provide valuable insights. For example, once the partition method of the covariate space is known, the observations within the same region can be matched, thereby improving the efficiency of parameter estimation. - **New Framework**: For other parametric models, the author introduces a new framework to analyze the limiting behavior of the efficiency bounds by defining an increasing sequence of parametric models. This helps to determine whether knowing the parametric structure is almost equivalent to knowing no information in high - dimensional cases. ### Research Background - **Efficiency Bounds in Causal Inference**: J. Hahn (1998) calculated the asymptotic variance bounds of the average treatment effect and the average treatment effect for the treated group (ATT), and proposed an efficient estimator. Hirano et al. (2003) proposed the inverse probability - weighted estimator. - **Multi - valued Treatments**: Imbens (2000) extended the framework of Rosenbaum and Rubin to the multi - valued treatment case. Cattaneo (2010) provided the efficient influence function and semiparametric efficiency bounds for multi - valued treatment effects. - **Efficiency under Parametric Models**: Chen et al. (2008) derived the semiparametric efficiency bounds when the propensity score is correctly specified by a parametric model in the missing - data literature. ### Experimental Design - **Stratified Experiments**: In many practical studies, the propensity score is usually modeled by a parametric model. In observational studies, the propensity score is usually specified using a logit or probit model. In experimental studies, experiment designers often stratify participants to improve estimation efficiency. - **The Influence of Stratification**: J. Hahn (1998) pointed out that knowing the propensity score affects the semiparametric efficiency bound of ATT, but is auxiliary for the estimation of the average treatment effect. Frölich (2004) qualitatively explained this phenomenon. ### Conclusion Through theoretical analysis and explicit formulas, the paper reveals how knowing the parametric restrictions on the propensity score affects the estimation efficiency in stratified experiments. The study finds that although knowing the parametric model can improve efficiency, in some cases, this gain may be very limited. This provides an important theoretical basis and methodological guidance for future research.