Abstract:We explore how much knowing a parametric restriction on propensity scores improves semiparametric efficiency bounds in the potential outcome framework. For stratified propensity scores, considered as a parametric model, we derive explicit formulas for the efficiency gain from knowing how the covariate space is split. Based on these, we find that the efficiency gain decreases as the partition of the stratification becomes finer. For general parametric models, where it is hard to obtain explicit representations of efficiency bounds, we propose a novel framework that enables us to see whether knowing a parametric model is valuable in terms of efficiency even when it is high-dimensional. In addition to the intuitive fact that knowing the parametric model does not help much if it is sufficiently flexible, we discover that the efficiency gain can be nearly zero even though the parametric assumption significantly restricts the space of possible propensity scores.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to improve semiparametric efficiency bounds in causal inference when there are parametric restrictions on the propensity scores. Specifically, the author explores to what extent knowing the parametric restrictions on the propensity scores can improve the semiparametric efficiency bounds in the potential outcome framework. The paper mainly focuses on two aspects: 1. **Stratified Propensity Scores**: For stratified propensity scores (i.e., the propensity score is constant within each region according to the partition of the covariate space), the author derives explicit formulas for the efficiency gain obtained from knowing how the covariate space is partitioned. These formulas show that as the partition becomes finer, the efficiency gain decreases. 2. **General Parametric Models**: For general parametric models, since it is difficult to obtain an explicit representation of the efficiency bounds, the author proposes a new framework that can determine whether knowing the parametric model is still valuable in high - dimensional cases. The study finds that even if the parametric assumptions significantly restrict the space of possible propensity scores, the efficiency gain may be close to zero. ### Main Contributions - **Explicit Formulas**: For stratified propensity scores, the author obtains explicit formulas for the main efficiency gains \( V_{uk} - V_p \) and \( V_p - V_k \). These formulas provide valuable insights. For example, once the partition method of the covariate space is known, the observations within the same region can be matched, thereby improving the efficiency of parameter estimation. - **New Framework**: For other parametric models, the author introduces a new framework to analyze the limiting behavior of the efficiency bounds by defining an increasing sequence of parametric models. This helps to determine whether knowing the parametric structure is almost equivalent to knowing no information in high - dimensional cases. ### Research Background - **Efficiency Bounds in Causal Inference**: J. Hahn (1998) calculated the asymptotic variance bounds of the average treatment effect and the average treatment effect for the treated group (ATT), and proposed an efficient estimator. Hirano et al. (2003) proposed the inverse probability - weighted estimator. - **Multi - valued Treatments**: Imbens (2000) extended the framework of Rosenbaum and Rubin to the multi - valued treatment case. Cattaneo (2010) provided the efficient influence function and semiparametric efficiency bounds for multi - valued treatment effects. - **Efficiency under Parametric Models**: Chen et al. (2008) derived the semiparametric efficiency bounds when the propensity score is correctly specified by a parametric model in the missing - data literature. ### Experimental Design - **Stratified Experiments**: In many practical studies, the propensity score is usually modeled by a parametric model. In observational studies, the propensity score is usually specified using a logit or probit model. In experimental studies, experiment designers often stratify participants to improve estimation efficiency. - **The Influence of Stratification**: J. Hahn (1998) pointed out that knowing the propensity score affects the semiparametric efficiency bound of ATT, but is auxiliary for the estimation of the average treatment effect. Frölich (2004) qualitatively explained this phenomenon. ### Conclusion Through theoretical analysis and explicit formulas, the paper reveals how knowing the parametric restrictions on the propensity score affects the estimation efficiency in stratified experiments. The study finds that although knowing the parametric model can improve efficiency, in some cases, this gain may be very limited. This provides an important theoretical basis and methodological guidance for future research.

Semiparametric Efficiency Gains From Parametric Restrictions on Propensity Scores

Variance Reduction for Causal Inference

Doubly robust estimation of average treatment effect revisited

Efficient combination of observational and experimental datasets under general restrictions on outcome mean functions

A Semiparametric Approach to Model Effect Modification

Robust Estimating Method for Propensity Score Models and its Application to Some Causal Estimands: A review and proposal

Calibrated and Conformal Propensity Scores for Causal Effect Estimation

PENALIZED VARIABLE SELECTION PROCEDURE FOR COX MODELS WITH SEMIPARAMETRIC RELATIVE RISK

Simultaneous Conformal Prediction of Missing Outcomes with Propensity Score $ε$-Discretization

Robust Estimation of Causal Effects via High-Dimensional Covariate Balancing Propensity Score

Causal Effect Estimation after Propensity Score Trimming with Continuous Treatments

Robust Estimation of Causal Effects Via a High-Dimensional Covariate Balancing Propensity Score

Propensity Score Modeling: Key Challenges When Moving Beyond the No-Interference Assumption

Estimating and Using Propensity Scores with Partially Missing Data

High Dimensional Propensity Score Estimation via Covariate Balancing

Estimating the Propensity Score

Censored quantile regression based on multiply robust propensity scores.

Improving Semiparametric Estimation by Using Surrogate Data

Propensity Score Adapted Covariate Selection for Causal Inference

Improving Inverse Probability Weighting by Post-calibrating Its Propensity Scores

Using Propensity Scores to Estimate Effects of Treatment Initiation Decisions: State of the Science