Abstract:Summary The design of any study, whether experimental or observational, that is intended to estimate the causal effects of a treatment condition relative to a control condition refers to those activities that precede any examination of outcome variables. As defined in our 1983 article (Rosenbaum & Rubin, 1983), the propensity score is the unit-level conditional probability of assignment to treatment versus control given the observed covariates; so the propensity score explicitly does not involve any outcome variables, in contrast to other summaries of variables sometimes used in observational studies. Balancing the distributions of covariates in the treatment and control groups by matching or balancing on the propensity score is therefore an aspect of the design of the observational study. In this invited comment on our 1983 article, we review the situation in the early 1980s and recall some apparent paradoxes that propensity scores helped to resolve. We demonstrate that it is possible to balance an enormous number of low-dimensional summaries of a high-dimensional covariate, even though it is generally impossible to match individuals closely for all the components of a high-dimensional covariate. In a sense, there is only one crucial observed covariate, the propensity score, and there is one crucial unobserved covariate, the principal unobserved covariate. The propensity score and the principal unobserved covariate are equal when treatment assignment is strongly ignorable, that is, unconfounded. Controlling for observed covariates is a prelude to the crucial step from association to causation, the step that addresses potential biases from unmeasured covariates. The design of an observational study also prepares for the step to causation: by selecting comparisons to increase the design sensitivity, by seeking opportunities to detect bias, by seeking mutually supportive evidence affected by different biases, by incorporating quasi-experimental devices such as multiple control groups, and by including the economist’s instruments. All of these considerations reflect the formal development of sensitivity analyses that were largely informal prior to the 1980s.

Estimating causal effects from large data sets using propensity scores

Estimating Causal Effects from Large Data Sets Using Propensity Scores

Taking Causality Seriously: Propensity Score Methodology Applied to Estimate the Effects of Marketing Interventions

The central role of the propensity score in observational studies for causal effects

Estimating the Propensity Score

Propensity scores in the design of observational studies for causal effects

What can the millions of random treatments in nonexperimental data reveal about causes?

Adjusting for indirectly measured confounding using large-scale propensity scores

Causal effects in clinical and epidemiological studies via potential outcomes: concepts and analytical approaches.

Estimating causal effects of treatments in experimental and observational studies

Robust Estimation of Causal Effects Via a High-Dimensional Covariate Balancing Propensity Score

Robust Estimation of Causal Effects via High-Dimensional Covariate Balancing Propensity Score

Uncertainty in Propensity Score Estimation: Bayesian Methods for Variable Selection and Model-Averaged Causal Effects

High-dimensional propensity scores for empirical covariate selection in secondary database studies: Planning, implementation, and reporting

Estimating Causal Effects of Treatments in Randomized and Nonrandomized Studies

Estimation of causal effects of multiple treatments in healthcare database studies with rare outcomes

Nonparametric causal effects based on incremental propensity score interventions

GBM Propensity Score Weighting for Causal Inference Research

Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases

Bayesian propensity scores for high-dimensional causal inference: A comparison of drug-eluting to bare-metal coronary stents

Propensity Score Methods for Creating Covariate Balance in Observational Studies