Abstract:Propensity score matching refers to a class of multivariate methods used in comparative studies to construct treated and matched control samples that have similar distributions on many covariates. This matching is the observational study analog of randomization in ideal experiments, but is far less complete as it can only balance the distribution of observed covariates, whereas randomization balances the distribution of all covariates, both observed and unobserved. An important feature of propensity score matching is that it can be easily combined with model-based regression adjustments or with matching on a subset of special prognostic covariates or combinations of prognostic covariates that have been identified as being especially predictive of the outcome variables. We extend earlier results by developing approximations for the distributions of covariates in matched samples created with linear propensity score methods for the practically important situation where matching uses both the estimated linear propensity scores and a set of special prognostic covariates. Such matching;on a subset of special prognostic covariates is an observational study analog of blocking in a randomized experiment. An example combining propensity score matching with Mahalanobis metric matching and regression adjustment is presented that demonstrates the flexibility of these methods for designing an observational study that effectively reduces both bias due to many observed covariates and bias and variability due to a more limited subset of covariates. Of particular importance, the general approach, which includes propensity score matching, was distinctly superior to methods that focus only on a subset of the prognostically most important covariates, even if those covariates account for most of the variation in the outcome variables. Also of importance, analyses based on matched samples were superior to those based on the full unmatched samples, even when regression adjustment was included.

Comparing Covariate Prioritization via Matching to Machine Learning Methods for Causal Inference using Five Empirical Applications

Matching Methods for Causal Inference: A Review and a Look Forward

Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference

Matching Algorithms for Causal Inference with Multiple Treatments

Matched Machine Learning: A Generalized Framework for Treatment Effect Inference With Learned Metrics

Propensity Score Matching and Causal Inference: A methodological review

G-computation, propensity score-based methods, and targeted maximum likelihood estimator for causal inference with different covariates sets: a comparative simulation study

Matching Methods for Causal Inference with Time‐Series Cross‐Sectional Data

Matching with multiple criteria and its application to health disparities research

Propensity Score Augmentation in Matching-based Estimation of Causal Effects

Causal Inference and Counterfactual Prediction in Machine Learning for Actionable Healthcare

The covariate-adjusted residual estimator and its use in both randomized trials and observational settings

Uncertainty in Propensity Score Estimation: Bayesian Methods for Variable Selection and Model-Averaged Causal Effects

Combining Propensity Score Matching with Additional Adjustments for Prognostic Covariates

Covariate-adaptive randomization inference in matched designs

Handbook of Matching and Weighting Adjustments for Causal Inference

Covariate-adjusted Survival Analyses in Propensity-Score Matched Samples: Imputing Potential Time-to-event Outcomes.

Rematching on-the-fly: sequential matched randomization and a case for covariate-adjusted randomization

A Comparative Study of Design-Based and Analysis-Based Approaches to Causal Inference with Observational Data

Comparing methods for estimating causal treatment effects of administrative health data: A plasmode simulation study

A New Covariate Selection Strategy for High Dimensional Data in Causal Effect Estimation with Multivariate Treatments