Inverse probability of treatment weighting with generalized linear outcome models for doubly robust estimation

Erin E Gabriel,Michael C Sachs,Torben Martinussen,Ingeborg Waernbaum,Els Goetghebeur,Stijn Vansteelandt,Arvid Sjölander
DOI: https://doi.org/10.1002/sim.9969
2023-09-11
Abstract: There are now many options for doubly robust estimation; however, there is a concerning trend in the applied literature to believe that the combination of a propensity score and an adjusted outcome model automatically results in a doubly robust estimator and/or to misuse more complex established doubly robust estimators. A simple alternative, canonical link generalized linear models (GLM) fit via inverse probability of treatment (propensity score) weighted maximum likelihood estimation followed by standardization (the g-formula) for the average causal effect, is a doubly robust estimation method. Our aim is for the reader not just to be able to use this method, which we refer to as IPTW GLM, for doubly robust estimation, but to fully understand why it has the doubly robust property. For this reason, we define clearly, and in multiple ways, all concepts needed to understand the method and why it is doubly robust. In addition, we want to make very clear that the mere combination of propensity score weighting and an adjusted outcome model does not generally result in a doubly robust estimator. Finally, we hope to dispel the misconception that one can adjust for residual confounding remaining after propensity score weighting by adjusting in the outcome model for what remains 'unbalanced' even when using doubly robust estimators. We provide R code for our simulations and real open-source data examples that can be followed step-by-step to use and hopefully understand the IPTW GLM method. We also compare to a much better-known but still simple doubly robust estimator.
Methodology
What problem does this paper attempt to address?
The paper addresses the misinterpretation and misuse of doubly robust estimation in the applied literature. Specifically:

1. **Misinterpreting the combination method**: The applied literature shows a concerning trend of assuming that combining a propensity score with an adjusted outcome model automatically produces a doubly robust estimator. This combination does not generally guarantee double robustness.
2. **Misusing complex methods**: Some researchers may misuse more complex established doubly robust estimators without fully understanding the principles behind them.
3. **Clarifying what double robustness requires**: The authors stress that the mere combination of propensity score weighting and an adjusted outcome model does not generally result in a doubly robust estimator. They also aim to dispel the misconception that residual confounding remaining after propensity score weighting can be removed by adjusting in the outcome model for covariates that remain "unbalanced", even when a doubly robust estimator is used.
4. **Providing an alternative method**: The paper presents a simple alternative: fit a canonical link generalized linear model (GLM) by inverse probability of treatment weighted maximum likelihood estimation, then standardize (the g-formula) to estimate the average causal effect (ACE). This method, referred to as IPTW GLM, is doubly robust.

### Main contributions of the paper

- **Theoretical explanation**: All necessary concepts are defined clearly, and in multiple ways, so that readers can fully understand why the IPTW GLM method is doubly robust.
- **Practical guidance**: R code for the simulations and real open-source data examples is provided, so that readers can follow the IPTW GLM method step by step.
- **Comparative analysis**: IPTW GLM is compared with a much better-known but still simple doubly robust estimator, and the two perform similarly in finite-sample simulations.
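The IPTW GLM steps described above (estimate the propensity score, form inverse probability weights, fit a weighted canonical link GLM, then standardize) can be sketched in a few lines. This is a minimal illustration and not the authors' R code: the simulated data, the binary confounder, the saturated propensity model, the Gaussian outcome with identity link, and all function names (`simulate`, `weighted_ls`, `iptw_glm_ace`) are assumptions made for the example. The outcome model deliberately omits a treatment-by-confounder interaction that is present in the simulated data, so it is misspecified while the propensity model is correct.

```python
import random


def solve(A, b):
    """Solve a small dense system A x = b by Gauss-Jordan elimination."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))  # partial pivoting
        M[c], M[p] = M[p], M[c]
        for r in range(n):
            if r != c:
                f = M[r][c] / M[c][c]
                M[r] = [a - f * v for a, v in zip(M[r], M[c])]
    return [M[i][n] / M[i][i] for i in range(n)]


def weighted_ls(design, y, w):
    """Weighted least squares, i.e. weighted ML for a Gaussian GLM with
    identity link (the canonical link for the Gaussian family)."""
    k = len(design[0])
    A = [[sum(wi * r[a] * r[b] for r, wi in zip(design, w)) for b in range(k)]
         for a in range(k)]
    c = [sum(wi * r[a] * yi for r, yi, wi in zip(design, y, w)) for a in range(k)]
    return solve(A, c)


def simulate(n, seed=1):
    """Hypothetical data: binary confounder Z, binary treatment X, outcome Y.
    The true ACE is 2 + 2 * P(Z = 1) = 3."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        z = 1 if rng.random() < 0.5 else 0
        x = 1 if rng.random() < (0.5 if z else 0.2) else 0
        y = 1 + 2 * x + 3 * z + 2 * x * z + rng.gauss(0, 1)
        data.append((x, z, y))
    return data


def iptw_glm_ace(data):
    X = [d[0] for d in data]
    Z = [d[1] for d in data]
    Y = [d[2] for d in data]
    n = len(data)

    # Step 1: propensity score, saturated in the binary confounder
    # (assumes both Z strata contain treated and untreated subjects).
    g_by_z = {}
    for level in (0, 1):
        stratum = [x for x, z in zip(X, Z) if z == level]
        g_by_z[level] = sum(stratum) / len(stratum)
    g = [g_by_z[z] for z in Z]

    # Step 2: inverse probability of treatment weights.
    W = [x / gi + (1 - x) / (1 - gi) for x, gi in zip(X, g)]

    # Step 3: weighted outcome GLM with intercept, x and z main effects only
    # (the x*z interaction is omitted on purpose).
    design = [(1.0, float(x), float(z)) for x, z in zip(X, Z)]
    coef = weighted_ls(design, Y, W)

    def mu(x, z):
        return coef[0] + coef[1] * x + coef[2] * z

    # Step 4: standardization (g-formula) over the observed covariates.
    ace = sum(mu(1, z) - mu(0, z) for z in Z) / n

    # Check: the augmented IPW form built from the same fits.
    aipw = sum(
        mu(1, z) - mu(0, z)
        + x / gi * (y - mu(1, z))
        - (1 - x) / (1 - gi) * (y - mu(0, z))
        for x, z, y, gi in zip(X, Z, Y, g)
    ) / n

    # Contrast: same outcome model without weights; with no interaction term
    # its standardized ACE is just the coefficient on x.
    unweighted = weighted_ls(design, Y, [1.0] * n)[1]
    return {"iptw_glm": ace, "aipw_form": aipw, "unweighted_glm": unweighted}


if __name__ == "__main__":
    for name, value in iptw_glm_ace(simulate(20000)).items():
        print(f"{name}: {value:.3f}")
```

In this setup the IPTW GLM estimate should land close to the true ACE of 3 and should agree with the augmented IPW form up to rounding error, whereas the unweighted fit of the same misspecified outcome model carries no such guarantee.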
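### Key formulas

- **Inverse probability weighting**:
\[ W(X, Z; \hat{\alpha}) = \frac{X}{g(Z; \hat{\alpha})} + \frac{1 - X}{1 - g(Z; \hat{\alpha})} \]
where \(X\) is the binary treatment, \(Z\) the covariates, and \(g(Z; \alpha)\) the propensity score model for \(p(X = 1 \mid Z)\).
- **Standardized estimation**:
\[ \hat{E}\{Y(x)\} = \frac{1}{n}\sum_{i = 1}^{n} q^{-1}\left(\hat{\gamma}_0 + \hat{\beta}x + m(x, Z_i; \hat{\gamma})\right) \]
where \(q\) denotes the GLM link function, so \(q^{-1}\) maps the fitted linear predictor to the mean scale, and the coefficient estimates come from the weighted fit.
- **ACE estimation**:
\[ \hat{E}\{Y(1)\} - \hat{E}\{Y(0)\} = \frac{1}{n}\left(\sum_{i = 1}^{n} s(1, Z_i; \hat{\beta}) - \sum_{i = 1}^{n} s(0, Z_i; \hat{\beta})\right) \]
Here \(s(x, Z_i; \hat{\beta})\) denotes the fitted mean for subject \(i\) with treatment set to \(x\), that is, the summand of the standardized estimate above.

With these formulas, the paper gives researchers both a clear theoretical understanding of doubly robust estimation and practical tools for applying it correctly.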
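As a brief sketch of why the standardized estimator is doubly robust, consider the weighted score equations. The notation \(\hat{\mu}(x, Z_i) = q^{-1}(\hat{\gamma}_0 + \hat{\beta}x + m(x, Z_i; \hat{\gamma}))\) is introduced here for convenience, and the argument assumes a canonical link and an outcome model containing an intercept and a main-effect term for \(x\). Under these assumptions the weighted maximum likelihood fit satisfies

\[ \sum_{i = 1}^{n} W_i \{Y_i - \hat{\mu}(X_i, Z_i)\} = 0, \qquad \sum_{i = 1}^{n} W_i X_i \{Y_i - \hat{\mu}(X_i, Z_i)\} = 0. \]

Because \(W_i X_i = X_i / g(Z_i; \hat{\alpha})\) and \(X_i \hat{\mu}(X_i, Z_i) = X_i \hat{\mu}(1, Z_i)\), the second equation gives

\[ \sum_{i = 1}^{n} \frac{X_i}{g(Z_i; \hat{\alpha})}\{Y_i - \hat{\mu}(1, Z_i)\} = 0, \]

so the standardized mean can be written as

\[ \hat{E}\{Y(1)\} = \frac{1}{n}\sum_{i = 1}^{n} \hat{\mu}(1, Z_i) = \frac{1}{n}\sum_{i = 1}^{n}\left[\hat{\mu}(1, Z_i) + \frac{X_i}{g(Z_i; \hat{\alpha})}\{Y_i - \hat{\mu}(1, Z_i)\}\right]. \]

The right-hand side has the augmented inverse probability weighted form, which is consistent if either the propensity score model or the outcome model is correctly specified. Subtracting the second score equation from the first gives the analogous identity for \(x = 0\) with weight \((1 - X_i) / \{1 - g(Z_i; \hat{\alpha})\}\). Without the canonical link or without the weights, these score identities need not hold, which is one way to see why a generic combination of weighting and outcome adjustment is not automatically doubly robust.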