Prediction-powered Generalization of Causal Inferences

Ilker Demirel,Ahmed Alaa,Anthony Philippakis,David Sontag
2024-06-05
Abstract:Causal inferences from a randomized controlled trial (RCT) may not pertain to a target population where some effect modifiers have a different distribution. Prior work studies generalizing the results of a trial to a target population with no outcome but covariate data available. We show how the limited size of trials makes generalization a statistically infeasible task, as it requires estimating complex nuisance functions. We develop generalization algorithms that supplement the trial data with a prediction model learned from an additional observational study (OS), without making any assumptions on the OS. We theoretically and empirically show that our methods facilitate better generalization when the OS is high-quality, and remain robust when it is not, and e.g., have unmeasured confounding.
Machine Learning
What problem does this paper attempt to address?
This paper mainly discusses how to generalize the results of a randomized controlled trial (RCT) to a target population, especially when the factors that affect the outcomes in the RCT are distributed differently in the target population. The paper points out that generalizing solely based on RCT data may be statistically infeasible because it requires estimating complex interference functions. To address this problem, the paper proposes a new method, which is to use additional observational study (OS) data to complement the RCT data, even if these observational data may be biased, to improve the accuracy of generalization and maintain robustness when the OS quality is high. The authors first point out the limitations of RCT in terms of time and cost, as well as its limited external validity and inability to directly apply to target populations with different characteristic distributions. They propose a generalization algorithm that combines RCT and potentially biased OS data to improve the estimation of causal effects. By using machine learning models for prediction, these methods can leverage additional data without relying on OS assumptions and reduce generalization error when the OS quality is high. The paper introduces several key concepts such as effect modifiers, potential outcomes, and confounding bias. The authors propose several assumptions such as consistency, ignorability of treatment assignment, and positivity to support causal inference. Then, they demonstrate how to estimate the average causal effect in the target population by combining RCT and OS data and using prediction models, even in the presence of unmeasured confounding factors in the OS. The paper also discusses how their method significantly improves estimation performance when compared to previous work, especially when the OS data quality is high, and validates this finding through extensive simulations of the data generation process. Finally, the paper proposes two new identification methods - additive bias correction and enhanced outcome modeling - to integrate prediction models for more statistically efficient generalization estimation. Overall, this paper aims to address how to more accurately generalize the causal inference of RCT to target populations with different characteristic distributions by combining RCT and OS data, thereby improving the accuracy and robustness of estimation.