Joint mixed-effects models for causal inference in clustered network-based observational studies

Vanessa McNealis, Erica E. M. Moodie, Nema Dean
DOI: https://doi.org/10.48550/arXiv.2404.07411
2024-04-11
Abstract: Causal inference on populations embedded in social networks poses technical challenges, since the typical no interference assumption frequently does not hold. Existing methods developed in the context of network interference rely upon the assumption of no unmeasured confounding. However, when faced with multilevel network data, there may be a latent factor influencing both the exposure and the outcome at the cluster level. We propose a Bayesian inference approach that combines a joint mixed-effects model for the outcome and the exposure with direct standardization to identify and estimate causal effects in the presence of network interference and unmeasured cluster confounding. In simulations, we compare our proposed method with linear mixed and fixed effects models and show that unbiased estimation is achieved using the joint model. Having derived valid tools for estimation, we examine the effect of maternal college education on adolescent school performance using data from the National Longitudinal Study of Adolescent Health.
Methodology
What problem does this paper attempt to address?
This paper addresses the problem of causal inference for populations embedded in social networks. Specifically, it asks how to identify and estimate causal effects when two complications occur together: network interference and unmeasured confounding at the cluster level. Traditional methods usually rely on the no-interference assumption, under which an individual's potential outcome is unaffected by the exposure status of other individuals. In social networks this assumption often fails, because social interactions let one person's exposure influence another person's outcome, producing spillover effects. Existing methods for network interference, in turn, assume there is no unmeasured confounding, yet in multilevel network data (such as student friendship networks nested within schools) a latent cluster-level factor may influence both the exposure and the outcome. The authors propose a Bayesian approach that combines a joint mixed-effects model for the outcome and the exposure with direct standardization. In simulation studies, they compare the proposed method with linear mixed-effects and fixed-effects models and report that the joint model achieves unbiased estimation. Finally, they apply the method to data from the National Longitudinal Study of Adolescent Health (Add Health) to examine the effect of maternal college education on adolescent school performance. In short, the paper's core contribution is a statistical method for estimating causal effects in the presence of both network interference and unmeasured cluster-level confounding.
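To make the setting concrete, the following Python sketch simulates clustered network data in which a latent cluster-level factor drives both exposure and outcome, and a node's outcome also depends on the fraction of its exposed neighbors. It is only an illustration of the problem, not the authors' simulation design or their joint model: the function names, the random within-cluster graph, and all parameter values (effect sizes, tie probability, cluster sizes) are assumptions chosen for this example. The sketch compares a naive least-squares fit that ignores the latent factor with an "oracle" fit that observes it, which is not possible in practice.

```python
import numpy as np


def simulate_clustered_network(n_clusters=200, cluster_size=30, direct=1.0,
                               spillover=0.5, confounding=1.0,
                               tie_prob=0.2, seed=0):
    """Simulate data with interference and a latent cluster confounder.

    All parameter values are illustrative assumptions.
    Returns an array with columns: outcome, exposure,
    fraction of exposed neighbors, latent cluster factor.
    """
    rng = np.random.default_rng(seed)
    blocks = []
    for _ in range(n_clusters):
        # Latent cluster factor: unobserved in a real study.
        u = rng.normal()
        # Exposure probability increases with the latent factor.
        p = 1.0 / (1.0 + np.exp(-confounding * u))
        a = rng.binomial(1, p, size=cluster_size)
        # Random symmetric within-cluster network without self-ties.
        upper = np.triu(rng.random((cluster_size, cluster_size)) < tie_prob, k=1)
        adj = (upper | upper.T).astype(float)
        degree = adj.sum(axis=1)
        # Fraction of exposed neighbors (0 for isolated nodes).
        frac = np.where(degree > 0, adj @ a / np.maximum(degree, 1.0), 0.0)
        # Outcome depends on own exposure, neighbors' exposure, and u.
        y = (direct * a + spillover * frac + confounding * u
             + rng.normal(size=cluster_size))
        blocks.append(np.column_stack([y, a, frac, np.full(cluster_size, u)]))
    return np.vstack(blocks)


def fit_ols(data, include_latent):
    """Least-squares fit; returns (direct, spillover) coefficient estimates."""
    y, a, frac, u = data.T
    columns = [np.ones_like(y), a, frac]
    if include_latent:
        columns.append(u)  # oracle adjustment, unavailable in practice
    design = np.column_stack(columns)
    coef = np.linalg.lstsq(design, y, rcond=None)[0]
    return coef[1], coef[2]


if __name__ == "__main__":
    data = simulate_clustered_network()
    print("truth  (direct, spillover):", (1.0, 0.5))
    print("naive  (direct, spillover):", fit_ols(data, include_latent=False))
    print("oracle (direct, spillover):", fit_ols(data, include_latent=True))
```

Under these assumed settings, the naive fit overstates both the direct and the spillover coefficient, because the latent factor raises exposure prevalence and the outcome at the same time. The oracle fit recovers the simulated values only because it conditions on the factor that a real analysis cannot observe, which is the gap the paper's joint model is meant to address.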