Mediation Analysis with Mendelian Randomization and Efficient Multiple GWAS Integration

Rita Qiuran Lyu,Chong Wu,Xinwei Ma,Jingshen Wang
2024-05-17
Abstract:Mediation analysis is a powerful tool for studying causal pathways between exposure, mediator, and outcome variables of interest. While classical mediation analysis using observational data often requires strong and sometimes unrealistic assumptions, such as unconfoundedness, Mendelian Randomization (MR) avoids unmeasured confounding bias by employing genetic variations as instrumental variables. We develop a novel MR framework for mediation analysis with genome-wide associate study (GWAS) summary data, and provide solid statistical guarantees. Our framework employs carefully crafted estimating equations, allowing for different sets of genetic variations to instrument the exposure and the mediator, to efficiently integrate information stored in three independent GWAS. As part of this endeavor, we demonstrate that in mediation analysis, the challenge raised by instrument selection goes beyond the well-known winner's curse issue, and therefore, addressing it requires special treatment. We then develop bias correction techniques to address the instrument selection issue and commonly encountered measurement error bias issue. Collectively, through our theoretical investigations, we show that our framework provides valid statistical inference for both direct and mediation effects with enhanced statistical efficiency compared to existing methods. We further illustrate the finite-sample performance of our approach through simulation experiments and a case study.
Methodology,Statistics Theory
What problem does this paper attempt to address?
The paper attempts to address a series of challenges faced when conducting mediation analysis within the framework of Mendelian Randomization (MR). Specifically, these issues include: 1. **Unmeasured Confounding Factors**: Traditional mediation analysis methods rely on observational data and often require the assumption that there are no unmeasured confounding factors. This assumption is difficult to meet in practical applications, leading to potential bias in causal effect estimation. 2. **Selection of Genetic Instrumental Variables**: When using genetic variants as instrumental variables, selecting appropriate instruments is a crucial issue. Improper selection may introduce biases such as the "Winner’s Curse" and the "Loser’s Curse." 3. **Measurement Error Bias**: Effect size estimates in Genome-Wide Association Studies (GWAS) often contain measurement errors, which can lead to biased estimation results. 4. **Limitations of Existing Methods**: Existing two-step MR and multivariable MR methods lack solid theoretical guarantees when performing mediation analysis and may not provide effective statistical inference. To address these challenges, the authors propose a new MR framework that integrates summary data from multiple GWAS to improve statistical efficiency and provides effective bias correction techniques to ensure valid causal effect estimation and statistical inference even when the selection of instrumental variables is imperfect. Specifically, this framework uses carefully designed estimating equations that allow the use of different sets of genetic variants to instrument the exposure and mediator variables, effectively integrating information from three independent GWAS. Additionally, the authors develop correction techniques for the issues of instrumental variable selection and measurement error bias to enhance the accuracy and reliability of the estimates.