A latent outcome variable approach for Mendelian randomization using the expectation maximization algorithm

Lamessa Dube Amente, Natalie T Mills, Thuc Duy Le, Elina Hypponen, Sang Hong Lee
DOI: https://doi.org/10.1101/2024.08.24.24312485
2024-08-26
Abstract: Mendelian randomization (MR) is a widely used tool to uncover causal relationships between exposures and outcomes. However, existing MR methods can suffer from inflated type I error rates and biased causal effects in the presence of invalid instruments. Our proposed method enhances MR analysis by augmenting latent phenotypes of the outcome, explicitly disentangling horizontal and vertical pleiotropy effects. This allows for explicit assessment of the exclusion restriction assumption and iteratively refines causal estimates through the expectation-maximization algorithm. This approach offers a unique and potentially more precise framework compared to existing MR methods. We rigorously evaluate our method against established MR approaches across diverse simulation scenarios, including balanced and directional pleiotropy, as well as violations of the Instrument Strength Independent of Direct Effect (InSIDE) assumption. Our findings consistently demonstrate superior performance of our method in terms of controlling type I error rates, bias, and robustness to genetic confounding. Additionally, our method facilitates testing for directional horizontal pleiotropy and outperforms MR-Egger in this regard, while also effectively testing for violations of the InSIDE assumption. We apply our method to real data, demonstrating its effectiveness compared to traditional MR methods. This analysis reveals the causal effects of body mass index (BMI) on metabolic syndrome (MetS) and a composite MetS score calculated by the weighted sum of its component factors. While the causal relationship is consistent across most methods, our proposed method shows fewer violations of the exclusion restriction assumption, especially for MetS scores where horizontal pleiotropy persists and other methods suffer from inflation.
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve

This paper addresses several key issues in Mendelian randomization (MR) analysis:

1. **Impact of Invalid Instrumental Variables**: Existing MR methods can suffer from inflated type I error rates and biased causal effect estimates when some instruments are invalid.
2. **Handling of Pleiotropy**: By augmenting latent phenotypes of the outcome, the method explicitly disentangles horizontal and vertical pleiotropy effects. This allows the exclusion restriction assumption to be assessed directly, and the causal effect estimates are refined iteratively through the expectation-maximization algorithm.
3. **Testing for Directional Pleiotropy and InSIDE Violations**: The method tests for directional horizontal pleiotropy, where it outperforms MR-Egger, and also tests for violations of the InSIDE assumption.
4. **Application to Real Data**: The method is applied to estimate the causal effects of body mass index (BMI) on metabolic syndrome (MetS) and on a composite MetS score. The causal relationship is consistent across most methods, but the proposed method shows fewer violations of the exclusion restriction assumption, especially for the MetS score.

In summary, this paper proposes a new MR analysis method (LOVA) that aims to improve the accuracy and reliability of MR analysis by explicitly disentangling horizontal and vertical pleiotropy effects and iteratively refining causal effect estimates.
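
To make the invalid-instrument problem concrete, the following is a minimal Python sketch of how directional horizontal pleiotropy biases a standard inverse-variance weighted (IVW) estimate, and how the MR-Egger intercept picks it up. It is not the paper's method and does not implement the latent outcome variable approach or its EM steps. The function names, effect sizes, and simulation settings are all hypothetical, chosen only for illustration, and the simulated pleiotropic effects are drawn independently of the SNP-exposure effects, so the InSIDE assumption holds by construction.

```python
import numpy as np


def simulate_summary_stats(n_snps=500, causal_effect=0.3, pleio_mean=0.0,
                           pleio_sd=0.02, se=0.01, seed=0):
    """Simulate SNP-level summary statistics (hypothetical settings)."""
    rng = np.random.default_rng(seed)
    # SNP-exposure effects, oriented to be positive.
    beta_x = rng.uniform(0.05, 0.2, n_snps)
    # Horizontal pleiotropy: direct SNP-outcome effects that bypass the
    # exposure. Drawn independently of beta_x, so InSIDE holds here.
    alpha = rng.normal(pleio_mean, pleio_sd, n_snps)
    # SNP-outcome effects = effect through exposure + pleiotropy + noise.
    beta_y = causal_effect * beta_x + alpha + rng.normal(0.0, se, n_snps)
    se_y = np.full(n_snps, se)
    return beta_x, beta_y, se_y


def ivw_estimate(beta_x, beta_y, se_y):
    """IVW estimate: weighted regression through the origin."""
    w = 1.0 / se_y**2
    return np.sum(w * beta_x * beta_y) / np.sum(w * beta_x**2)


def egger_estimate(beta_x, beta_y, se_y):
    """MR-Egger: weighted regression with a free intercept.

    Returns (intercept, slope). A non-zero intercept suggests
    directional horizontal pleiotropy.
    """
    w = 1.0 / se_y**2
    design = np.column_stack([np.ones_like(beta_x), beta_x])
    xtw = design.T * w  # weight each SNP's column
    intercept, slope = np.linalg.solve(xtw @ design, xtw @ beta_y)
    return intercept, slope


# Balanced pleiotropy (mean zero) versus directional pleiotropy (mean > 0).
for label, mean in [("balanced", 0.0), ("directional", 0.05)]:
    bx, by, sy = simulate_summary_stats(pleio_mean=mean, seed=1)
    ivw = ivw_estimate(bx, by, sy)
    intercept, slope = egger_estimate(bx, by, sy)
    print(f"{label}: IVW={ivw:.3f}, "
          f"Egger slope={slope:.3f}, Egger intercept={intercept:.3f}")
```

In this setup the IVW estimate should stay close to the simulated causal effect under balanced pleiotropy and drift away from it under directional pleiotropy, while the MR-Egger slope remains near the simulated value because InSIDE holds. When pleiotropic effects correlate with the SNP-exposure effects, MR-Egger is no longer protected, which is the scenario the paper's InSIDE test targets.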