Abstract:Background The success of a Mendelian randomization (MR) study critically depends on the validity of the assumptions underlying MR. We focus on detecting heterogeneity (also known as horizontal pleiotropy) in two‐sample summary‐data MR. A popular approach is to apply Cochran's Q statistic method, developed for meta‐analysis. However, Cochran's Q statistic, including its modifications, is known to lack power when its degrees of freedom are large. Furthermore, there is no theoretical justification for the claimed null distribution of the minimum of the modified Cochran's Q statistic with exact weighting (Qmin ), although it seems to perform well in simulation studies. Method The principle of our proposed method is straightforward: if a set of variables are valid instruments, then any linear combination of these variables is still a valid instrument. Specifically, this principle holds when these linear combinations are formed using eigenvectors derived from a variance matrix. Each linear combination follows a known normal distribution from which a p value can be calculated. We use the minimum p value for these eigenvector‐based linear combinations as the test statistic. Additionally, we explore a modification of the modified Cochran's Q statistic by replacing the weighting matrix with a truncated singular value decomposition. Results Extensive simulation studies reveal that the proposed methods outperform Cochran's Q statistic, including those with modified weights, and MR‐PRESSO, another popular method for detecting heterogeneity, in cases where the number of instruments is not large or the Wald ratios take two values. We also demonstrate these methods using empirical examples. Furthermore, we show that Qmin does not follow, but is dominated by, the claimed null chi‐square distribution. The proposed methods are implemented in an R package iGasso. Conclusions Dimension reduction techniques are useful for generating powerful tests of heterogeneity in MR.

Differentiating the Cochran‐Armitage Trend Test and Pearson's Χ2 Test: Location and Dispersion

Decomposing Pearson's Χ2 Test: A Linear Regression and Its Departure from Linearity

A Nonparametric Alternative to the Cochran-Armitage Trend Test in Genetic Case-Control Association Studies: the Jonckheere-Terpstra Trend Test

A Shrinkage Method for Testing the Hardy–Weinberg Equilibrium in Case‐Control Studies

Power Analysis of Principal Components Regression in Genetic Association Studies.

Single-Locus Genetic Association Analysis By Ordinal Tests

Powerful Test of Heterogeneity in Two‐Sample Summary‐Data Mendelian Randomization

Pearson Chi-squared Conditional Randomization Test

Retrospective Versus Prospective Score Tests For Genetic Association With Case-Control Data

A consideration of the chi-square test of Hardy--Weinberg equilibrium in a non-multinomial situation

Quantifying the Relationship Between Gene Expressions and Trait Values in General Pedigrees.

Enhancing the Power to Detect Low-Frequency Variants in Genome-Wide Screens

Influence of Population Stratification on Population-Based Marker-Disease Association Analysis

Blindly using Wald's test can miss rare disease-causal variants in case-control association studies.

A Powerful Variant-Set Association Test Based on Chi-Square Distribution.

Pearson's chi‐square test and rank correlation inferences for clustered data

Power Comparisons in 2x2 Contingency Tables: Odds Ratio versus Pearson Correlation versus Canonical Correlation

Rare variant association tests for ancestry-matched case-control data based on conditional logistic regression

Haplotypes Vs Single Marker Linkage Disequilibrium Tests: What Do We Gain?

Retrospective Likelihood-Based Methods for Analyzing Case-Cohort Genetic Association Studies.

An unconditional exact test for the Hardy-Weinberg equilibrium law: sample-space ordering using the Bayes factor.