Abstract:Abstract Motivation Genetics hold great promise to precision medicine by tailoring treatment to the individual patient based on their genetic profiles. Toward this goal, many large-scale genome-wide association studies (GWAS) have been performed in the last decade to identify genetic variants associated with various traits and diseases. They have successfully identified tens of thousands of disease-related variants. However they have explained only a small proportion of the overall trait heritability for most traits and are of very limited clinical use. This is partly owing to the small effect sizes of most genetic variants, and the common practice of testing association between one trait and one genetic variant at a time in most GWAS, even when multiple related traits are often measured for each individual. Increasing evidence suggests that many genetic variants can influence multiple traits simultaneously, and we can gain more power by testing association of multiple traits simultaneously. It is appealing to develop novel multi-trait association test methods that need only GWAS summary data, since it is generally very hard to access the individual-level GWAS phenotype and genotype data. Results Many existing GWAS summary data-based association test methods have relied on ad hoc approach or crude Monte Carlo approximation. In this article, we develop rigorous statistical methods for efficient and powerful multi-trait association test. We develop robust and efficient methods to accurately estimate the marginal trait correlation matrix using only GWAS summary data. We construct the principal component (PC)-based association test from the summary statistics. PC-based test has optimal power when the underlying multi-trait signal can be captured by the first PC, and otherwise it will have suboptimal performance. We develop an adaptive test by optimally weighting the PC-based test and the omnibus chi-square test to achieve robust performance under various scenarios. We develop efficient numerical algorithms to compute the analytical P-values for all the proposed tests without the need of Monte Carlo sampling. We illustrate the utility of proposed methods through application to the GWAS meta-analysis summary data for multiple lipids and glycemic traits. We identify multiple novel loci that were missed by individual trait-based association test. Availability and implementation All the proposed methods are implemented in an R package available at http://www.github.com/baolinwu/MTAR. The developed R programs are extremely efficient: it takes less than 2 min to compute the list of genome-wide significant single nucleotide polymorphisms (SNPs) for all proposed multi-trait tests for the lipids GWAS summary data with 2.5 million SNPs on a single Linux desktop. Supplementary information Supplementary data are available at Bioinformatics online.

Subset scanning for multi-trait analysis using GWAS summary statistics

Integrate multiple traits to detect novel trait–gene association using GWAS summary data with an adaptive test approach

Trait selection strategy in multi-trait GWAS: Boosting SNP discoverability

Multi-trait genome-wide analyses of the brain imaging phenotypes in UK Biobank

Multivariate simulation framework reveals performance of multi-trait GWAS methods

The eigen higher criticism and eigen Berk-Jones tests for multiple trait association studies based on GWAS summary statistics.

Multiple-trait Adaptive Fisher's Method for Genome-wide Association Studies

Multitrait transcriptome‐wide association study (TWAS) tests

Multi-trait GWAS for diverse ancestries: mapping the knowledge gap

SCAMPI: A scalable statistical framework for genome-wide interaction testing harnessing cross-trait correlations

Genome-wide large-scale multi-trait analysis characterizes global patterns of pleiotropy and unique trait-specific variants

GWAShug: a comprehensive platform for decoding the shared genetic basis between complex traits based on summary statistics

multi-GPA-Tree: Statistical Approach for Pleiotropy Informed and Functional Annotation Tree Guided Prioritization of GWAS Results

Genome-wide association studies with high-dimensional phenotypes

MTAG: Multi-Trait Analysis of GWAS

PathGPS: discover shared genetic architecture using GWAS summary data

Multi-trait analysis of genome-wide association summary statistics using MTAG

The advantages and limitations of trait analysis with GWAS: a review

Meta-analysis of set-based multiple phenotype association test based on GWAS summary statistics from different cohorts

A statistical framework for powerful multi-trait rare variant analysis in large-scale whole-genome sequencing studies

Integrative functional linear model for genome-wide association studies with multiple traits