Abstract:

Motivation: Genetics holds great promise for precision medicine by tailoring treatment to individual patients based on their genetic profiles. Toward this goal, many large-scale genome-wide association studies (GWAS) have been performed in the last decade to identify genetic variants associated with various traits and diseases. They have successfully identified tens of thousands of disease-related variants. However, these variants explain only a small proportion of the overall heritability for most traits and are of very limited clinical use. This is partly owing to the small effect sizes of most genetic variants and to the common practice in most GWAS of testing the association between one trait and one genetic variant at a time, even when multiple related traits are measured for each individual. Increasing evidence suggests that many genetic variants influence multiple traits simultaneously, and that power can be gained by testing the association of multiple traits jointly. It is appealing to develop novel multi-trait association tests that need only GWAS summary data, since individual-level GWAS phenotype and genotype data are generally very hard to access.

Results: Many existing GWAS summary data-based association test methods rely on ad hoc approaches or crude Monte Carlo approximations. In this article, we develop rigorous statistical methods for efficient and powerful multi-trait association testing. We develop robust and efficient methods to accurately estimate the marginal trait correlation matrix using only GWAS summary data. We construct a principal component (PC)-based association test from the summary statistics. The PC-based test has optimal power when the underlying multi-trait signal can be captured by the first PC; otherwise its performance is suboptimal. We develop an adaptive test that optimally weights the PC-based test and the omnibus chi-square test to achieve robust performance under various scenarios. We develop efficient numerical algorithms to compute analytical P-values for all the proposed tests without the need for Monte Carlo sampling. We illustrate the utility of the proposed methods through application to GWAS meta-analysis summary data for multiple lipids and glycemic traits. We identify multiple novel loci that were missed by individual trait-based association tests.

Availability and implementation: All the proposed methods are implemented in an R package available at http://www.github.com/baolinwu/MTAR. The developed R programs are extremely efficient: it takes less than 2 min to compute the list of genome-wide significant single nucleotide polymorphisms (SNPs) for all proposed multi-trait tests for the lipids GWAS summary data with 2.5 million SNPs on a single Linux desktop.

Supplementary information: Supplementary data are available at Bioinformatics online.
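
To make the contrast between the PC-based test and the omnibus chi-square test concrete, here is a minimal two-trait sketch. It is an illustration under stated assumptions, not the MTAR package's implementation: it assumes that, under the null, a SNP's two Z-scores are jointly normal with unit variances and a known correlation r, and the function names (`chi2_sf`, `two_trait_tests`) are hypothetical. With two traits the correlation matrix [[1, r], [r, 1]] has a closed-form eigen-decomposition, so only the Python standard library is needed. The adaptive test and its analytical P-value are not covered.

```python
import math

def chi2_sf(x, df):
    """Upper-tail chi-square probability, closed forms for df = 1 or 2 only."""
    if df == 1:
        return math.erfc(math.sqrt(x / 2.0))
    if df == 2:
        return math.exp(-x / 2.0)
    raise ValueError("this sketch only covers df = 1 or 2")

def two_trait_tests(z1, z2, r):
    """Omnibus and first-PC tests for one SNP measured on two traits.

    z1, z2: the SNP's Z-scores for the two traits (GWAS summary statistics).
    r: assumed null correlation between the two Z-scores, with |r| < 1.
    Returns {"pc": (statistic, p), "omnibus": (statistic, p)}.
    """
    if not -1.0 < r < 1.0:
        raise ValueError("r must lie strictly between -1 and 1")
    sign = 1.0 if r >= 0 else -1.0
    # Eigen-decomposition of [[1, r], [r, 1]]: eigenvalues 1 + |r| and 1 - |r|,
    # eigenvectors (1, sign)/sqrt(2) and (1, -sign)/sqrt(2).
    lam1, lam2 = 1.0 + abs(r), 1.0 - abs(r)
    pc1 = (z1 + sign * z2) / math.sqrt(2.0)
    pc2 = (z1 - sign * z2) / math.sqrt(2.0)
    # First-PC statistic: chi-square with 1 df under the null.
    t_pc = pc1 ** 2 / lam1
    # Omnibus statistic z' R^{-1} z, written as the sum over both PCs:
    # chi-square with 2 df under the null.
    t_omni = t_pc + pc2 ** 2 / lam2
    return {
        "pc": (t_pc, chi2_sf(t_pc, 1)),
        "omnibus": (t_omni, chi2_sf(t_omni, 2)),
    }

if __name__ == "__main__":
    # Signal aligned with the first PC, then signal orthogonal to it.
    print(two_trait_tests(3.0, 3.0, 0.5))
    print(two_trait_tests(3.0, -3.0, 0.5))
```

With r = 0.5 and Z-scores (3, 3), the signal lies entirely on the first PC: both statistics equal 12, and the 1-df PC test gives the smaller P-value. With Z-scores (3, -3) the signal is orthogonal to the first PC: the PC statistic is 0 while the omnibus statistic is 36. This matches the abstract's point that the PC-based test is powerful only when the first PC captures the signal, which is what motivates combining the two tests.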

Integrate multiple traits to detect novel trait–gene association using GWAS summary data with an adaptive test approach

A copula-based set-variant association test for bivariate continuous or mixed phenotypes

Multiple-trait Adaptive Fisher's Method for Genome-wide Association Studies

Subset scanning for multi-trait analysis using GWAS summary statistics

The eigen higher criticism and eigen Berk-Jones tests for multiple trait association studies based on GWAS summary statistics

Multitrait transcriptome‐wide association study (TWAS) tests

Regression-based Approach for Testing the Association Between Multi-Region Haplotype Configuration and Complex Trait

Trait selection strategy in multi-trait GWAS: Boosting SNP discoverability

Multi-trait genome-wide analyses of the brain imaging phenotypes in UK Biobank

Meta-analysis of set-based multiple phenotype association test based on GWAS summary statistics from different cohorts

A Non-Parametric Method for Building Predictive Genetic Tests on High-Dimensional Data

Multivariate simulation framework reveals performance of multi-trait GWAS methods

Genome-wide association studies with high-dimensional phenotypes

Efficient variant set mixed model association tests for continuous and binary traits in large-scale whole genome sequencing studies

Integrative functional linear model for genome-wide association studies with multiple traits

mBAT-combo: a more powerful test to detect gene-trait associations from GWAS data

An iterative approach to detect pleiotropy and perform Mendelian Randomization analysis using GWAS summary statistics

SCAMPI: A scalable statistical framework for genome-wide interaction testing harnessing cross-trait correlations

A statistical framework for powerful multi-trait rare variant analysis in large-scale whole-genome sequencing studies

Multi-trait analysis of genome-wide association summary statistics using MTAG

multi-GPA-Tree: Statistical Approach for Pleiotropy Informed and Functional Annotation Tree Guided Prioritization of GWAS Results