Abstract:Abstract Motivation Genetics hold great promise to precision medicine by tailoring treatment to the individual patient based on their genetic profiles. Toward this goal, many large-scale genome-wide association studies (GWAS) have been performed in the last decade to identify genetic variants associated with various traits and diseases. They have successfully identified tens of thousands of disease-related variants. However they have explained only a small proportion of the overall trait heritability for most traits and are of very limited clinical use. This is partly owing to the small effect sizes of most genetic variants, and the common practice of testing association between one trait and one genetic variant at a time in most GWAS, even when multiple related traits are often measured for each individual. Increasing evidence suggests that many genetic variants can influence multiple traits simultaneously, and we can gain more power by testing association of multiple traits simultaneously. It is appealing to develop novel multi-trait association test methods that need only GWAS summary data, since it is generally very hard to access the individual-level GWAS phenotype and genotype data. Results Many existing GWAS summary data-based association test methods have relied on ad hoc approach or crude Monte Carlo approximation. In this article, we develop rigorous statistical methods for efficient and powerful multi-trait association test. We develop robust and efficient methods to accurately estimate the marginal trait correlation matrix using only GWAS summary data. We construct the principal component (PC)-based association test from the summary statistics. PC-based test has optimal power when the underlying multi-trait signal can be captured by the first PC, and otherwise it will have suboptimal performance. We develop an adaptive test by optimally weighting the PC-based test and the omnibus chi-square test to achieve robust performance under various scenarios. We develop efficient numerical algorithms to compute the analytical P-values for all the proposed tests without the need of Monte Carlo sampling. We illustrate the utility of proposed methods through application to the GWAS meta-analysis summary data for multiple lipids and glycemic traits. We identify multiple novel loci that were missed by individual trait-based association test. Availability and implementation All the proposed methods are implemented in an R package available at http://www.github.com/baolinwu/MTAR. The developed R programs are extremely efficient: it takes less than 2 min to compute the list of genome-wide significant single nucleotide polymorphisms (SNPs) for all proposed multi-trait tests for the lipids GWAS summary data with 2.5 million SNPs on a single Linux desktop. Supplementary information Supplementary data are available at Bioinformatics online.

Gfa2bin enables graph-based GWAS by converting genome graphs to pan-genomic genotypes

Genotype Representation Graphs: Enabling Efficient Analysis of Biobank-Scale Data

GAUSS: a summary-statistics-based R package for accurate estimation of linkage disequilibrium for variants, Gaussian imputation, and TWAS analysis of cosmopolitan cohorts

Gretl - Variation GRaph Evaluation TooLkit

graph-GPA 2.0: A Graphical Model for Multi-disease Analysis of GWAS Results with Integration of Functional Annotation Data

WikiGWA: an Open Platform for Collecting and Using Genome-Wide Association Results

GACT: a Genome Build and Allele Definition Conversion Tool for SNP Imputation and Meta-Analysis in Genetic Association Studies

GwasWA: A GWAS One-Stop Analysis Platform from WGS Data to Variant Effect Assessment

GWASTool: A Web Pipeline for Detecting SNP-phenotype Associations

PathGPS: discover shared genetic architecture using GWAS summary data

SNPTransformer: a lightweight toolkit for genome-wide association studies

The Great Genotyper: A Graph-Based Method for Population Genotyping of Small and Structural Variants

GPA: A Statistical Approach to Prioritizing GWAS Results by Integrating Pleiotropy and Annotation

A fast and agnostic method for bacterial genome-wide association studies: Bridging the gap between k-mers and genetic events

G2P: a Genome-Wide-Association-Study simulation tool for genotype simulation, phenotype simulation and power evaluation.

Bi-level Structured Functional Analysis for Genome-Wide Association Studies.

Reproduction and In-Depth Evaluation of Genome-Wide Association Studies and Genome-Wide Meta-analyses Using Summary Statistics.

BridGE: a pathway-based analysis tool for detecting genetic interactions from GWAS

Integrate multiple traits to detect novel trait–gene association using GWAS summary data with an adaptive test approach

MGAS: a Powerful Tool for Multivariate Gene-Based Genome-Wide Association Analysis

Pan-African genome demonstrates how population-specific genome graphs improve high-throughput sequencing data analysis