Improving polygenic prediction from summary data by learning patterns of effect sharing across multiple phenotypes

Deborah Kunkel,Peter Sørensen,Vijay Shankar,Fabio Morgante
DOI: https://doi.org/10.1101/2024.05.06.592745
2024-05-10
Abstract:Polygenic prediction of complex trait phenotypes has become important in human genetics, especially in the context of precision medicine. Recently, Morgante . introduced , a flexible and computationally efficient method that models multiple phenotypes jointly and leverages sharing of effects across such phenotypes to improve prediction accuracy. However, a drawback of is that it requires individual-level data, which are often not publicly available. In this work, we introduce , an extension of the model that requires only summary statistics from Genome-Wide Association Studies (GWAS) and linkage disequilibrium (LD) estimates from a reference panel. By using summary data, we achieve the twin goal of increasing the applicability of the model to data sets that are not publicly available and making it scalable to biobank-size data. Through simulations, we show that is competitive with, and often outperforms, current state-of-the-art methods for single- and multi-phenotype polygenic prediction in a variety of scenarios that differ in the pattern of effect sharing across phenotypes, the number of phenotypes, the number of causal variants, and the genomic heritability. We also present a real data analysis of 16 blood cell phenotypes in UK Biobank, showing that achieves higher prediction accuracy than competing methods for the majority of traits, especially when the data has smaller sample size.
Genetics
What problem does this paper attempt to address?