SAIGE-GENE+ improves the efficiency and accuracy of set-based rare variant association tests

Wei Zhou,Wenjian Bi,Zhangchen Zhao,Kushal K. Dey,Karthik A. Jagadeesh,Konrad J. Karczewski,Mark J. Daly,Benjamin M. Neale,Seunggeun Lee
DOI: https://doi.org/10.1038/s41588-022-01178-w
IF: 30.8
2022-09-23
Nature Genetics
Abstract:Several biobanks, including UK Biobank (UKBB), are generating large-scale sequencing data. An existing method, SAIGE-GENE, performs well when testing variants with minor allele frequency (MAF) ≤ 1%, but inflation is observed in variance component set-based tests when restricting to variants with MAF ≤ 0.1% or 0.01%. Here, we propose SAIGE-GENE+ with greatly improved type I error control and computational efficiency to facilitate rare variant tests in large-scale data. We further show that incorporating multiple MAF cutoffs and functional annotations can improve power and thus uncover new gene–phenotype associations. In the analysis of UKBB whole exome sequencing data for 30 quantitative and 141 binary traits, SAIGE-GENE+ identified 551 gene–phenotype associations.
genetics & heredity
What problem does this paper attempt to address?