Improving Power of Association Tests Using Multiple Sets of Imputed Genotypes from Distributed Reference Panels

Wei Zhou,Lars G. Fritsche,Sayantan Das,He Zhang,Jonas B. Nielsen,Oddgeir L. Holmen,Jin Chen,Maoxuan Lin,Maiken B. Elvestad,Kristian Hveem,Goncalo R. Abecasis,Hyun Min Kang,Cristen J. Willer
DOI: https://doi.org/10.1002/gepi.22067
2017-01-01
Genetic Epidemiology
Abstract:The accuracy of genotype imputation depends upon two factors: the sample size of the reference panel and the genetic similarity between the reference panel and the target samples. When multiple reference panels are not consented to combine together, it is unclear how to combine the imputation results to optimize the power of genetic association studies. We compared the accuracy of 9,265 Norwegian genomes imputed from three reference panels1000 Genomes phase 3 (1000G), Haplotype Reference Consortium (HRC), and a reference panel containing 2,201 Norwegian participants from the population-based Nord TrOndelag Health Study (HUNT) from low-pass genome sequencing. We observed that the population-matched reference panel allowed for imputation of more population-specific variants with lower frequency (minor allele frequency (MAF) between 0.05% and 0.5%). The overall imputation accuracy from the population-specific panel was substantially higher than 1000G and was comparable with HRC, despite HRC being 15-fold larger. These results recapitulate the value of population-specific reference panels for genotype imputation. We also evaluated different strategies to utilize multiple sets of imputed genotypes to increase the power of association studies. We observed that testing association for all variants imputed from any panel results in higher power to detect association than the alternative strategy of including only one version of each genetic variant, selected for having the highest imputation quality metric. This was particularly true for lower frequency variants (MAF<1%), even after adjusting for the additional multiple testing burden.
What problem does this paper attempt to address?