Semi-supervised machine learning method for predicting homogeneous ancestry groups to assess Hardy-Weinberg equilibrium in diverse whole-genome sequencing studies

Derek Shyr,Rounak Dey,Xihao Li,Hufeng Zhou,Eric Boerwinkle,Steve Buyske,Mark Daly,Richard A. Gibbs,Ira Hall,Tara Matise,Catherine Reeves,Nathan O. Stitziel,Michael Zody,Benjamin M. Neale,Xihong Lin
DOI: https://doi.org/10.1016/j.ajhg.2024.08.018
2024-10-04
The American Journal of Human Genetics
Abstract:We developed a semi-supervised machine learning method for predicting homogeneous ancestry groups in diverse whole-genome sequencing studies to facilitate valid Hardy-Weinberg equilibrium (HWE) subset testing. Compared to previous approaches, our method provided substantially better HWE subset testing performance, which can benefit downstream genetic association analyses.
genetics & heredity
What problem does this paper attempt to address?