Human ancestry inference at scale, from genomic data

René L Warren, Lauren Coombe, Johnathan Wong, Parham Kazemi, Inanc Birol
DOI: https://doi.org/10.1101/2024.03.26.586646
2024-03-29
Abstract: Using an alignment-free single nucleotide variant prediction framework that leverages integrated variant call sets from the 1000 Genomes Project, we demonstrate accurate ancestry inference on over 600 human genome sequencing datasets, including complete genomes, draft assemblies, and >280 independently generated datasets. The method presented, ntRoot, infers super-population ancestry along an input human genome in 1h15m or less on 30X sequencing data, and will be an enabling technology for cohort studies.
Genetics