Ancestry informative SNP panels for discriminating the major East Asian populations: Han Chinese, Japanese and Korean

Cheng-Min Shi,Qi Liu,Shilei Zhao,Hua Chen
DOI: https://doi.org/10.1111/ahg.12320
Abstract:Ancestry informative markers play an important role in medical genetics and forensic analyses. Several ancestry informative SNP panels have been developed and validated that can differentiate global populations into continental or major regional groups. These global panels have served as good first-tier genetic markers; however, their performance in discriminating populations within regions appears unsatisfactory. To boost ancestry inference for regional populations, second-tier panels with more refined discrimination power among subpopulations within each of the regions need to be developed. In East Asia, Han Chinese, Japanese, and Korean show highly similar externally visible characteristics and are genetically closely related. Reliable ancestry informative genetic markers appear invaluable in discriminating these populations. In the present study, we compiled a genome-wide SNP dataset composing of 317,439 clean SNPs for a total of 1101 unrelated individuals from Han Chinese (817), Koreans (184), and Japanese (100). From this starting dataset, we developed a set of four nested ancestry informative SNP panels including 36, 59, 98, and 142 SNPs, respectively. The results of cross-validation tests indicate that these panels can discriminate the Chinese Han, Japanese, and Korean populations with overall average accuracies ranging from 90% to 99%. In the further performance assessments, these panels also manifested high sensitivity and specificity. In combination with the first-tier global panels, these second-tier panels would contribute to medical genetics and forensic research in East Asia.
What problem does this paper attempt to address?