NyuWa Genome resource: A deep whole-genome sequencing-based variation profile and reference panel for the Chinese population

Peng Zhang, Huaxia Luo, Yanyan Li, You Wang, Jiajia Wang, Yu Zheng, Yiwei Niu, Yirong Shi, Honghong Zhou, Tingrui Song, Quan Kang, The Han100K Initiative, Tao Xu, Shunmin He
DOI: https://doi.org/10.1016/j.celrep.2021.110017
IF: 8.8
2021-11-01
Cell Reports
Abstract: The lack of haplotype reference panels and whole-genome sequencing resources specific to the Chinese population has greatly hindered genetic studies in the world's largest population. Here, we present the NyuWa genome resource, based on deep (26.2×) sequencing of 2,999 Chinese individuals, and construct a NyuWa reference panel of 5,804 haplotypes and 19.3 million variants, which is a high-quality publicly available Chinese population-specific reference panel with thousands of samples. Compared with other panels, the NyuWa reference panel reduces the Han Chinese imputation error rate by a margin ranging from 30% to 51%. Population structure and imputation simulation tests support the applicability of one integrated reference panel for northern and southern Chinese. In addition, a total of 22,504 loss-of-function variants in coding and noncoding genes are identified, including 11,493 novel variants. These results highlight the value of the NyuWa genome resource in facilitating genetic research in Chinese and Asian populations.
cell biology