Haplotype-resolved de novo assembly of a Tujia genome suggests the necessity for high-quality population-specific genome references

Haiyi Lou,Yang Gao,Bo Xie,Yimin Wang,Haikuan Zhang,Miao Shi,Sen Ma,Xiaoxi Zhang,Chang Liu,Shuhua Xu
DOI: https://doi.org/10.1016/j.cels.2022.01.006
IF: 11.091
2022-04-01
Cell Systems
Abstract:Even though the human reference genome assembly is continually being improved, it remains debatable whether a population-specific reference is necessary for every ethnic group. Here, we de novo assembled an individual genome (TJ1) from the Tujia population, an ethnic minority group most closely related to the Han Chinese. TJ1 provided a high-quality haplotype-resolved assembly of chromosome-scale with a scaffold N50 size >78 Mb. Compared with GRCh38 and other de novo assemblies, TJ1 improved short-read mapping, enhanced calling precision for structural variants, and detected rare and low-frequency variants. This revealed fine-scale differences between the closely related Han and Tujia populations, such as population-stratified variants of LCT and UBXN8, and improved screening for ancestry informative markers. We demonstrated that TJ1 could reduce false positives in clinical diagnosis and analyzed the PRSS1-PRSS2 locus as a test case. Our results suggest that population-specific assemblies are necessary for genetic and medical analysis, especially when closely related populations are studied. A record of this paper's transparent peer review process is included in the supplemental information.
cell biology,biochemistry & molecular biology
What problem does this paper attempt to address?