Rare variants discovery by extensive whole-genome sequencing of the Han Chinese population in Taiwan: Applications to cardiovascular medicine

Jyh-Ming Jimmy Juang,Tzu-Pin Lu,Ming-Wei Su,Chien-Wei Lin,Jenn-Hwai Yang,Hou-Wei Chu,Chien-Hsiun Chen,Yi-Wen Hsiao,Chien-Yueh Lee,Li-Mei Chiang,Qi-You Yu,Chuhsing Kate Hsiao,Ching-Yu Julius Chen,Pei-Ei Wu,Chien-Hua Pai,Eric Y. Chuang,Chen-Yang Shen
DOI: https://doi.org/10.1016/j.jare.2020.12.003
IF: 12.822
2021-05-01
Journal of Advanced Research
Abstract:A population-specific genomic reference is important for research and clinical practice, yet it remains unavailable for Han Chinese (HC) in Taiwan.We report the first whole genome sequencing (WGS) database of HC (1000 Taiwanese genome (1KTW-WGS)) and demonstrate several applications to cardiovascular medicine.Whole genomes of 997 HC were sequenced to at least 30X depth. A total of 20,117 relatively healthy HC individuals were genotyped using a customized Axiom GWAS array. We performed a genome-wide genotype imputation technique using IMPUTE2.We identified 26.7 million single-nucleotide variants (SNVs) and 4.2 million insertions-deletions. Of the SNVs, 16.1% were novel relative to dbSNP (build 152), and 34.2% were novel relative to gnomAD. A total of 18,450 healthy HC individuals were genotyped using a customized Genome-Wide Association Study (GWAS) array. We identified hypertension-associated variants and developed a hypertension prediction model based on the correlation between the WGS data and GWAS data (combined clinical and genetic models, AUC 0.887), and also identified 3 novel hyperlipidemia-associated variants. Each individual carried an average of 16.42 (SD = 3.72) disease-causing variants. Additionally, we established an online <em>SCN5A</em> (an important cardiac gene) database that can be used to explore racial differences. Finally, pharmacogenetics studies identified HC population-specific SNVs in genes (<em>CYP2C9</em> and <em>VKORC1</em>) involved in drug metabolism and blood clotting.This research demonstrates the benefits of constructing a population-specific genomic reference database for precision medicine.
multidisciplinary sciences
What problem does this paper attempt to address?