Assessing the efficacy of target adaptive sampling long-read sequencing through hereditary cancer patient genomes
Wataru Nakamura,Makoto Hirata,Satoyo Oda,Kenichi Chiba,Ai Okada,Raúl Nicolás Mateos,Masahiro Sugawa,Naoko Iida,Mineko Ushiama,Noriko Tanabe,Hiromi Sakamoto,Shigeki Sekine,Akira Hirasawa,Yosuke Kawai,Katsushi Tokunaga,Shin-ichi Tsujimoto,Norio Shiba,Shuichi Ito,Teruhiko Yoshida,Yuichi Shiraishi,Hatsue Ishibashi-Ueda,Tsutomu Tomita,Michio Noguchi,Ayako Takahashi,Yu-ichi Goto,Sumiko Yoshida,Kotaro Hattori,Ryo Matsumura,Aritoshi Iida,Yutaka Maruoka,Hiroyuki Gatanaga,Masaya Sugiyama,Satoshi Suzuki,Kengo Miyo,Yoichi Matsubara,Akihiro Umezawa,Kenichiro Hata,Tadashi Kaname,Kouichi Ozaki,Haruhiko Tokuda,Hiroshi Watanabe,Shumpei Niida,Eisei Noiri,Koji Kitajima,Yosuke Omae,Reiko Miyahara,Hideyuki Shimanuki,NCBN Controls WGS Consortium
DOI: https://doi.org/10.1038/s41525-024-00394-z
2024-02-18
npj Genomic Medicine
Abstract:Innovations in sequencing technology have led to the discovery of novel mutations that cause inherited diseases. However, many patients with suspected genetic diseases remain undiagnosed. Long-read sequencing technologies are expected to significantly improve the diagnostic rate by overcoming the limitations of short-read sequencing. In addition, Oxford Nanopore Technologies (ONT) offers adaptive sampling and computationally driven target enrichment technology. This enables more affordable intensive analysis of target gene regions compared to standard non-selective long-read sequencing. In this study, we developed an efficient computational workflow for target adaptive sampling long-read sequencing (TAS-LRS) and evaluated it through application to 33 genomes collected from suspected hereditary cancer patients. Our workflow can identify single nucleotide variants with nearly the same accuracy as the short-read platform and elucidate complex forms of structural variations. We also newly identified several SINE-R/VNTR/Alu (SVA) elements affecting the APC gene in two patients with familial adenomatous polyposis, as well as their sites of origin. In addition, we demonstrated that off-target reads from adaptive sampling, which is typically discarded, can be effectively used to accurately genotype common single-nucleotide polymorphisms (SNPs) across the entire genome, enabling the calculation of a polygenic risk score. Furthermore, we identified allele-specific MLH1 promoter hypermethylation in a Lynch syndrome patient. In summary, our workflow with TAS-LRS can simultaneously capture monogenic risk variants including complex structural variations, polygenic background as well as epigenetic alterations, and will be an efficient platform for genetic disease research and diagnosis.
genetics & heredity