Toward best practice in cancer mutation detection with whole-genome and whole-exome sequencing
Wenming Xiao,Luyao Ren,Zhong Chen,Li Tai Fang,Yongmei Zhao,Justin Lack,Meijian Guan,Bin Zhu,Erich Jaeger,Liz Kerrigan,Thomas M. Blomquist,Tiffany Hung,Marc Sultan,Kenneth Idler,Charles Lu,Andreas Scherer,Rebecca Kusko,Malcolm Moos,Chunlin Xiao,Stephen T. Sherry,Ogan D. Abaan,Wanqiu Chen,Xin Chen,Jessica Nordlund,Ulrika Liljedahl,Roberta Maestro,Maurizio Polano,Jiri Drabek,Petr Vojta,Sulev Kõks,Ene Reimann,Bindu Swapna Madala,Timothy Mercer,Chris Miller,Howard Jacob,Tiffany Truong,Ali Moshrefi,Aparna Natarajan,Ana Granat,Gary P. Schroth,Rasika Kalamegham,Eric Peters,Virginie Petitjean,Ashley Walton,Tsai-Wei Shen,Keyur Talsania,Cristobal Juan Vera,Kurt Langenbach,Maryellen de Mars,Jennifer A. Hipp,James C. Willey,Jing Wang,Jyoti Shetty,Yuliya Kriga,Arati Raziuddin,Bao Tran,Yuanting Zheng,Ying Yu,Margaret Cam,Parthav Jailwala,Cu Nguyen,Daoud Meerzaman,Qingrong Chen,Chunhua Yan,Ben Ernest,Urvashi Mehra,Roderick V. Jensen,Wendell Jones,Jian-Liang Li,Brian N. Papas,Mehdi Pirooznia,Yun-Ching Chen,Fayaz Seifuddin,Zhipan Li,Xuelu Liu,Wolfgang Resch,Jingya Wang,Leihong Wu,Gokhan Yavas,Corey Miles,Baitang Ning,Weida Tong,Christopher E. Mason,Eric Donaldson,Samir Lababidi,Louis M. Staudt,Zivana Tezak,Huixiao Hong,Charles Wang,Leming Shi
DOI: https://doi.org/10.1038/s41587-021-00994-5
IF: 46.9
2021-09-01
Nature Biotechnology
Abstract:Clinical applications of precision oncology require accurate tests that can distinguish true cancer-specific mutations from errors introduced at each step of next-generation sequencing (NGS). To date, no bulk sequencing study has addressed the effects of cross-site reproducibility, nor the biological, technical and computational factors that influence variant identification. Here we report a systematic interrogation of somatic mutations in paired tumor–normal cell lines to identify factors affecting detection reproducibility and accuracy at six different centers. Using whole-genome sequencing (WGS) and whole-exome sequencing (WES), we evaluated the reproducibility of different sample types with varying input amount and tumor purity, and multiple library construction protocols, followed by processing with nine bioinformatics pipelines. We found that read coverage and callers affected both WGS and WES reproducibility, but WES performance was influenced by insert fragment size, genomic copy content and the global imbalance score (GIV; G > T/C > A). Finally, taking into account library preparation protocol, tumor content, read coverage and bioinformatics processes concomitantly, we recommend actionable practices to improve the reproducibility and accuracy of NGS experiments for cancer mutation detection.
biotechnology & applied microbiology