Severus: accurate detection and characterization of somatic structural variation in tumor genomes using long reads

Ayse Keskus, Asher Bryant, Tanveer Ahmad, Byunggil Yoo, Sergey Aganezov, Anton Goretsky, Ataberk Donmez, Lisa A. Lansdon, Isabel Rodriguez, Jimin Park, Yuelin Liu, Xiwen Cui, Joshua Gardner, Brandy McNulty, Samuel Sacco, Jyoti Shetty, Yongmei Zhao, Bao Tran, Giuseppe Narzisi, Adrienne Helland, Daniel E. Cook, Pi-Chuan Chang, Alexey Kolesnikov, Andrew Carroll, Erin K. Molloy, Irina Pushel, Erin Guest, Tomi Pastinen, Kishwar Shafin, Karen H. Miga, Salem Malikic, Chi-Ping Day, Nicolas Robine, Cenk Sahinalp, Michael Dean, Midhat S. Farooqi, Benedict Paten, Mikhail Kolmogorov
DOI: https://doi.org/10.1101/2024.03.22.24304756
2024-03-26
Abstract: Most current studies rely on short-read sequencing to detect somatic structural variation (SV) in cancer genomes. Long-read sequencing offers better mappability and long-range phasing, which substantially improve germline SV detection. However, current long-read SV detection methods do not generalize well to the analysis of somatic SVs in tumor genomes with complex rearrangements, heterogeneity, and aneuploidy. Here, we present Severus: a method for the accurate detection of different types of somatic SVs using a phased breakpoint graph approach. To benchmark various short- and long-read SV detection methods, we sequenced five tumor/normal cell line pairs with Illumina, Nanopore, and PacBio sequencing platforms; on this benchmark, Severus showed the highest F1 scores (harmonic mean of precision and recall) compared with long-read and short-read methods. We then applied Severus to three clinical cases of pediatric cancer, demonstrating concordance with known genetic findings and revealing clinically relevant cryptic rearrangements missed by standard genomic panels.
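The abstract defines the F1 score as the harmonic mean of precision and recall. As a minimal sketch of that arithmetic only, the function below computes F1 from counts of true-positive, false-positive, and false-negative SV calls. The function name `f1_score` and the example counts are hypothetical illustrations; they are not taken from the paper or from the Severus software, and the sketch says nothing about how calls are matched to a truth set.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1, the harmonic mean of precision and recall.

    tp, fp, fn are counts of true-positive, false-positive, and
    false-negative calls. Returns 0.0 when there are no true positives,
    since precision and recall are then zero or undefined.
    """
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)  # fraction of reported calls that are correct
    recall = tp / (tp + fn)     # fraction of true events that were reported
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts for illustration only (not results from the paper).
example_f1 = f1_score(tp=90, fp=10, fn=30)
print(f"F1 = {example_f1:.3f}")
```

Because the harmonic mean is pulled toward the smaller of its two inputs, a caller cannot reach a high F1 by maximizing recall at the expense of precision, or the reverse.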
Genetic and Genomic Medicine