Comprehensive and accurate genome analysis at scale using DRAGEN accelerated algorithms

Sairam Behera, Severine Catreux, Massimiliano Rossi, Sean Truong, Zhuoyi Huang, Michael Ruehle, Arun Visvanath, Gavin Parnaby, Cooper Roddey, Vitor Onuchic, Daniel L Cameron, Adam English, Shyamal Mehtalia, James Han, Rami Mehio, Fritz J Sedlazeck
DOI: https://doi.org/10.1101/2024.01.02.573821
2024-01-06
Abstract: Research and medical genomics require comprehensive and scalable solutions to drive the discovery of novel disease targets, evolutionary drivers, and genetic markers with clinical significance. This necessitates a framework that identifies all types of variants, independent of their size (e.g., SNV/SV) or location (e.g., repeats). Here we present DRAGEN, which uses novel methods based on multigenomes, hardware acceleration, and machine learning-based variant detection to provide novel insights into individual genomes with ~30 min of computation time (from raw reads to variant detection). DRAGEN outperforms all other state-of-the-art methods in speed and accuracy across all variant types (SNV, indel, STR, SV, CNV) and further incorporates specialized methods to obtain key insights into medically relevant genes (e.g., HLA, SMN, GBA). We showcase DRAGEN across 3,202 genomes and demonstrate its scalability, accuracy, and innovations to further advance the integration of comprehensive genomics into research and medical applications.
Genomics