A program for real-time surveillance of SARS-CoV-2 genetics
Hayden N. Brochu,Kuncheng Song,Qimin Zhang,Qiandong Zeng,Adib Shafi,Matthew Robinson,Jake Humphrey,Bobbi Croy,Lydia Peavy,Minoli Perera,Scott Parker,John Pruitt,Jason Munroe,Rama Ghatti,Thomas J. Urban,Ayla B. Harris,David Alfego,Brian Norvell,Michael Levandoski,Brian Krueger,Jonathan D. Williams,Deborah Boles,Melinda B. Nye,Suzanne E. Dale,Michael Sapeta,Christos J. Petropoulos,Jonathan Meltzer,Marcia Eisenberg,Oren Cohen,Stanley Letovsky,Lakshmanan K. Iyer
DOI: https://doi.org/10.1101/2024.04.18.24306026
2024-04-25
Abstract:The COVID-19 pandemic brought forth an urgent need for widespread genomic surveillance for rapid detection and monitoring of emerging SARS-CoV-2 variants. It necessitated design, development, and deployment of a nationwide infrastructure designed for sequestration, consolidation, and characterization of patient samples that disseminates de-identified information to public authorities in tight turnaround times. Here, we describe our development of such an infrastructure, which sequenced 594,832 high coverage SARS-CoV-2 genomes from isolates we collected in the U.S. from March 13 2020 to July 3 2023. Our sequencing protocol (‘Virseq’) generates mutation-resistant sequencing of the entire SARS-CoV-2 genome, capturing all major lineages. We also characterize 379 clinically relevant SARS-CoV-2 multi-strain co-infections and ensure robust detection of emerging lineages via simulation. The modular infrastructure, sequencing, and analysis capabilities we describe support the U.S. Centers for Disease Control national surveillance program and serve as a model for rapid response to emerging pandemics at a national scale.
Public and Global Health