Real-time Plasmid Transmission Detection Pipeline

Natalie Scherff,Jörg Rothgänger,Thomas Weniger,Alexander Mellmann,Dag Harmsen
DOI: https://doi.org/10.1101/2024.07.09.602722
2024-08-22
Abstract:The spread of antimicrobial resistance among bacteria by horizontal plasmid transmissions poses a major challenge for clinical microbiology. Here, we evaluate a new real-time plasmid transmission detection pipeline implemented in the SeqSphere+ (Ridom GmbH, Muenster, Germany) software. Within the pipeline, a local Mash plasmid database is created and Mash searches with a distance threshold of 0.001 are used to trigger plasmid transmission early warning alerts (EWA). Clonal transmissions are detected using cgMLST allelic differences. The integrated tools MOB-suite, NCBI AMRFinderPlus, CGE MobileElementFinder, pyGenomeViz, and MUMmer are used to characterize plasmids and for visual pairwise plasmid comparisons, respectively. We evaluated the pipeline using published hybrid assemblies (Oxford Nanopore Technology/Illumina) of a surveillance and outbreak dataset with plasmid transmissions. To emulate prospective usage, samples were imported in chronological order of sampling date. Different combinations of the user-adjustable parameters sketch size (1,000 vs 10,000) and plasmid size correction were tested and discrepancies between resulting clusters were analyzed with Quast. When using a sketch size of 1,000 with size correction turned on, the SeqSphere+ pipeline agreed with the published data and produced the same clonal and carbapenemase-carrying plasmid clusters. EWAs were in the correct chronological order. In summary, the developed pipeline presented here is suitable for integration into clinical microbiology settings with limited bioinformatics knowledge due to its automated analyses and alert system, which are combined with the GUI-based SeqSphere+ platform. Thus, with its integrated sample database, (near) real-time plasmid transmission detection is within reach in bacterial routine-diagnostic settings when long-read sequencing is employed.
Bioinformatics
What problem does this paper attempt to address?