Abstract:ABSTRACT Oxford Nanopore sequencing is one of the high-throughput sequencing technologies that facilitates the reconstruction of metagenome-assembled genomes (MAGs). This study aimed to assess the potential of long-read assembly algorithms in Oxford Nanopore sequencing to enhance the MAG-based identification of bacterial pathogens using both simulated and mock communities. Simulated communities were generated to mimic those on fresh spinach and in surface water. Long reads were produced using R9.4.1+SQK-LSK109 and R10.4 + SQK-LSK112, with 0.5, 1, and 2 million reads. The simulated bacterial communities included multidrug-resistant Salmonella enterica serotypes Heidelberg, Montevideo, and Typhimurium in the fresh spinach community individually or in combination, as well as multidrug-resistant Pseudomonas aeruginosa in the surface water community. Real data sets of the ZymoBIOMICS HMW DNA Standard were also studied. A bioinformatic pipeline (MAGenie, freely available at https://github.com/jackchen129/MAGenie ) that combines metagenome assembly, taxonomic classification, and sequence extraction was developed to reconstruct draft MAGs from metagenome assemblies. Five assemblers were evaluated based on a series of genomic analyses. Overall, Flye outperformed the other assemblers, followed by Shasta, Raven, and Unicycler, while Canu performed least effectively. In some instances, the extracted sequences resulted in draft MAGs and provided the locations and structures of antimicrobial resistance genes and mobile genetic elements. Our study showcases the viability of utilizing the extracted sequences for precise phylogenetic inference, as demonstrated by the consistent alignment of phylogenetic topology between the reference genome and the extracted sequences. R9.4.1+SQK-LSK109 was more effective in most cases than R10.4+SQK-LSK112, and greater sequencing depths generally led to more accurate results. IMPORTANCE By examining diverse bacterial communities, particularly those housing multiple Salmonella enterica serotypes, this study holds significance in uncovering the potential of long-read assembly algorithms to improve metagenome-assembled genome (MAG)-based pathogen identification through Oxford Nanopore sequencing. Our research demonstrates that long-read assembly stands out as a promising avenue for boosting precision in MAG-based pathogen identification, thus advancing the development of more robust surveillance measures. The findings also support ongoing endeavors to fine-tune a bioinformatic pipeline for accurate pathogen identification within complex metagenomic samples.

Easing genomic surveillance: A comprehensive performance evaluation of long-read assemblers across multi-strain mixture data of HIV-1 and Other pathogenic viruses for constructing a user-friendly bioinformatic pipeline

Comparative Evaluation of Bioinformatic Pipelines for Full-Length Viral Genome Assembly

Complementary Insights into Gut Viral Genomes: a Comparative Benchmark of Short- and Long-Read Metagenomes Using Diverse Assemblers and Binners

Benchmarking of Long-Read Sequencing, Assemblers and Polishers for Yeast Genome

Benchmarking of bioinformatics tools for the hybrid de novo assembly of human whole-genome sequencing data

Comprehensive assessment of 11 de novo HiFi assemblers on complex eukaryotic genomes and metagenomes

Evaluating long-read de novo assembly tools for eukaryotic genomes: insights and considerations

Bridging genomic gaps: A versatile SARS-CoV-2 benchmark dataset for adaptive laboratory workflows

Benchmarking short and long read polishing tools for nanopore assemblies: achieving near-perfect genomes for outbreak isolates

Assessment of Metagenomic Assemblers Based on Hybrid Reads of Real and Simulated Metagenomic Sequences

Benchmarking genome assembly methods on metagenomic sequencing data

Assembly Arena: Benchmarking RNA isoform reconstruction algorithms for nanopore sequencing

Strain-resolved de-novo metagenomic assembly of viral genomes and microbial 16S rRNAs

Benchmarking multi-platform sequencing technologies for human genome assembly

Performance comparison of next generation sequencing analysis pipelines for HIV-1 drug resistance testing

Evaluation of 10 Different Pipelines for Bacterial Single-Nucleotide Variant Detection

AsmMix: an efficient haplotype-resolved hybrid de novo genome assembling pipeline

Benchmarking of next and third generation sequencing technologies and their associated algorithms for de novo genome assembly

Assessing the performance of current strain resolution tools on long-read metagenomes

Advancing metagenome-assembled genome-based pathogen identification: unraveling the power of long-read assembly algorithms in Oxford Nanopore sequencing

A survey on computational strategies for genome-resolved gut metagenomics