Removal of sequencing adapter contamination improves microbial genome databases

Andrew H. Moeller, Brian A. Dillard, Samantha L. Goldman, Madalena V. F. Real, Daniel D. Sprockett
DOI: https://doi.org/10.1186/s12864-024-10956-1
IF: 4.547
2024-11-05
BMC Genomics
Abstract: Advances in assembling microbial genomes have led to growth of reference genome databases, which have been transformative for applied and basic microbiome research. Here we show that published microbial genome databases from humans, mice, cows, pigs, fish, honeybees, and marine environments contain significant sequencing-adapter contamination that systematically reduces assembly accuracy and contiguousness. By removing the adapter-contaminated ends of contiguous sequences and reassembling MGnify reference genomes, we improve the quality of assemblies in these databases.
genetics & heredity, biotechnology & applied microbiology
What problem does this paper attempt to address?