Abstract: Short-amplicon 16S rRNA gene sequencing is currently the method of choice for microbiome studies. However, comparative studies of differences between procedures are scarce. We sequenced human stool samples and mock communities of increasing complexity using a variety of commonly used protocols. Short amplicons targeting different variable regions (V-regions) or ranges thereof (V1-V2, V1-V3, V3-V4, V4, V4-V5, V6-V8, and V7-V9) were investigated for differences in the resulting composition due to primer choice. Next, we investigated the influence of the clustering approach (operational taxonomic units [OTUs], zero-radius OTUs [zOTUs], and amplicon sequence variants [ASVs]), the reference database (GreenGenes, the Ribosomal Database Project, Silva, the genomic-based 16S rRNA Database, and The All-Species Living Tree), and bioinformatic settings on taxonomic assignment. We present a systematic comparison across all typically used V-regions using well-established primers. While it is known that primer choice has a significant influence on the resulting microbial composition, we show that microbial profiles generated using different primer pairs need independent validation of performance. Further, comparing data sets across V-regions using different databases might be misleading due to differences in nomenclature (e.g., Enterorhabdus versus Adlercreutzia) and varying precision of classification down to genus level. Overall, specific but important taxa are not picked up by certain primer pairs (e.g., Bacteroidetes is missed using primers 515F-944R) or are missed due to the database used (e.g., Acetatifactor in GreenGenes and the genomic-based 16S rRNA Database). We found that appropriate truncation of amplicons is essential and that different truncated-length combinations should be tested for each study. Finally, specific mock communities of sufficient and adequate complexity are highly recommended.
IMPORTANCE In 16S rRNA gene sequencing, certain bacterial genera were found to be underrepresented or even missing in taxonomic profiles when using unsuitable primer combinations, outdated reference databases, or inadequate pipeline settings. Concerning the last, quality thresholds as well as bioinformatic settings (i.e., clustering approach, analysis pipeline, and specific adjustments such as truncation) are responsible for a number of observed differences between studies. Conclusions drawn by comparing one data set to another (e.g., between publications) appear to be problematic and require independent cross-validation using matching V-regions and uniform data processing. Therefore, we highlight the importance of a well-thought-out study design that includes sufficiently complex mock standards and an appropriate V-region choice for the sample of interest. Processing pipelines and their parameters must be tested beforehand.
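
The recommendation to test several truncated-length combinations can be illustrated with a minimal sketch. It is not the pipeline used in the study; it only enumerates candidate forward/reverse truncation lengths for paired-end reads and keeps those that would still overlap after truncation. The function name `truncation_grid`, the amplicon length of 420 nt, the candidate lengths, and the minimum overlap of 12 nt are all assumptions chosen for illustration.

```python
from itertools import product


def truncation_grid(amplicon_len, fwd_lengths, rev_lengths, min_overlap=12):
    """List (fwd, rev, overlap) truncation pairs that still allow read merging.

    Hypothetical helper: the overlap of a paired-end read after truncation is
    approximated as fwd + rev - amplicon_len, ignoring length variation
    between taxa and any primer trimming.
    """
    combos = []
    for fwd, rev in product(fwd_lengths, rev_lengths):
        overlap = fwd + rev - amplicon_len
        if overlap >= min_overlap:  # assumed minimum overlap needed to merge
            combos.append((fwd, rev, overlap))
    return combos


# Illustrative values only (not taken from the study).
grid = truncation_grid(
    amplicon_len=420,
    fwd_lengths=[220, 250, 280],
    rev_lengths=[180, 200, 220],
)
for fwd, rev, overlap in grid:
    print(f"truncate forward at {fwd}, reverse at {rev}: overlap {overlap} nt")
```

Each surviving combination would then be run through the chosen pipeline on a mock community, so that the truncation setting is selected by recovered composition rather than by overlap alone.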

Primer, Pipelines, Parameters: Issues in 16S rRNA Gene Sequencing
