MetaSquare: an integrated metadatabase of 16S rRNA gene amplicon for microbiome taxonomic classification

Chun-Chieh Liao,Po-Ying Fu,Chih-Wei Huang,Chia-Hsien Chuang,Yun Yen,Chung-Yen Lin,Shu-Hwa Chen
DOI: https://doi.org/10.1093/bioinformatics/btac184
IF: 5.8
2022-03-23
Bioinformatics
Abstract:Abstract Motivation Taxonomic classification of 16S ribosomal RNA gene amplicon is an efficient and economic approach in microbiome analysis. 16S rRNA sequence databases like SILVA, RDP, EzBioCloud and HOMD used in downstream bioinformatic pipelines have limitations on either the sequence redundancy or the delay on new sequence recruitment. To improve the 16S rRNA gene-based taxonomic classification, we merged these widely used databases and a collection of novel sequences systemically into an integrated resource. Results MetaSquare version 1.0 is an integrated 16S rRNA sequence database. It is composed of more than 6 million sequences and improves taxonomic classification resolution on both long-read and short-read methods. Availability and implementation Accessible at https://hub.docker.com/r/lsbnb/metasquare_db and https://github.com/lsbnb/MetaSquare Supplementary information Supplementary data are available at Bioinformatics online.
biochemical research methods,biotechnology & applied microbiology,mathematical & computational biology
What problem does this paper attempt to address?