A detailed workflow to develop QIIME2-formatted reference databases for taxonomic analysis of DNA metabarcoding data

Benjamin Dubois,Frédéric Debode,Louis Hautier,Julie Hulin,Gilles San Martin,Alain Delvaux,Eric Janssen,Dominique Mingeot
DOI: https://doi.org/10.1186/s12863-022-01067-5
2022-07-08
BMC Genetics
Abstract:The DNA metabarcoding approach has become one of the most used techniques to study the taxa composition of various sample types. To deal with the high amount of data generated by the high-throughput sequencing process, a bioinformatics workflow is required and the QIIME2 platform has emerged as one of the most reliable and commonly used. However, only some pre-formatted reference databases dedicated to a few barcode sequences are available to assign taxonomy. If users want to develop a new custom reference database, several bottlenecks still need to be addressed and a detailed procedure explaining how to develop and format such a database is currently missing. In consequence, this work is aimed at presenting a detailed workflow explaining from start to finish how to develop such a curated reference database for any barcode sequence.
genetics & heredity
What problem does this paper attempt to address?