Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem

Zachary Gold, Emily E. Curd, Kelly D. Goodwin, Emma S. Choi, Benjamin W. Frable, Andrew R. Thompson, Harold J. Walker, Ronald S. Burton, Dovi Kacev, Lucas D. Martz, Paul H. Barber
DOI: https://doi.org/10.1111/1755-0998.13450
IF: 7.7
2021-07-08
Molecular Ecology Resources
Abstract:<p>DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish <i>12S</i> taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross-validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish <i>12S</i> primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus-specific validation efforts.</p>
biochemistry & molecular biology,ecology,evolutionary biology