Exhaustive reanalysis of barcode sequences from public repositories highlights ongoing misidentifications and impacts taxa diversity and distribution
Antoine Fort,Marcus McHale,Kevin Cascella,Philippe Potin,Marie‐Mathilde Perrineau,Philip D. Kerrison,Elisabete Costa,Ricardo Calado,Maria do Rosário Domingues,Isabel Costa Azevedo,Isabel Sousa‐Pinto,Claire Gachon,Adrie Werf,Willem Visser,Johanna E. Beniers,Henrice Jansen,Michael D. Guiry,Ronan Sulpice
DOI: https://doi.org/10.1111/1755-0998.13453
IF: 7.7
2021-07-05
Molecular Ecology Resources
Abstract:<p>Accurate species identification often relies on public repositories to compare the barcode sequences of the investigated individual(s) with taxonomically assigned sequences. However, the accuracy of identifications in public repositories is often questionable, and the names originally given are rarely updated. For instance, species of the Sea Lettuce (<i>Ulva</i> spp.; Ulvophyceae, Ulvales, Ulvaceae) are frequently misidentified in public repositories, including herbaria and gene banks, making species identification based on traditional barcoding unreliable. We DNA barcoded 295 individual distromatic foliose strains of <i>Ulva</i> from the North-East Atlantic for three loci (<i>rbc</i>L, <i>tuf</i>A, ITS1). Seven distinct species were found, and we compared our results with all worldwide <i>Ulva</i> spp sequences present in the NCBI database for the three barcodes <i>rbc</i>L, <i>tuf</i>A and the ITS1. Our results demonstrate a large degree of species misidentification, where we estimate that 24 to 32% of the entries pertaining to foliose species are misannotated and provide an exhaustive list of NCBI sequences reannotations. An analysis of the global distribution of registered samples from foliose species also indicates possible geographical isolation for some species, and the absence of <i>U</i>. <i>lactuca</i> from Northern Europe. We extended our analytical framework to three other genera, <i>Fucus</i>, <i>Porphyra</i> and <i>Pyropia</i> and also identified erroneously labelled accessions and possibly new synonymies, albeit less than for <i>Ulva</i> spp. Altogether, exhaustive taxonomic clarification by aggregation of a library of barcode sequences highlights misannotations and delivers an improved representation of species diversity and distribution.</p>
biochemistry & molecular biology,ecology,evolutionary biology