Low coverage of species constrains the use of DNA barcoding to assess mosquito biodiversity

Maurício Moraes Zenker,Tatiana Pineda Portella,Felipe Arley Costa Pessoa,Johan Bengtsson-Palme,Pedro Manoel Galetti
DOI: https://doi.org/10.1038/s41598-024-58071-1
IF: 4.6
2024-03-30
Scientific Reports
Abstract:Mosquitoes (Culicidae) represent the main vector insects globally, and they also inhabit many of the terrestrial and aquatic habitats of the world. DNA barcoding and metabarcoding are now widely used in both research and routine practices involving mosquitoes. However, these methodologies rely on information available in databases consisting of barcode sequences representing taxonomically identified voucher specimens. In this study, we assess the availability of public data for mosquitoes in the main online databases, focusing specifically on the two most widely used DNA barcoding markers in Culicidae: COI and ITS2. In addition, we test hypotheses on possible factors affecting species coverage (i.e., the percentage of species covered in the online databases) for COI in different countries and the occurrence of the DNA barcode gap for COI. Our findings showed differences in the data publicly available in the repositories, with a taxonomic or species coverage of 28.4–30.11% for COI in BOLD + GenBank, and 12.32% for ITS2 in GenBank. Afrotropical, Australian and Oriental biogeographic regions had the lowest coverages, while Nearctic, Palearctic and Oceanian had the highest. The Neotropical region had an intermediate coverage. In general, countries with a higher diversity of mosquitoes and higher numbers of medically important species had lower coverage. Moreover, countries with a higher number of endemic species tended to have a higher coverage. Although our DNA barcode gap analyses suggested that the species boundaries need to be revised in half of the mosquito species available in the databases, additional data must be gathered to confirm these results and to allow explaining the occurrence of the DNA barcode gap. We hope this study can help guide regional species inventories of mosquitoes and the completion of a publicly available reference library of DNA barcodes for all mosquito species.
multidisciplinary sciences
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to evaluate the data coverage of mosquito species in major online databases, especially for two of the most commonly used DNA barcode markers: COI (cytochrome c oxidase subunit I) and ITS2 (internal transcribed spacer 2 of the ribosomal RNA gene). The study also tested the factors that may affect the COI species coverage in different countries and the existence of the DNA barcode gap of COI. Through these analyses, the authors hope to guide the compilation of regional mosquito species lists and promote the improvement of publicly available mosquito DNA barcode reference libraries. Specifically, the paper focuses on the following aspects: 1. **Evaluating the availability of public data**: The study evaluated the availability of COI and ITS2 barcode data of mosquito species in the BOLD system and GenBank. 2. **Testing the factors affecting species coverage**: The study tested the factors that may affect the COI species coverage, including species richness, endemic species richness, the number of medically important species, etc. 3. **Analyzing the DNA barcode gap**: The study analyzed the existence of the COI barcode gap to evaluate the reliability of species boundaries. Through these analyses, the paper aims to reveal the data gaps existing in the current databases and provide guidance for future barcode projects.