The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications

Rolf Henrik Nilsson, Karl-Henrik Larsson, Andy F S Taylor, Johan Bengtsson-Palme, Thomas S Jeppesen, Dmitry Schigel, Peter Kennedy, Kathryn Picard, Frank Oliver Glöckner, Leho Tedersoo, Irja Saar, Urmas Kõljalg, Kessy Abarenkov
DOI: https://doi.org/10.1093/nar/gky1022
IF: 14.9
2018-10-29
Nucleic Acids Research
Abstract: UNITE (https://unite.ut.ee/) is a web-based database and sequence management environment for the molecular identification of fungi. It targets the formal fungal barcode, the nuclear ribosomal internal transcribed spacer (ITS) region, and offers all ∼1 000 000 public fungal ITS sequences for reference. These are clustered into ∼459 000 species hypotheses and assigned digital object identifiers (DOIs) to promote unambiguous reference across studies. In-house and web-based third-party sequence curation and annotation have resulted in more than 275 000 improvements to the data over the past 15 years. UNITE serves as a data provider for a range of metabarcoding software pipelines and regularly exchanges data with all major fungal sequence databases and other community resources. Recent improvements include redesigned handling of unclassifiable species hypotheses, integration with the taxonomic backbone of the Global Biodiversity Information Facility, and support for an unlimited number of parallel taxonomic classification systems.
biochemistry & molecular biology