The UNITE database for molecular identification and taxonomic communication of fungi and other eukaryotes: sequences, taxa and classifications reconsidered

Kessy Abarenkov,R Henrik Nilsson,Karl-Henrik Larsson,Andy F S Taylor,Tom W May,Tobias Guldberg Frøslev,Julia Pawlowska,Björn Lindahl,Kadri Põldmaa,Camille Truong,Duong Vu,Tsuyoshi Hosoya,Tuula Niskanen,Timo Piirmann,Filipp Ivanov,Allan Zirk,Marko Peterson,Tanya E Cheeke,Yui Ishigami,Arnold Tobias Jansson,Thomas Stjernegaard Jeppesen,Erik Kristiansson,Vladimir Mikryukov,Joseph T Miller,Ryoko Oono,Francisco J Ossandon,Joana Paupério,Irja Saar,Dmitry Schigel,Ave Suija,Leho Tedersoo,Urmas Kõljalg
DOI: https://doi.org/10.1093/nar/gkad1039
2024-01-05
Abstract:UNITE (https://unite.ut.ee) is a web-based database and sequence management environment for molecular identification of eukaryotes. It targets the nuclear ribosomal internal transcribed spacer (ITS) region and offers nearly 10 million such sequences for reference. These are clustered into ∼2.4M species hypotheses (SHs), each assigned a unique digital object identifier (DOI) to promote unambiguous referencing across studies. UNITE users have contributed over 600 000 third-party sequence annotations, which are shared with a range of databases and other community resources. Recent improvements facilitate the detection of cross-kingdom biological associations and the integration of undescribed groups of organisms into everyday biological pursuits. Serving as a digital twin for eukaryotic biodiversity and communities worldwide, the latest release of UNITE offers improved avenues for biodiversity discovery, precise taxonomic communication and integration of biological knowledge across platforms.
What problem does this paper attempt to address?