Collection and curation of prokaryotic genome assemblies from type strains at NCBI

Sivakumar Kannan,Shobha Sharma,Stacy Ciufo,Karen Clark,Seán Turner,Paul A. Kitts,Conrad L. Schoch,Michael DiCuccio,Avi Kimchi
DOI: https://doi.org/10.1099/ijsem.0.005707
IF: 2.689
2023-01-21
INTERNATIONAL JOURNAL OF SYSTEMATIC AND EVOLUTIONARY MICROBIOLOGY
Abstract:The public sequence databases are entrusted with the dual responsibility of providing an accessible archive to all submitters and supporting data reliability and its re-use to all users. Genomes from type materials can act as an unambiguous reference for a taxonomic name and play an important role in comparative genomics, especially for taxon verification or reclassification. The National Center for Biotechnology Information (NCBI) collects and curates information on prokaryotic type strains and genomes from type strains. The average nucleotide identity (ANI)-based quality control processes introduced at NCBI to verify the genomes from type strains and improve related sequence records are detailed here. Using the curated genomes from type strains as reference, the taxonomy of over 1.1 million GenBank genomes were verified and the taxonomy of over 7000 new submissions before acceptance to GenBank and over 1800 existing genomes in GenBank were reclassified.
microbiology
What problem does this paper attempt to address?