Abstract: The Candida Genome Database (CGD; www.candidagenome.org) is unique in being both a model organism database and a fungal pathogen database. As a fungal pathogen database, CGD hosts locus pages for five species of the best-studied pathogenic fungi in the Candida group. As a model organism database, the species Candida albicans serves as a model both for other Candida spp. and for non-Candida fungi that form biofilms and undergo routine morphogenic switching from the planktonic form to the filamentous form, which other model yeasts do not do. As pathogenic Candida species have become increasingly drug resistant, the high lethality of invasive candidiasis in immunocompromised people is increasingly alarming. There is a pressing need for additional research into basic biology, epidemiology and phylogeny, and potential new antifungals. CGD serves the needs of this diverse research community by curating the entire gene-based experimental literature as it is published, and by extracting, organizing, and standardizing gene annotations. Most recently, we have begun linking clinical data on disease to relevant Literature Topics to improve searchability for clinical researchers. Because CGD curates for multiple species and most research focuses on aspects related to pathogenicity, we focus our curation efforts on assigning Literature Topic tags, collecting detailed mutant phenotype data, and assigning controlled Gene Ontology terms with accompanying evidence codes. Our Summary pages for each feature include the primary name and all aliases for that locus; a description of the gene and/or gene product; detailed ortholog information with links; a JBrowse window showing the gene on its chromosome; summarized phenotype, Gene Ontology, and sequence information; references cited on the summary page itself; and any locus notes. The database also serves as a community hub, linking to reference material of relevance to researchers, including colleague information, news, and notices of upcoming meetings.
We routinely survey the community to learn how the field is evolving and how needs may have changed. A key future challenge is management of the flood of high-throughput expression data to make it as useful as possible to as many researchers as possible. The central challenge for any community database is to turn data into knowledge, which the community can access, use, and build upon.
