CDD/SPARCLE: Functional Classification of Proteins Via Subfamily Domain Architectures

Aron Marchler-Bauer, Yu Bo, Lianyi Han, Jane He, Christopher J. Lanczycki, Shennan Lu, Farideh Chitsaz, Myra K. Derbyshire, Renata C. Geer, Noreen R. Gonzales, Marc Gwadz, David I. Hurwitz, Fu Lu, Gabriele H. Marchler, James S. Song, Narmada Thanki, Zhouxi Wang, Roxanne A. Yamashita, Dachuan Zhang, Chanjuan Zheng, Lewis Y. Geer, Stephen H. Bryant
DOI: https://doi.org/10.1093/nar/gkw1129
IF: 14.9
2016-01-01
Nucleic Acids Research
Abstract: NCBI's Conserved Domain Database (CDD) aims at annotating biomolecular sequences with the location of evolutionarily conserved protein domain footprints, and functional sites inferred from such footprints. An archive of pre-computed domain annotation is maintained for proteins tracked by NCBI's Entrez database, and live search services are offered as well. CDD curation staff supplements a comprehensive collection of protein domain and protein family models, which have been imported from external providers, with representations of selected domain families that are curated in-house and organized into hierarchical classifications of functionally distinct families and sub-families. CDD also supports comparative analyses of protein families via conserved domain architectures, and a recent curation effort focuses on providing functional characterizations of distinct subfamily architectures using SPARCLE: Subfamily Protein Architecture Labeling Engine. CDD can be accessed at https://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml.