SuperCT: a supervised-learning framework for enhanced characterization of single-cell transcriptomic profiles.

Peng Xie,Mingxuan Gao,Chunming Wang,Jianfei Zhang,Pawan Noel,Chaoyong Yang,Daniel Von Hoff,Haiyong Han,Michael Q Zhang,Wei Lin
DOI: https://doi.org/10.1093/nar/gkz116
IF: 14.9
2019-01-01
Nucleic Acids Research
Abstract:Characterization of individual cell types is fundamental to the study of multicellular samples. Single-cell RNAseq techniques, which allow high-throughput expression profiling of individual cells, have significantly advanced our ability of this task. Currently, most of the scRNA-seq data analyses are commenced with unsupervised clustering. Clusters are often assigned to different cell types based on the enriched canonical markers. However, this process is inefficient and arbitrary. In this study, we present a technical framework of training the expandable supervised-classifier in order to reveal the single-cell identities as soon as the single-cell expression profile is input. Using multiple scRNA-seq datasets we demonstrate the superior accuracy, robustness, compatibility and expandability of this new solution compared to the traditional methods. We use two examples of the model upgrade to demonstrate how the projected evolution of the cell-type classifier is realized.
What problem does this paper attempt to address?