Number of sIngle-cell Clusters Estimation ( NICE ) : a robust and priori knowledge independent algorithm for single-cell RNA-seq data analysis

Xin Zou
2017-01-01
Abstract:Interpretation of single-cell transcriptomic data is usually achieved by dissecting cells into a few clusters. A successful interpretation performance is highly dependent on how well the number of cell clusters matches the intrinsic structure of the dataset. However, the existing estimation methods for number of clusters either require priori information which is not usually available for single-cell data, or are subject to interferences, such as artificial bias and technological noise. To tackle these issues, we propose a novel algorithm, Number of sIngle-cell Clusters Estimation (NICE), to estimate the number of cell clusters based on single-cell RNA-seq data. The new algorithm can effectively discriminate significant variations from subtle perturbations without requiring any priori information about the datasets, and therefore, it is highly robust. Furthermore, the output of algorithm guarantees each cell cluster is significantly distinctive from the others, and thereby, each cell cluster has specific biological meaning. Keywords—single-cell analysis; clustering; number of clusters estimation
What problem does this paper attempt to address?