A Robust Single Cell Clustering Method Based on Subspace Learning and Partial Imputation

Ruiqing Zheng,Zhenlan Liang,Xiangmao Meng,Yu Tian,Min Li
DOI: https://doi.org/10.1109/bibm49941.2020.9313478
2020-01-01
Abstract:Cell heterogeneity analysis is an important and urgent task in single cell data research. Numerous cell type identification methods have been proposed to address the issue. Due to the high rate of dropout and complex biological background, it is still a challenging task to obtain the accurate clusters of cells. In this study, we propose a robust single cell clustering method based on subspace learning and partial imputation, called RCSLI. RCSLI incorporates a modified variable genes selection method and utilizes the self-expression of scRNA-seq data to learn sparse cell-to-cell similarity and impute part of missing expression values. To evaluate the clustering performance of RCSLI, we compare it with nine state-of-the-art single cell clustering methods on eight scRNA-seq datasets. The experimental results show that RCSLI gets more accurate and robust clustering results. The imputation impact on the specific gene markers is evaluated on PBMC data. The classification results by taking these marker genes as predictors show RCSLI recovers the real dropouts, meanwhile, introduces less noise.
What problem does this paper attempt to address?