Cellular Similarity based Imputation for Single cell RNA Sequencing Data

Chenliang Liu, Yuan Zhu, Houwang Zhang
DOI: https://doi.org/10.1145/3473258.3473269
2021-05-21
Abstract: Single-cell RNA sequencing (scRNA-seq) technology measures gene expression at single-cell resolution and can resolve cell heterogeneity that traditional bulk RNA sequencing (bulk RNA-seq) cannot. It provides a powerful analytical tool for in-depth study of the immune, regulatory, replication, and other activities of cells, and for explaining the rules that govern life activities. However, single-cell sequencing experiments are hindered by several technical issues, which produce scRNA-seq output with a high percentage of zeros (dropout events) and reduce the reliability of downstream analyses. A great number of computational methods have therefore been proposed to address the increased sparsity observed in scRNA-seq data. In this paper, motivated by image denoising methods, we propose CSI, an imputation algorithm based on the similarity between cells. The similarity between cells is used as the weight in an average of gene expression values, and this weighted average imputes the missing values in the data. CSI achieves better accuracy than seven other well-known scRNA-seq imputation methods on some real scRNA-seq datasets, and in particular gains better performance in some downstream analyses such as cluster analysis and pseudo-trajectory inference.
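The weighted-average idea in the abstract can be illustrated with a minimal sketch. This is not the CSI algorithm itself, whose details the abstract does not give. The sketch assumes that cosine similarity is the cell similarity measure, that every zero is a dropout, and that only cells with a non-zero value for a gene act as donors. The function names `cosine_similarity` and `impute_zeros` are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two expression vectors (assumed measure)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def impute_zeros(expr):
    """Fill zeros with a similarity-weighted average over other cells.

    expr is a list of cells, each a list of expression values per gene.
    Assumption: every zero is a dropout; a real method would need to
    distinguish dropouts from true biological zeros.
    """
    n_cells = len(expr)
    # Pairwise cell-to-cell similarities, computed on the observed data.
    sims = [[cosine_similarity(expr[i], expr[j]) for j in range(n_cells)]
            for i in range(n_cells)]
    imputed = [row[:] for row in expr]  # leave the input untouched
    for i, cell in enumerate(expr):
        for g, value in enumerate(cell):
            if value != 0:
                continue  # observed values are kept as they are
            weighted_sum = 0.0
            weight_total = 0.0
            for j in range(n_cells):
                # Donors: other cells that express gene g.
                if j != i and expr[j][g] != 0:
                    weighted_sum += sims[i][j] * expr[j][g]
                    weight_total += sims[i][j]
            if weight_total > 0:
                imputed[i][g] = weighted_sum / weight_total
    return imputed


# Toy matrix: 3 cells x 3 genes, zeros stand in for dropouts.
cells = [[1, 0, 2],
         [1, 4, 2],
         [0, 0, 5]]
result = impute_zeros(cells)
```

In this toy matrix, cell 0 is missing gene 1 and the only donor is cell 1, so the imputed value equals that donor's expression. With several donors, cells more similar to the target contribute more to the average.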