Single-cell RNA-seq data imputation using Feature Propagation

Sukwon Yun,Junseok Lee,Chanyoung Park
2023-07-21
Abstract:While single-cell RNA sequencing provides an understanding of the transcriptome of individual cells, its high sparsity, often termed dropout, hampers the capture of significant cell-cell relationships. Here, we propose scFP (single-cell Feature Propagation), which directly propagates features, i.e., gene expression, especially in raw feature space, via cell-cell graph. Specifically, it first obtains a warmed-up cell-gene matrix via Hard Feature Propagation which fully utilizes known gene transcripts. Then, we refine the k-Nearest Neighbor(kNN) of the cell-cell graph with a warmed-up cell-gene matrix, followed by Soft Feature Propagation which now allows known gene transcripts to be further denoised through their neighbors. Through extensive experiments on imputation with cell clustering tasks, we demonstrate our proposed model, scFP, outperforms various recent imputation and clustering methods. The source code of scFP can be found at <a class="link-external link-https" href="https://github.com/Junseok0207/scFP" rel="external noopener nofollow">this https URL</a>.
Computational Engineering, Finance, and Science
What problem does this paper attempt to address?