scSID: A lightweight algorithm for identifying rare cell types by capturing differential expression from single-cell sequencing data

Shudong Wang,Hengxiao Li,Kuijie Zhang,Hao Wu,Shanchen Pang,Wenhao Wu,Lan Ye,Jionglong Su,Yulin Zhang
DOI: https://doi.org/10.1016/j.csbj.2023.12.043
IF: 6.155
2024-01-05
Computational and Structural Biotechnology Journal
Abstract:Single-cell RNA sequencing (scRNA-seq) is currently an important technology for identifying cell types and studying diseases at the genetic level. Identifying rare cell types is biologically important as one of the downstream data analyses of single-cell RNA sequencing. Although rare cell identification methods have been developed, most of these suffer from insufficient mining of intercellular similarities, low scalability, and being time-consuming. In this paper, we propose a single-cell similarity division algorithm (scSID) for identifying rare cells. It takes cell-to-cell similarity into consideration by analyzing both inter-cluster and intra-cluster similarities, and discovers rare cell types based on the similarity differences. We show that scSID outperforms other existing methods by benchmarking it on different experimental datasets. Application of scSID to multiple datasets, including 68K PBMC and intestine, highlights its exceptional scalability and remarkable ability to identify rare cell populations. Graphical abstract Download : Download high-res image (102KB) Download : Download full-size image
biochemistry & molecular biology
What problem does this paper attempt to address?