SoCube: an Innovative End-to-end Doublet Detection Algorithm for Analyzing Scrna-Seq Data.

Hongning Zhang,Mingkun Lu,Gaole Lin,Lingyan Zheng,Wei Zhang,Zhijian Xu,Feng Zhu
DOI: https://doi.org/10.1093/bib/bbad104
IF: 9.5
2023-01-01
Briefings in Bioinformatics
Abstract:Doublets formed during single-cell RNA sequencing (scRNA-seq) severely affect downstream studies, such as differentially expressed gene analysis and cell trajectory inference, and limit the cellular throughput of scRNA-seq. Several doublet detection algorithms are currently available, but their generalization performance could be further improved due to the lack of effective feature-embedding strategies with suitable model architectures. Therefore, SoCube, a novel deep learning algorithm, was developed to precisely detect doublets in various types of scRNA-seq data. SoCube (i) proposed a novel 3D composite feature-embedding strategy that embedded latent gene information and (ii) constructed a multikernel, multichannel CNN-ensembled architecture in conjunction with the feature-embedding strategy. With its excellent performance on benchmark evaluation and several downstream tasks, it is expected to be a powerful algorithm to detect and remove doublets in scRNA-seq data. SoCube is freely provided as an end-to-end tool on the Python official package site PyPi (https://pypi.org/project/socube/) and open-source on GitHub (https://github.com/idrblab/socube/).
What problem does this paper attempt to address?