HDSCC: A robust clustering approach for Single Cell RNA-seq data using Hyperdimensional Encoding

Maziyar Baranpouyan,Hossein Mohammadi
DOI: https://doi.org/10.1109/EMBC40787.2023.10341176
Abstract:Significant improvement in Single Cell technologies has given a hand to researchers to measure RNA expression of considerable number of Single Cells simultaneously, resulting in noticeable progress in our knowledge of cellular structure. Microfluidics-based sequencing protocols employing unique molecular identifiers (UMIs) lead to not only high-quality processing but also screening of thousands of cells. However, analysis of said data has caused challenges when it comes to processing time and computational resources as well as analyzing noisy and highly sparse data. Addressing these issues, we proposed a new method to cluster large RNA-seq datasets effectively. In our proposed approach, for having a noise robust clustering, we employed Hyper Dimensional Computing (HDC) approach to analyze Single Cell RNA sequencing data for the first time as best of our knowledge. We compared our results with state-of-the-art works on single-cell clustering and it shows promising performance and robustness in comparison to them. We performed our experiments with a 3.2-GHz CPU and 32 GB of RAM laptop.
What problem does this paper attempt to address?