KGLRR: A low-rank representation K-means with graph regularization constraint method for Single-cell type identification

Lin-Ping Wang,Jin-Xing Liu,Jun-Liang Shang,Xiang-Zhen Kong,Bo-Xin Guan,Juan Wang
DOI: https://doi.org/10.1016/j.compbiolchem.2023.107862
IF: 3.737
2023-04-05
Computational Biology and Chemistry
Abstract:Single-cell RNA sequencing technology provides a tremendous opportunity for studying disease mechanisms at the single-cell level. Cell type identification is a key step in the research of disease mechanisms. Many clustering algorithms have been proposed to identify cell types. Most clustering algorithms perform similarity calculation before cell clustering. Because clustering and similarity calculation are independent, a low-rank matrix obtained only by similarity calculation may be unable to fully reveal the patterns in single-cell data. In this study, to capture accurate single-cell clustering information, we propose a novel method based on a low-rank representation model, called KGLRR, that combines the low-rank representation approach with K-means clustering. The cluster centroid is updated as the cell dimension decreases to better from new clusters and improve the quality of clustering information. In addition, the low-rank representation model ignores local geometric information, so the graph regularization constraint is introduced. KGLRR is tested on both simulated and real single-cell datasets to validate the effectiveness of the new method. The experimental results show that KGLRR is more robust and accurate in cell type identification than other advanced algorithms.
biology,computer science, interdisciplinary applications
What problem does this paper attempt to address?