Segmentary Group-Sparsity Self-Representation Learning and Spectral Clustering Via Double L21 Norm

Deyu Zeng,Chris Ding,Zongze Wu,Xiaopin Zhong,Weixiang Liu
DOI: https://doi.org/10.1016/j.knosys.2024.111392
IF: 8.139
2024-01-01
Knowledge-Based Systems
Abstract:With the rapid expansion of data dimensions, subspace representation learning, a method for mapping high-dimensional data samples to their corresponding underlying low-dimensional subspaces, has become an essential process for high-dimensional data clustering. Although the existing methods have achieved reliable data representation learning and precise clustering, few of them realized that the corrupted data points in the dataset will influence the linear representation of the others. When there are multiple heavily corrupted data in a dataset, the matrix of the self-representation coefficient would be influenced by these data. Therefore, this paper proposes the segmentary group-sparsity self-representation learning (SGSSL) and segmentary group-sparsity-based spectral clustering (SGSSC) models to eliminate their influence on representation learning and clustering results. We proposed that imposing varying degrees of row sparsity and column sparsity constraints on the representation coefficient matrix can prevent corrupted data from contaminating other data during the self-representation process, thus obtaining better spectral clustering results. Extensive experiments on several real datasets demonstrate that our proposed method can perform better than several related methods in recent years.
What problem does this paper attempt to address?