Sparse subspace clustering based on L0 constraint

Hui Shuai, Xiaotong Yuan, Qingshan Liu
DOI: https://doi.org/10.13232/j.cnki.jnju.2018.01.003
2018-01-01
Abstract: With the rapid growth in the volume and dimensionality of data, processing high-dimensional data has become a central and difficult problem for cluster analysis in the era of big data. Subspace clustering is an important approach to high-dimensional data clustering because the data in a class or category lie in a low-dimensional subspace of the ambient space. Sparse Subspace Clustering (SSC), proposed by Elhamifar, finds sparse representations of data distributed in a union of low-dimensional subspaces. SSC solves for the sparse self-expression coefficients of the data matrix under an L1-norm constraint via the Alternating Direction Method of Multipliers (ADMM) and builds the Laplacian matrix of the data. The data are then assigned to categories by spectral clustering. However, ADMM has many parameters to tune and converges slowly. These drawbacks prevent SSC from handling large-scale datasets efficiently. To address these problems, we propose a sparse subspace clustering algorithm based on an L0 constraint. The proposed method solves the sparse self-expression reconstruction problem under an L0-norm constraint using Orthogonal Matching Pursuit (OMP). OMP finds the sparse representation of each data point as a linear combination of the other data points in a direct and efficient way. The sparse self-expression coefficients obtained by OMP are converted into a similarity matrix, and spectral clustering is then applied to this similarity matrix to obtain the clustering result. To further reduce the computational complexity of OMP, we also optimize it by exploiting the correlation between consecutive iterations, which improves the efficiency of our algorithm. Experiments on synthetic data and the Extended Yale B database demonstrate that the proposed L0-constrained sparse subspace clustering is significantly more efficient than SSC while achieving comparable accuracy.
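The pipeline described in the abstract (OMP self-expression followed by a similarity matrix for spectral clustering) can be sketched roughly as follows. This is a minimal illustration using plain OMP with NumPy, not the authors' implementation: the function names `omp_self_expression` and `affinity`, the sparsity level `k`, the tolerance `tol`, and the toy data are all assumptions introduced here, and the accelerated OMP variant mentioned in the abstract is not reproduced.

```python
import numpy as np


def omp_self_expression(X, k, tol=1e-6):
    """Plain OMP self-expression (illustrative sketch, not the paper's code).

    X   : (d, n) array whose columns are data points (assumed unit-norm).
    k   : assumed maximum number of nonzero coefficients per point.
    tol : assumed residual-norm threshold for early stopping.
    Returns C of shape (n, n) with zero diagonal and X[:, j] ~= X @ C[:, j].
    """
    d, n = X.shape
    C = np.zeros((n, n))
    for j in range(n):
        x = X[:, j]
        residual = x.copy()
        support = []
        coef = np.zeros(0)
        for _ in range(k):
            corr = np.abs(X.T @ residual)
            corr[j] = -1.0              # a point may not represent itself
            if support:
                corr[support] = -1.0    # skip atoms that were already chosen
            support.append(int(np.argmax(corr)))
            # Least-squares fit of x on the atoms selected so far.
            coef, *_ = np.linalg.lstsq(X[:, support], x, rcond=None)
            residual = x - X[:, support] @ coef
            if np.linalg.norm(residual) < tol:
                break
        C[support, j] = coef
    return C


def affinity(C):
    """Symmetric, non-negative similarity matrix built from the coefficients."""
    A = np.abs(C)
    return A + A.T


# Toy data (assumption): two orthogonal 2-D subspaces of R^6, 10 points each.
rng = np.random.default_rng(0)
X1 = np.zeros((6, 10))
X1[0:2] = rng.standard_normal((2, 10))
X2 = np.zeros((6, 10))
X2[2:4] = rng.standard_normal((2, 10))
X = np.hstack([X1, X2])
X /= np.linalg.norm(X, axis=0)          # normalise columns to unit length

C = omp_self_expression(X, k=3)
A = affinity(C)
```

The matrix `A` would then be passed to a spectral clustering routine, for example scikit-learn's `SpectralClustering` with `affinity="precomputed"`, to obtain the cluster labels. On this toy data the two subspaces are orthogonal, so each point is represented only by points from its own subspace and `A` is block diagonal.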