Downsampling for Sparse Subspace Clustering.

Xianghui Mao,Xiaohan Wang,Yuantao Gu
DOI: https://doi.org/10.1109/icassp.2015.7178683
2015-01-01
Abstract:Sparse subspace clustering (SSC) is a technique to partition unlabeled samples according to the subspaces they locate in. With the rapid increase of data amount, efficiently downsampling a big dataset, while at the same time keeping the structure of subspaces, becomes an important topic for SSC. In order to reduce the computational cost while preserving clustering accuracy, a new approach of SSC with downsampling (SSCD) is proposed in this paper. In SSCD, the numbers of samples located in respective subspaces are estimated utilizing the ℓ 1 norm of the sparse representation. Then a downsampling strategy is designed to decimate samples with the probabilities that are in reverse ratio to the amounts of samples in respective subspaces. As a consequence, the samples in different subspaces are expected to be balanced after the downsampling operation. Theoretical analysis proves the correctness of the proposed strategy. Numerical simulations also verify the efficiency of SSCD.
What problem does this paper attempt to address?