Fast Online $L_0$ Elastic Net Subspace Clustering via A Novel Dictionary Update Strategy

Wentao Qu,Lingchen Kong,Linglong Kong,Bei Jiang
2024-12-10
Abstract:With the rapid growth of data volume and the increasing demand for real-time analysis, online subspace clustering has emerged as an effective tool for processing dynamic data streams. However, existing online subspace clustering methods often struggle to capture the complex and evolving distribution of such data due to their reliance on rigid dictionary learning mechanisms. In this paper, we propose a novel $\ell_0$ elastic net subspace clustering model by integrating the $\ell_0$ norm and the Frobenius norm, which owns the desirable block diagonal property. To address the challenges posed by the evolving data distributions in online data, we design a fast online alternating direction method of multipliers with an innovative dictionary update strategy based on support points, which are a set of data points to capture the underlying distribution of the data. By selectively updating dictionary atoms according to the support points, the proposed method can dynamically adapt to the evolving data characteristics, thereby enhancing both adaptability and computational efficiency. Moreover, we rigorously prove the convergence of the algorithm. Finally, extensive numerical experiments demonstrate that the proposed method improves clustering performance and computational efficiency, making it well-suited for real-time and large-scale data processing tasks.
Optimization and Control
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key challenges in online subspace clustering, especially when dealing with dynamic data streams. Specifically, existing online subspace clustering methods have difficulties in capturing complex and changing data distributions, mainly because they rely on rigid dictionary - learning mechanisms. To solve these problems, the authors propose a novel ℓ₀ - elastic net subspace clustering model (ℓ₀ - ENSC) and design an innovative dictionary - updating strategy by introducing support points. #### Main problems and challenges 1. **Limitations of existing methods**: - Existing online subspace clustering methods are usually difficult to capture the complex and constantly changing distributions in dynamic data streams. - These methods rely on rigid dictionary - learning mechanisms, which limit their adaptability and efficiency. 2. **Processing of dynamic data streams**: - Dynamic data streams are characterized by large amounts of data and strong real - time performance, requiring algorithms to respond quickly and adapt to new data. - Online subspace clustering needs to process incrementally arriving data with limited memory and be able to detect and handle data corruption to ensure robustness. 3. **Improvement of dictionary - updating strategies**: - The dictionary - updating rules of existing methods fail to fully reflect the true distribution of data, limiting their ability to represent data effectively. - A more flexible and efficient dictionary - updating strategy is needed to better capture the underlying structure of data. #### Proposed solutions 1. **ℓ₀ - elastic net subspace clustering model (ℓ₀ - ENSC)**: - By combining the ℓ₀ - norm and the Frobenius norm, a new model with block - diagonal properties is proposed. - The model not only performs variable selection but also can effectively group highly correlated samples, enhancing the interpretability of the model and reducing complexity. 2. **Dictionary - updating strategy based on support points**: - The concept of support points is introduced to capture the basic distribution characteristics of data. - By selectively updating dictionary atoms, the algorithm can dynamically adapt to changes in data characteristics, thereby improving adaptability and computational efficiency. 3. **Fast online alternating direction method of multipliers (ADMM)**: - A fast online ADMM algorithm is designed, which combines the dictionary - updating strategy with the ADMM framework. - The convergence of the algorithm is proved, and its superior performance on six public data sets is verified through experiments. #### Summary The main contribution of this paper lies in proposing a novel ℓ₀ - elastic net subspace clustering model and designing an innovative dictionary - updating strategy by introducing support points, which solves the limitations of existing online subspace clustering methods when dealing with dynamic data streams. These improvements significantly improve the adaptability and computational efficiency of the algorithm, making it more suitable for real - time and large - scale data processing tasks.