Enhancing Unsupervised Feature Selection via Double Sparsity Constrained Optimization

Xianchao Xiu,Anning Yang,Chenyi Huang,Xinrong Li,Wanquan Liu
2025-01-01
Abstract:Unsupervised feature selection (UFS) is widely applied in machine learning and pattern recognition. However, most of the existing methods only consider a single sparsity, which makes it difficult to select valuable and discriminative feature subsets from the original high-dimensional feature set. In this paper, we propose a new UFS method called DSCOFS via embedding double sparsity constrained optimization into the classical principal component analysis (PCA) framework. Double sparsity refers to using $\ell_{2,0}$-norm and $\ell_0$-norm to simultaneously constrain variables, by adding the sparsity of different types, to achieve the purpose of improving the accuracy of identifying differential features. The core is that $\ell_{2,0}$-norm can remove irrelevant and redundant features, while $\ell_0$-norm can filter out irregular noisy features, thereby complementing $\ell_{2,0}$-norm to improve discrimination. An effective proximal alternating minimization method is proposed to solve the resulting nonconvex nonsmooth model. Theoretically, we rigorously prove that the sequence generated by our method globally converges to a stationary point. Numerical experiments on three synthetic datasets and eight real-world datasets demonstrate the effectiveness, stability, and convergence of the proposed method. In particular, the average clustering accuracy (ACC) and normalized mutual information (NMI) are improved by at least 3.34% and 3.02%, respectively, compared with the state-of-the-art methods. More importantly, two common statistical tests and a new feature similarity metric verify the advantages of double sparsity. All results suggest that our proposed DSCOFS provides a new perspective for feature selection.
Optimization and Control,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: Most of the existing Unsupervised Feature Selection (UFS) methods only consider single sparsity, which makes it difficult to select valuable and discriminative feature subsets from the original high - dimensional feature set. Specifically: 1. **Single Sparsity Limitation**: Most existing methods use only one type of sparsity constraint (such as \(\ell_2,0\)-norm or \(\ell_0\)-norm), which makes it difficult for them to effectively remove irrelevant, redundant and noisy features when dealing with complex data. 2. **Poor Feature Selection Effect**: Due to the lack of multi - type sparsity constraints, the accuracy of existing methods in identifying differential features is insufficient, thus affecting the effect of feature selection. To solve these problems, the author proposes a new UFS method - DSCOFS (Double Sparsity Constrained Optimization for Feature Selection). This method enhances the effect of feature selection by embedding double - sparsity - constrained optimization in the classical Principal Component Analysis (PCA) framework. Double sparsity means using \(\ell_2,0\)-norm and \(\ell_0\)-norm to constrain variables simultaneously to improve the accuracy and discrimination of feature selection. Specifically: - **\(\ell_2,0\)-norm**: It is used to remove irrelevant and redundant features and ensure global structural sparsity. - **\(\ell_0\)-norm**: It is used to filter irregular noisy features and ensure local element sparsity. By combining these two sparsity constraints, DSCOFS can select valuable features more effectively and shows better clustering accuracy and Normalized Mutual Information (NMI) on multiple experimental data sets. The experimental results show that, compared with existing methods, DSCOFS improves the average Clustering Accuracy (ACC) and NMI by at least 3.34% and 3.02% respectively. In summary, this paper aims to improve the performance of unsupervised feature selection by introducing double - sparsity - constrained optimization, so as to better meet the feature selection challenges in high - dimensional data.