Feature Subspace Learning-based Binary Differential Evolution Algorithm for Unsupervised Feature Selection

Tao Li,Yuhua Qian,Feijiang Li,Xinyan Liang,Zhi-hui Zhan
DOI: https://doi.org/10.1109/tbdata.2024.3378090
2024-01-01
IEEE Transactions on Big Data
Abstract:It is a challenging task to select the informative features that can maintain the manifold structure in the original feature space. Many unsupervised feature selection methods still suffer the poor cluster performance in the selected feature subset. To tackle this problem, a feature subspace learning-based binary differential evolution algorithm is proposed for unsupervised feature selection. Firstly, a new unsupervised feature selection framework based on evolutionary computation is designed, in which the feature subspace learning and the population search mechanism are combined into a unified unsupervised feature selection. Secondly, a local manifold structure learning strategy and a sample pseudo-label learning strategy are presented to calculate the importance of the selected feature subspace. Thirdly, the binary differential evolution algorithm is developed to optimize the selected feature subspace, in which the binary information migration mutation operator and the adaptive crossover operator are designed to promote the searching for the global optimal feature subspace. Experimental results on various types of realworld datasets demonstrate that the proposed algorithm can obtain more informative feature subset and competitive cluster performance compared with eight state-of-the-art unsupervised feature selection methods.
computer science, information systems, theory & methods
What problem does this paper attempt to address?