Multi-label Feature Selection Based on Label Correlations and Feature Redundancy

Yuling Fan,Baihua Chen,Weiqin Huang,Jinghua Liu,Wei Weng,Weiyao Lan
DOI: https://doi.org/10.1016/j.knosys.2022.108256
IF: 8.139
2022-01-01
Knowledge-Based Systems
Abstract:The task of multi-label feature selection (MLFS) is to reduce redundant information and generate the optimal feature subset from the original multi-label data. A variety of MLFS methods utilize pseudo-label matrix to explore label correlations for identifying the most informative features. Moreover, some methods consider feature redundancy by virtue of information theory technique, but no prior literature unites them in a framework to perform feature selection. To remedy the deficiency, we propose a novel MLFS method based on label correlations and feature redundancy, namely LFFS. To be specific, we first utilize the ridge regression to create a feature selection matrix and a low dimensional embedding, and impose ℓ2,1-norm on the feature selection matrix. Then, the low-dimensional embedding is devoted to mine label correlations, which can keep the global and local structure of original label space. Finally, cosine similarity is employed to analyze feature redundancy, so as to generate a low redundancy feature subset. By virtue of the above process, we design an objective function followed with an optimization solution. Comprehensive experiments results demonstrate the effectiveness and superiority of the proposed method LFFS among ten competition methods.
What problem does this paper attempt to address?