Abstract: Feature selection, as an important pre-processing technique, can efficiently mitigate the "curse of dimensionality" by selecting discriminative features. In multi-label learning in particular, a discriminative feature subset can improve classification accuracy. Existing feature selection methods for multi-label classification address the problem of label ambiguity with logical labels. However, in many practical applications the labels differ in significance. Training the model on logical labels may therefore yield unsatisfactory performance, because the importance of each related label to each sample is not considered. To address this issue, a novel multi-label feature selection algorithm is proposed with two steps: label enhancement, and label correlations-based feature selection on the enhanced labels. In the label enhancement step, a deep forest-based label enhancement framework is used to transform the logical labels into label distributions, which contain rich semantic information and guide a more accurate exploration of semantic correlations. In the feature selection step, a novel multi-label feature selection algorithm is proposed for label distribution data. First, the samples are divided into multiple clusters by spectral clustering in the label space. Then, the label correlations are captured through these clusters. Finally, the l2,1-norm is used to construct an objective function that performs multi-label feature selection. Experimental results on eighteen benchmark datasets demonstrate the competitiveness of the proposed algorithm against six state-of-the-art multi-label feature selection algorithms in terms of six widely accepted evaluation metrics.
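To illustrate the role of the l2,1-norm mentioned above, the following is a minimal sketch, not the objective function of the proposed algorithm. It assumes a hypothetical weight matrix W with one row per feature and one column per label. The l2,1-norm is the sum of the Euclidean norms of the rows, so penalizing it tends to drive entire rows to zero. Features can then be ranked by their row norms. The names `l21_norm` and `rank_features` and the values in `W` are assumptions made for illustration only.

```python
import math


def l21_norm(W):
    """Return the l2,1-norm of W: the sum of the Euclidean norms of its rows."""
    return sum(math.sqrt(sum(w * w for w in row)) for row in W)


def rank_features(W):
    """Return feature indices sorted by descending row norm of W."""
    norms = [math.sqrt(sum(w * w for w in row)) for row in W]
    return sorted(range(len(W)), key=lambda i: norms[i], reverse=True)


# Hypothetical weight matrix: 4 features (rows) x 2 labels (columns).
W = [
    [3.0, 4.0],  # row norm 5
    [0.0, 0.0],  # row norm 0: this feature contributes to no label
    [1.0, 0.0],  # row norm 1
    [0.0, 2.0],  # row norm 2
]

total = l21_norm(W)       # 5 + 0 + 1 + 2
order = rank_features(W)  # features from most to least important
```

In this toy example the second feature has an all-zero row, so it would be discarded first. A feature selection method based on the l2,1-norm would typically keep the top-ranked features in `order`.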
