ReliefF-based Multi-label Feature Selection

Yaping Cai,Ming Yang,Yang Gao,Hujun Yin
DOI: https://doi.org/10.14257/ijdta.2015.8.4.31
2015-01-01
International Journal of Database Theory and Application
Abstract:In recent years, multi-label learning has been used to deal with data attributed to multiple labels simultaneously and has been increasingly applied to various applications.As many other machine learning tasks, multi-label learning also suffers from the curse of dimensionality; so extracting good features using multiple labels of the datasets becomes an important step prior to classification.In this paper, we study the problem of multilabel feature selection for classification and have proposed a method based on single label feature selection ReliefF, termed ML-ReliefF, to select discriminant features in order to boost multi-label classification accuracy.Compared to other multi-label feature selection methods that only consider the relationship between pairwise classes, the proposed method introduces the concept of label set to further consider the relationship among more than two labels, modifies the regulation of the nearest neighbors computation reflecting the influence between samples and multiple labels, and considers and adds the similarity between samples to reinforce the effect.With the classifier, ML-kNN, experiments on five different datasets show that the proposed method is effective in removing irrelevant or redundant features and the selected features are more discriminant for classification.
What problem does this paper attempt to address?