Unified dual-label semi-supervised learning with top-<i>k</i> feature selection

Han Zhang,Maoguo Gong,Feiping Nie,Xuelong Li
DOI: https://doi.org/10.1016/j.neucom.2022.05.090
IF: 6
2022-01-01
Neurocomputing
Abstract:Semi-supervised feature selection alleviates the annotation burden of supervised feature learning by exploiting data under a handful of supervision information. The mainstream technique is to employ a lin-ear regression framework that jointly learns labeled and unlabeled samples. However, existing approaches always encounter the deficiencies in two aspects: 1) the performance of models are severely degenerated once predicted labels are unreliable; 2) the balance of objectives in regards to two types of data are not well considered. In the article, we propose unified dual-label semi-supervised learning for top -k feature selection. The technique defines a soft label matrix to indicate the probability of samples belonging to each class. From the probability, the model could recognize unclassifiable samples that lay around the boundaries. Meanwhile, the label matrix is equipped with an exponent parameter c. It endows the soft labels dual effects that the labeled and unlabeled data are tactfully discriminated. For the purpose of feature selection, we impose the l(2;0)-norm constraint on the projection matrix, such that the exact top -k features are picked out. An iteration algorithm is designed to solve the given problem, by which large-scale data are facilely tackled. We conduct experiments that validate the superiority of the proposed method against the state-of-the-art competitors. (C) 2022 Published by Elsevier B.V.
What problem does this paper attempt to address?