A General Non-Parametric Active Learning Framework for Classification on Multiple Manifolds.

Lei Huang, Yuqing Ma, Xianglong Liu
DOI: https://doi.org/10.1016/j.patrec.2019.01.013
IF: 4.757
2019-01-01
Pattern Recognition Letters
Abstract: Active learning is an important paradigm for investigating learners' behavior and reducing labeling costs. We propose a novel non-parametric active learning framework that uses label propagation to sense the potential data clusters/manifolds in the feature space and minimizes global uncertainty so that unexplored clusters/manifolds are investigated when querying examples. Based on this framework, it is convenient to design new active learning algorithms for targeted problems. Furthermore, we analyze the sample selection mechanism of our proposed method and provide a formal proof. When selecting informative examples, our method has the following characteristics: (1) in each iteration, examples are primarily chosen from a cluster that contains unlabeled samples; (2) if more than one cluster contains unlabeled samples, the method chooses from the one containing the most samples; (3) within the same cluster, the example with the closest connection to the others is selected preferentially. The designed algorithms achieve empirical success in multi-class classification and dramatically reduce labeling costs on several real-world datasets. (C) 2019 Elsevier B.V. All rights reserved.
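The selection behavior described in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a Gaussian affinity graph, a standard clamped iterative label propagation, and a stand-in query score (uncertainty-weighted connectivity) in place of the paper's global-uncertainty minimization. All function names, the `sigma` parameter, and the toy data are hypothetical.

```python
import numpy as np

def rbf_affinity(X, sigma=1.0):
    # Gaussian affinities from pairwise squared distances, zero diagonal.
    sq = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    return W

def propagate(W, labeled_idx, labels, n_classes, n_iter=200):
    # Iterative label propagation; labeled points are clamped to one-hot rows.
    labeled_idx = np.asarray(labeled_idx, dtype=int)
    Y = np.eye(n_classes)[np.asarray(labels, dtype=int)]
    P = W / W.sum(axis=1, keepdims=True)  # row-stochastic transition matrix
    F = np.full((W.shape[0], n_classes), 1.0 / n_classes)
    for _ in range(n_iter):
        F[labeled_idx] = Y
        F = P @ F
    F[labeled_idx] = Y
    return F

def entropy(F):
    # Per-sample predictive entropy of the soft labels.
    return -np.sum(F * np.log(F + 1e-12), axis=1)

def select_query(W, F, labeled_idx):
    # Assumed stand-in score: how strongly a candidate is connected to
    # uncertain samples. Labeled points are excluded from selection.
    labeled_idx = np.asarray(labeled_idx, dtype=int)
    H = entropy(F)
    H[labeled_idx] = 0.0
    score = W @ H
    score[labeled_idx] = -np.inf
    return int(np.argmax(score))

# Toy data (hypothetical): two well-separated clusters of five points each,
# with a single labeled point in the first cluster.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 0.3, (5, 2)),
               rng.normal(0.0, 0.3, (5, 2)) + np.array([10.0, 0.0])])
W = rbf_affinity(X)
F = propagate(W, labeled_idx=[0], labels=[0], n_classes=2)
query = select_query(W, F, labeled_idx=[0])
```

In this toy setting the propagated labels become confident inside the cluster that already has a label, while the other cluster stays close to uniform, so the score favors a well-connected point in the unexplored cluster. That mirrors characteristics (1) and (3) only qualitatively.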