Abstract:Multi-label active learning addresses the scarce labeled example problem by querying the most valuable unlabeled examples, or example-label pairs, to achieve a better performance with limited query cost. Current multi-label active learning methods require the scrutiny of the whole example in order to obtain its annotation. In contrast, one can find positive evidence with respect to a label by examining specific patterns (i.e., subexample), rather than the whole example, thus making the annotation process more efficient. Based on this observation, we propose a novel two-stage cost effective multi-label active learning framework, called CMAL. In the first stage, a novel example-label pair selection strategy is introduced. Our strategy leverages label correlation and label space sparsity of multi-label examples to select the most uncertain example-label pairs. Specifically, the unknown relevant label of an example can be inferred from the correlated labels that are already assigned to the example, thus reducing the uncertainty of the unknown label. In addition, the larger the number of relevant examples of a particular label, the smaller the uncertainty of the label is. In the second stage, CMAL queries the most plausible positive subexample-label pairs of the selected example-label pairs. Comprehensive experiments on multi-label datasets collected from different domains demonstrate the effectiveness of our proposed approach on cost effective queries. We also show that leveraging label correlation and label sparsity contribute to saving costs.

Gaussian Process Versus Margin Sampling Active Learning

Active Learning of Gaussian Processes with Manifold-Preserving Graph Reduction

Improved Margin Sampling for Active Learning.

Active Learning Methods with Deep Gaussian Processes.

Uncertainty-Based Active Learning Via Sparse Modeling for Image Classification

Margin-based sampling in high dimensions: When being active is less efficient than staying passive

Learning Distinctive Margin Toward Active Domain Adaptation

Promoting Active Learning with Mixtures of Gaussian Processes

Active Learning with Weak Supervision for Gaussian Processes

Active Probabilistic Sample Selection for Intelligent Soft Sensing of Industrial Processes

Nearest Neighbor Classifier with Margin Penalty for Active Learning

Active Learning Guided by Efficient Surrogate Learners

Online active classification via margin-based and feature-based label queries

Sparse Gaussian Processes with Manifold-Preserving Graph Reduction

Active learning on manifolds

Active Learning with Label Quality Control

Cost-Accuracy Aware Adaptive Labeling for Active Learning

Exemplar Guided Active Learning

Using methods from dimensionality reduction for active learning with low query budget

Cost Effective Multi-label Active Learning Via Querying Subexamples

Gaussian Switch Sampling: A Second Order Approach to Active Learning