Deep Active Learning via Open Set Recognition

Jaya Krishna Mandivarapu,Blake Camp,Rolando Estrada
DOI: https://doi.org/10.48550/arXiv.2007.02196
2021-04-06
Abstract:In many applications, data is easy to acquire but expensive and time-consuming to label prominent examples include medical imaging and NLP. This disparity has only grown in recent years as our ability to collect data improves. Under these constraints, it makes sense to select only the most informative instances from the unlabeled pool and request an oracle (e.g., a human expert) to provide labels for those samples. The goal of active learning is to infer the informativeness of unlabeled samples so as to minimize the number of requests to the oracle. Here, we formulate active learning as an open-set recognition problem. In this paradigm, only some of the inputs belong to known classes; the classifier must identify the rest as unknown. More specifically, we leverage variational neural networks (VNNs), which produce high-confidence (i.e., low-entropy) predictions only for inputs that closely resemble the training data. We use the inverse of this confidence measure to select the samples that the oracle should label. Intuitively, unlabeled samples that the VNN is uncertain about are more informative for future training. We carried out an extensive evaluation of our novel, probabilistic formulation of active learning, achieving state-of-the-art results on MNIST, CIFAR-10, and CIFAR-100. Additionally, unlike current active learning methods, our algorithm can learn tasks without the need for task labels. As our experiments show, when the unlabeled pool consists of a mixture of samples from multiple datasets, our approach can automatically distinguish between samples from seen vs. unseen tasks.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively select the most worthy - to - be - labeled samples in the unlabeled data pool through Active Learning (AL) to minimize the number of requests to the oracle when data labeling is expensive and time - consuming. Specifically, the paper proposes a new method to treat the active learning problem as an Open - Set Recognition (OSR) problem. In OSR, only some of the inputs belong to known classes, and the classifier must identify the remaining inputs as unknown classes. This method is especially suitable for when the unlabeled data pool contains mixed samples from multiple datasets and can automatically distinguish samples of seen tasks from those of unseen tasks. The main contributions of the paper are as follows: 1. **Proposing a new active learning framework**: Using Variational Neural Networks (VNNs) as classifiers, this network only produces high - confidence (low - entropy) predictions for inputs highly similar to the training data. By using the inverse of this confidence to select the next unlabeled sample to be queried, that is, selecting the sample that the classifier is most uncertain about for labeling. 2. **Improving model performance**: On multiple image classification datasets such as MNIST, CIFAR - 10 and CIFAR - 100, this method has achieved state - of - the - art results. 3. **Enhancing robustness**: Experiments show that this method exhibits good robustness in cases of skewed initial labeled pools, different budget sizes, noisy labels, and the presence of samples from unseen distributions. In conclusion, the paper aims to provide a more efficient and robust sample selection mechanism by combining active learning with open - set recognition, thereby reducing the need for a large amount of labeled data, especially in fields such as medical imaging and natural language processing where data acquisition is relatively easy but labeling costs are high.