Abstract:Active learning reduces the annotation cost of machine learning by selecting and querying informative unlabeled samples. Semi-supervised active learning methods can considerably utilize the regional information of unlabeled samples, and thus, more effectively select valuable samples. Existing semi-supervised batch active learning algorithms frequently exhibit poor robustness due to their high computational complexity, making handling large-scale datasets a difficult task. However, existing active learning algorithms based on high-performance semi-supervised learners adopt a single-sample selection mode, under which the model requires multiple rounds of iterative processes, significantly reducing the overall efficiency of the algorithm and affecting its practicality. To address these issues, we propose a new semi-supervised batch active learning algorithm called approximate error reduction based on mutual information (MIAER). First, we use hierarchical anchor graph regularization (HAGR) as the semi-supervised learner. HAGR exhibits good robustness and only involves a small-scale reduced Laplacian matrix in its optimization process, enabling rapid processing of large-scale datasets. Second, we propose a batch sampling strategy based on mutual information and error reduction in the sample selection stage. This strategy, which is based on hierarchical anchor graphs, first measures the uncertainty of samples by using approximate error reduction, considerably reducing computational overhead. Then, it uses mutual information to measure the diversity of samples in category space while removing redundant batch samples, preserving samples with high uncertainty as much as possible. Comparative experiments with several advanced active learning methods on a large number of datasets fully demonstrate the effectiveness and stability of MIAER.

K-means Based on Active Learning for Support Vector Machine.

Semi-supervised SVM Batch Mode Active Learning for Image Retrieval

Semisupervised SVM Batch Mode Active Learning with Applications to Image Retrieval

Dual-Classifier Collaborative Method Based on Semi-Supervised Active Learning

Hierarchical exploration based active learning with support vector machine

Column Subset Selection for Active Learning in Image Classification

A novel batch-mode active learning method for SVM classifier

ASCENT: Active Supervision for Semi-Supervised Learning.

K-SVM: an Effective SVM Algorithm Based on K-means Clustering

Semi-supervised batch active learning based on mutual information

Active Semi-supervised K-Means Clustering Based on Silhouette Coefficient

Semi-Supervised Active Learning for Support Vector Machines: A Novel Approach that Exploits Structure Information in Data

Active Semi-Supervised Clustering Based on Multi-View Learning

Image retrieval method based on SVM and active learning

The Application of Active Query K-Means in Text Classification

Semi-supervised clustering method based on active learning strategy

Asymmetric semi-supervised boosting for SVM active learning in CBIR

Active learning with adaptive regularization

Multiple Kernel Active Learning for Image Classification

A General Non-Parametric Active Learning Framework for Classification on Multiple Manifolds.

Image Retrieval Method Based on SVMs and Active Learning