Abstract: We study a generalization of classical active learning to real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such decision rules converge uniformly to the smallest possible uncertainty obtainable from the accessible data. We demonstrate their strong sample efficiency in two key applications: active fine-tuning of large neural networks and safe Bayesian optimization, where they achieve state-of-the-art performance.

What problem does this paper attempt to address?

The paper asks how to learn a target function \(f\) on a target space \(A\) when data can be sampled only from a restricted sample space \(S\). Specifically, it studies how active learning can minimize the uncertainty about \(f\) on \(A\) when \(A\) lies partially or entirely outside \(S\). This differs from traditional (global) active learning, which usually assumes that the target space and the sample space coincide, that is, \(A = S\).

### Main Contributions

1. **Theoretical Analysis**:
   - The paper gives, for the first time, the rate at which the uncertainty on the target space \(A\) converges uniformly to the smallest value attainable from samples in the sample space \(S\) (Theorems 3.2 and 3.3). These results provide a theoretical basis for minimizing posterior uncertainty in transductive active learning and indicate that transductive active learning may be more sample-efficient than inductive active learning.
2. **Application Demonstration**:
   - The paper demonstrates transductive active learning in two key applications: batch-wise active fine-tuning of large neural networks and safe Bayesian optimization. In both, it achieves state-of-the-art performance.

### Specific Problem Description

- **Target Space \(A\) and Sample Space \(S\)**: Given a domain \(X\), \(A \subseteq X\) is the target space and \(S \subseteq X\) is the sample space. The goal is to select sample points in \(S\) that reduce the uncertainty about \(f\) on \(A\) as much as possible.
- **Adaptive Decision Rules**: The paper studies a class of adaptive decision rules that choose sample points so as to minimize the uncertainty on the target space \(A\). The main decision rules are Information-based Transductive Learning (ITL) and Variance-based Transductive Learning (VTL).
  - **ITL**: Select the sample point \(x_n\) that maximizes the information gain \(I(f_A; y_x \mid D_{n-1})\) about the target space \(A\).
  - **VTL**: Select the sample point \(x_n\) that minimizes the total posterior variance \(\text{Tr}[\text{Var}[f_A \mid D_{n-1}, y_x]]\) on the target space \(A\).

### Experimental Verification

- **Gaussian Process Setup**: Under a Gaussian process (GP) assumption, the paper demonstrates that ITL and VTL reduce the uncertainty on the target space \(A\) and improve prediction accuracy.
- **Neural Network Fine-tuning**: The paper runs experiments on the MNIST and CIFAR-100 datasets and shows that ITL and VTL can fine-tune neural networks effectively from a small number of samples, especially when the target distribution \(P_A\) differs from the training distribution.
- **Safe Bayesian Optimization**: The paper also applies transductive active learning to safe Bayesian optimization and demonstrates its advantages in exploring the safe parameter space and optimizing the target function.

### Conclusion

Through theoretical analysis and experiments, the paper shows that transductive active learning is effective in practical problems, especially when the target space and the sample space do not fully coincide. These results suggest new directions and methods for future active learning research.
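The ITL and VTL rules described above have closed forms when \(f\) is modeled as a Gaussian process. The following is a minimal sketch of both rules, not the authors' implementation. It assumes a finite one-dimensional grid as the domain, an RBF kernel, and homoscedastic Gaussian observation noise. All names (`rbf_kernel`, `condition_on`, `itl_scores`, `vtl_scores`, `run`) and all parameter values are hypothetical choices made for illustration.

```python
import numpy as np

def rbf_kernel(xs, lengthscale=0.2):
    """Prior covariance of f on the grid xs (assumed RBF kernel)."""
    d = xs[:, None] - xs[None, :]
    return np.exp(-0.5 * (d / lengthscale) ** 2)

def condition_on(K, i, noise_var):
    """Posterior covariance after one noisy observation at grid index i."""
    k = K[:, i]
    return K - np.outer(k, k) / (K[i, i] + noise_var)

def itl_scores(K, S, A, noise_var, jitter=1e-8):
    """Information gain I(f_A; y_x | D) for every candidate x in S.

    For jointly Gaussian variables this equals
    0.5 * log(Var[y_x | D] / Var[y_x | D, f_A]).
    """
    K_AA = K[np.ix_(A, A)] + jitter * np.eye(len(A))
    K_AS = K[np.ix_(A, S)]
    var_y = K[S, S] + noise_var                      # Var[y_x | D]
    explained = np.sum(K_AS * np.linalg.solve(K_AA, K_AS), axis=0)
    var_y_given_fA = np.maximum(var_y - explained, noise_var)
    return 0.5 * np.log(var_y / var_y_given_fA)

def vtl_scores(K, S, A, noise_var):
    """Reduction of Tr Var[f_A] obtained by observing y_x, for x in S.

    Maximizing this reduction is the same as minimizing
    Tr Var[f_A | D, y_x].
    """
    K_AS = K[np.ix_(A, S)]
    return np.sum(K_AS ** 2, axis=0) / (K[S, S] + noise_var)

def run(score_fn, K0, S, A, noise_var, n_steps):
    """Greedily pick n_steps points from S; track total variance on A."""
    K = K0.copy()
    picks = []
    total_var = [float(np.trace(K[np.ix_(A, A)]))]
    for _ in range(n_steps):
        scores = score_fn(K, S, A, noise_var)
        i = int(S[np.argmax(scores)])
        K = condition_on(K, i, noise_var)
        picks.append(i)
        total_var.append(float(np.trace(K[np.ix_(A, A)])))
    return picks, total_var

# Toy setup: sampling is only allowed on [0, 0.5], targets lie in [0.6, 1].
xs = np.linspace(0.0, 1.0, 41)
S = np.arange(0, 21)          # sample space (grid indices)
A = np.arange(24, 41, 4)      # target space (grid indices), disjoint from S
noise_var = 0.01
K0 = rbf_kernel(xs)

for name, fn in [("ITL", itl_scores), ("VTL", vtl_scores)]:
    picks, total_var = run(fn, K0, S, A, noise_var, n_steps=5)
    print(name, "picked x =", np.round(xs[picks], 3),
          "| total variance on A:", np.round(total_var, 4))
```

Because the posterior covariance of a GP does not depend on the observed values, the sketch never needs to simulate \(y_x\); it only tracks how each chosen point shrinks the variance on \(A\). Candidates are restricted to `S` throughout, which is what separates this setting from global active learning with \(A = S\).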

Transductive Active Learning: Theory and Applications

Active Few-Shot Fine-Tuning

Learning Distinctive Margin Toward Active Domain Adaptation

Learning to Extrapolate: A Transductive Approach

Deep Active Learning with Adaptive Acquisition

Deep Active Learning by Leveraging Training Dynamics

Deconstructing deep active inference

Targeted Active Learning for Bayesian Decision-Making

Heuristic improvement for active learning using localized generalization error as selection criterion

Learning with Minimum Supervision: A General Framework for Transductive Transfer Learning

Learning to View: Decision Transformers for Active Object Detection

Deconstructing Deep Active Inference: A Contrarian Information Gatherer

Learning Generative State Space Models for Active Inference

Advancing Deep Active Learning & Data Subset Selection: Unifying Principles with Information-Theory Intuitions

Active Expansion Sampling for Learning Feasible Domains in an Unbounded Input Space

Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation

Active Learning amidst Logical Knowledge

Active Learning based on Data Uncertainty and Model Sensitivity

Active Learning Over Multiple Domains in Natural Language Tasks

Active Representation Learning for General Task Space with Applications in Robotics

Information-Theoretic Generalization Bounds for Transductive Learning and its Applications