Transductive Active Learning: Theory and Applications

Jonas Hübotter, Bhavya Sukhija, Lenart Treven, Yarden As, Andreas Krause
2024-10-29
Abstract: We study a generalization of classical active learning to real-world settings with concrete prediction targets where sampling is restricted to an accessible region of the domain, while prediction targets may lie outside this region. We analyze a family of decision rules that sample adaptively to minimize uncertainty about prediction targets. We are the first to show, under general regularity assumptions, that such decision rules converge uniformly to the smallest possible uncertainty obtainable from the accessible data. We demonstrate their strong sample efficiency in two key applications: active fine-tuning of large neural networks and safe Bayesian optimization, where they achieve state-of-the-art performance.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper asks how to learn an unknown function \(f\) on a target space \(A\) when data can only be sampled from a restricted sample space \(S\). Specifically, it studies how active learning can minimize the uncertainty about \(f\) on \(A\) when \(A\) lies partially or entirely outside \(S\). This differs from classical ("global") active learning, which usually assumes that the target space and the sample space coincide, that is, \(A = S\).

### Main Contributions

1. **Theoretical Analysis**:
   - For the first time, the paper gives the rate at which the uncertainty on the target space \(A\) converges uniformly to the smallest value attainable from samples in the sample space \(S\) (Theorems 3.2 and 3.3). These results provide a theoretical basis for minimizing posterior uncertainty in transductive active learning and indicate that transductive active learning may be more sample-efficient than inductive active learning.
2. **Application Demonstration**:
   - The paper evaluates transductive active learning in two key applications: batch-wise active fine-tuning of large-scale neural networks and safe Bayesian optimization. In both, it reports state-of-the-art performance.

### Specific Problem Description

- **Target Space \(A\) and Sample Space \(S\)**: Given a domain \(X\), \(A \subseteq X\) is the target space and \(S \subseteq X\) is the sample space. The goal is to select sample points in \(S\) that reduce the uncertainty about \(f\) on \(A\) as much as possible.
- **Adaptive Decision Rules**: The paper studies a class of adaptive decision rules that select sample points so as to minimize the uncertainty on the target space \(A\).
  The main decision rules are Information-based Transductive Learning (ITL) and Variance-based Transductive Learning (VTL). Here \(D_{n-1}\) denotes the data collected in the first \(n-1\) rounds and \(y_x\) a noisy observation of \(f\) at \(x\).
  - **ITL**: Select the sample point \(x_n \in S\) that maximizes the information gain \(I(f_A; y_x \mid D_{n-1})\) about the target space \(A\).
  - **VTL**: Select the sample point \(x_n \in S\) that minimizes the total posterior variance \(\operatorname{tr} \operatorname{Var}[f_A \mid D_{n-1}, y_x]\) on the target space \(A\).

### Experimental Verification

- **Gaussian Process Setup**: Under a Gaussian process (GP) model, the paper shows that ITL and VTL reduce the uncertainty on the target space \(A\) and improve prediction accuracy.
- **Neural Network Fine-tuning**: Experiments on the MNIST and CIFAR-100 datasets show that ITL and VTL can fine-tune neural networks effectively with a small number of samples, especially when the target distribution \(P_A\) differs from the training distribution.
- **Safe Bayesian Optimization**: The paper also applies transductive active learning to safe Bayesian optimization and shows its advantages in exploring the safe parameter space and optimizing the objective function.

### Conclusion

Through theoretical analysis and experiments, the paper demonstrates the effectiveness and advantages of transductive active learning on practical problems, especially when the target space and the sample space do not coincide. These results suggest new directions and methods for future active learning research.
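
The ITL and VTL rules described above have closed forms under a GP model, because all quantities involved are jointly Gaussian. The following is a minimal NumPy sketch, not the authors' implementation. It assumes a zero-mean GP prior with an RBF kernel, homoscedastic Gaussian observation noise, and finite sets \(S\) and \(A\). All function names (`rbf_kernel`, `condition_on`, `itl_scores`, `vtl_scores`, `select`) and the toy 1-D domain are illustrative assumptions.

```python
import numpy as np

def rbf_kernel(X1, X2, lengthscale=1.0):
    """Squared-exponential kernel matrix between two sets of points."""
    d2 = ((X1[:, None, :] - X2[None, :, :]) ** 2).sum(-1)
    return np.exp(-0.5 * d2 / lengthscale**2)

def condition_on(K, i, noise_var):
    """Posterior covariance after one noisy observation at index i."""
    k_i = K[:, i:i + 1]
    return K - (k_i @ k_i.T) / (K[i, i] + noise_var)

def itl_scores(K, S_idx, A_idx, noise_var, jitter=1e-6):
    """I(f_A; y_x | D) = 0.5 * log(Var[y_x | D] / Var[y_x | D, f_A])."""
    K_AA = K[np.ix_(A_idx, A_idx)] + jitter * np.eye(len(A_idx))
    K_AS = K[np.ix_(A_idx, S_idx)]
    var_y = np.diag(K)[S_idx] + noise_var
    # Variance of f(x) explained by knowing f on A.
    explained = np.einsum("as,as->s", K_AS, np.linalg.solve(K_AA, K_AS))
    return 0.5 * np.log(var_y / (var_y - explained))

def vtl_scores(K, S_idx, A_idx, noise_var):
    """Reduction in tr Var[f_A] from observing y_x (larger is better).

    Minimizing the posterior trace is the same as maximizing this reduction.
    """
    K_AS = K[np.ix_(A_idx, S_idx)]
    return (K_AS**2).sum(axis=0) / (np.diag(K)[S_idx] + noise_var)

def select(K, S_idx, A_idx, n_steps, rule, noise_var=0.01):
    """Greedily pick n_steps sample indices from S using the given rule."""
    chosen = []
    for _ in range(n_steps):
        scores = rule(K, S_idx, A_idx, noise_var)
        i = int(S_idx[np.argmax(scores)])
        chosen.append(i)
        K = condition_on(K, i, noise_var)
    return chosen, K

# Toy 1-D domain: we may sample on [0, 2] but want predictions on [2.5, 3].
X = np.linspace(0.0, 3.0, 31)[:, None]
S_idx = np.flatnonzero(X[:, 0] <= 2.0)
A_idx = np.flatnonzero(X[:, 0] >= 2.5)
K0 = rbf_kernel(X, X, lengthscale=0.5)

itl_chosen, K_itl = select(K0, S_idx, A_idx, n_steps=5, rule=itl_scores)
vtl_chosen, K_vtl = select(K0, S_idx, A_idx, n_steps=5, rule=vtl_scores)
print("ITL picks x =", X[itl_chosen, 0])
print("VTL picks x =", X[vtl_chosen, 0])
```

In this sketch the GP posterior covariance does not depend on the observed values, so the selection loop never needs labels. With learned kernel hyperparameters or a neural-network surrogate, as in the fine-tuning experiments, that would no longer hold and the covariance would have to be re-estimated between rounds.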