Label Shift Conditioned Hybrid Querying for Deep Active Learning

Jiaqi Li, Haojia Kong, Gezheng Xu, Changjian Shui, Ruizhi Pu, Zhao Kang, Charles X. Ling, Boyu Wang
DOI: https://doi.org/10.1016/j.knosys.2023.110616
IF: 8.139
2023-01-01
Knowledge-Based Systems
Abstract: Active learning aims to interactively select the most informative data to accelerate the learning procedure. In this paper, we propose a novel and principled deep active learning approach under label shift, where the labeled dataset has a different class distribution from the unlabeled data pool. By formulating the querying procedure as a distribution matching problem, we derive a generalization guarantee under label shift. Moreover, the theoretical results inspire a unified practical framework for deep batch active learning: in the training stage, we jointly correct the label shift and align the semantic conditional distributions; in the querying stage, we propose a label-shift-conditioned hybrid querying (LSCHQ) strategy that balances uncertainty and diversity under label shift. Empirical results show that our proposed method is highly effective and outperforms baseline algorithms on a variety of benchmarks and a real-world dataset.
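The abstract does not spell out the LSCHQ procedure, so the following is only a minimal illustrative sketch of the general idea of hybrid querying under label shift, not the authors' algorithm. It assumes a naive plug-in estimate of the pool class prior (the mean of predicted probabilities), entropy as the uncertainty measure, and a greedy farthest-point term for diversity. The function names `label_shift_weights` and `hybrid_query` and the `trade_off` parameter are hypothetical.

```python
import numpy as np


def label_shift_weights(labeled_y, pool_probs, num_classes, eps=1e-8):
    """Per-class ratio of the estimated pool prior to the labeled-set prior.

    Assumption: the pool prior is approximated by averaging the model's
    predicted probabilities, which is a naive stand-in for a proper estimator.
    """
    p_labeled = np.bincount(labeled_y, minlength=num_classes) / len(labeled_y)
    q_pool = pool_probs.mean(axis=0)
    return q_pool / (p_labeled + eps)


def hybrid_query(pool_feats, pool_probs, weights, batch_size, trade_off=1.0):
    """Greedily select a batch that is both uncertain and spread out.

    Uncertainty: predictive entropy, scaled by the weight of the predicted class.
    Diversity: distance to the nearest already-selected point (0 for the first pick).
    """
    n = len(pool_feats)
    batch_size = min(batch_size, n)
    entropy = -(pool_probs * np.log(pool_probs + 1e-12)).sum(axis=1)
    uncertainty = entropy * weights[pool_probs.argmax(axis=1)]

    chosen = np.zeros(n, dtype=bool)
    min_dist = np.full(n, np.inf)
    selected = []
    for _ in range(batch_size):
        diversity = np.where(np.isinf(min_dist), 0.0, min_dist)
        score = uncertainty + trade_off * diversity
        score[chosen] = -np.inf  # never pick the same point twice
        idx = int(score.argmax())
        chosen[idx] = True
        selected.append(idx)
        dist = np.linalg.norm(pool_feats - pool_feats[idx], axis=1)
        min_dist = np.minimum(min_dist, dist)
    return selected


# Toy usage with made-up data: 4 labeled points, 5 pool points, 2 classes.
labeled_y = np.array([0, 0, 0, 1])
pool_probs = np.array([[0.9, 0.1], [0.5, 0.5], [0.2, 0.8], [0.6, 0.4], [0.3, 0.7]])
pool_feats = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, 0.5]])
w = label_shift_weights(labeled_y, pool_probs, num_classes=2)
batch = hybrid_query(pool_feats, pool_probs, w, batch_size=2)
```

Here `trade_off` controls how strongly diversity counts relative to weighted uncertainty; setting it to 0 reduces the selection to pure weighted-entropy sampling.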