Counterfactual Active Learning for Out-of-Distribution Generalization.

Xun Deng,Wenjie Wang,Fuli Feng,Hanwang Zhang,Xiangnan He,Yong Liao
DOI: https://doi.org/10.18653/v1/2023.acl-long.636
2023-01-01
Abstract:We study the out-of-distribution generalization of active learning that adaptively selects samples for annotation in learning the decision boundary of classification.Our empirical study finds that increasingly annotating seen samples may hardly benefit the generalization.To address the problem, we propose Counterfactual Active Learning (CounterAL) that empowers active learning with counterfactual thinking to bridge the seen samples with unseen cases.In addition to annotating factual samples, Coun-terAL requires annotators to answer counterfactual questions to construct counterfactual samples for training.To achieve CounterAL, we design a new acquisition strategy that selects the informative factual-counterfactual pairs for annotation; and a new training strategy that pushes the model update to focus on the discrepancy between factual and counterfactual samples.We evaluate CounterAL on multiple public datasets of sentiment analysis and natural language inference.The experiment results show that CounterAL requires fewer acquisition rounds and outperforms existing active learning methods by a large margin in OOD tests with comparable IID performance.
What problem does this paper attempt to address?