A Survey on Deep Active Learning: Recent Advances and New Frontiers

Dongyuan Li,Zhen Wang,Yankai Chen,Renhe Jiang,Weiping Ding,Manabu Okumura
2024-07-15
Abstract:Active learning seeks to achieve strong performance with fewer training samples. It does this by iteratively asking an oracle to label new selected samples in a human-in-the-loop manner. This technique has gained increasing popularity due to its broad applicability, yet its survey papers, especially for deep learning-based active learning (DAL), remain scarce. Therefore, we conduct an advanced and comprehensive survey on DAL. We first introduce reviewed paper collection and filtering. Second, we formally define the DAL task and summarize the most influential baselines and widely used datasets. Third, we systematically provide a taxonomy of DAL methods from five perspectives, including annotation types, query strategies, deep model architectures, learning paradigms, and training processes, and objectively analyze their strengths and weaknesses. Then, we comprehensively summarize main applications of DAL in Natural Language Processing (NLP), Computer Vision (CV), and Data Mining (DM), etc. Finally, we discuss challenges and perspectives after a detailed analysis of current studies. This work aims to serve as a useful and quick guide for researchers in overcoming difficulties in DAL. We hope that this survey will spur further progress in this burgeoning field.
Machine Learning
What problem does this paper attempt to address?
The paper aims to address key issues in the field of Deep Active Learning (DAL), particularly in achieving strong performance while reducing annotation costs. Specifically, the goals of the paper include: 1. **Comprehensive Review of DAL**: Given the scarcity of review articles on DAL, this paper provides a detailed review covering the latest advancements and technologies. 2. **Definition and Classification**: Formally defines the DAL task and systematically classifies DAL methods from five aspects, including annotation types, query strategies, deep model architectures, learning paradigms, and training processes. 3. **Summary of Application Areas**: Summarizes the applications of DAL in fields such as Natural Language Processing (NLP), Computer Vision (CV), and Data Mining (DM). 4. **Challenges and Prospects**: Analyzes the main challenges in current research and proposes future research directions. Through these efforts, the paper hopes to provide researchers with a useful guide to help them overcome difficulties in the DAL field and promote further development in this area.