WMBAL: weighted minimum bounds for active learning

Shuai Lu,Jiaxi Zheng,Zhi Li,Xuerui Dai
DOI: https://doi.org/10.1007/s10489-024-05328-x
IF: 5.3
2024-02-18
Applied Intelligence
Abstract:In the present study, aimed at reliably acquiring difficult samples for object detection models from massive raw data, we propose a novel difficult sample mining strategy based on active learning with Weighted Minimum Bounds (WMB). To accurately gauge the difficulty of samples for object detection models, we introduce the concept of weighted minimum bounds. Uniquely, the metric for measuring sample difficulty includes the classification discrepancy within detection frames and a weight factor derived from the Average Precision (AP) of the object detection model on the val set. Additionally, we introduce the Don't Care Area (DCA) to capture the uncertainty in localization tasks for object detection models. The DCA is utilized only during the data mining and training phases, ensuring that no additional time is incurred during inference. Furthermore, we propose a periodic and phased framework based on active learning for mining difficult samples, which can progressively identify challenging samples from unlabeled data and perform iterative optimization. To evaluate the effectiveness of our methods, we have collected the VANJEE-Image dataset and the VANJEE-PointCloud dataset from real-world scenarios. We empirically demonstrate the superiority of our approach, which outperforms traditional active learning methods on both image detection and point cloud detection datasets. The code and datasets are available at https://github.com/sharkls/WMBAL-Weighted-Minimum-Bounds-for-Active-Learning.
computer science, artificial intelligence
What problem does this paper attempt to address?