Credible Teacher for Semi-Supervised Object Detection in Open Scene

Jingyu Zhuang, Kuo Wang, Liang Lin, Guanbin Li
2024-01-03
Abstract: Semi-Supervised Object Detection (SSOD) has achieved resounding success by leveraging unlabeled data to improve detection performance. However, in Open Scene Semi-Supervised Object Detection (O-SSOD), unlabeled data may contain unknown objects not observed in the labeled data, which increases uncertainty in the model's predictions for known objects. This is detrimental to current methods that mainly rely on self-training, as more uncertainty lowers the localization and classification precision of pseudo labels. To this end, we propose Credible Teacher, an end-to-end framework. Credible Teacher adopts an interactive teaching mechanism that uses flexible labels to prevent uncertain pseudo labels from misleading the model, and gradually reduces the model's uncertainty through the guidance of other credible pseudo labels. Empirical results demonstrate that our method effectively restrains the adverse effect caused by O-SSOD and significantly outperforms existing counterparts.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses Open Scene Semi-Supervised Object Detection (O-SSOD). In O-SSOD, unlabeled data may contain object categories that never appear in the labeled data, which increases the uncertainty of the model's predictions for known objects. Existing pseudo-label-based methods rely mainly on self-training, so greater uncertainty lowers the localization and classification accuracy of the pseudo-labels and, in turn, degrades model performance.
To address this, the authors propose an end-to-end framework called Credible Teacher. It is built on a teacher-student structure in which the "teacher" model guides the learning of the "student" model through a flexible labeling mechanism. Specifically, Credible Teacher adopts an interactive teaching mechanism that uses flexible labels to prevent uncertain pseudo-labels from misleading the model, while credible pseudo-labels gradually reduce the model's uncertainty. To mitigate the distribution gap between datasets, the method also employs Data-specific Batch Normalization (DBN).
Experimental results on the MS-COCO and Objects365 datasets show that Credible Teacher achieves significantly better performance than existing methods under the O-SSOD setting. It performs particularly well in handling unknown categories, reducing the impact of pseudo-label noise and mining more useful information from unlabeled data.
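The summary names Data-specific Batch Normalization (DBN) without spelling out how it works. The following is a minimal sketch under one plausible reading: each data source keeps its own normalization statistics, so that a distribution shift in the unlabeled data does not contaminate the statistics of the labeled data. The class name `DataSpecificBatchNorm`, the source names `"labeled"` and `"unlabeled"`, and the `momentum` and `eps` defaults are all assumptions made for illustration, not details taken from the paper.

```python
import math


class DataSpecificBatchNorm:
    """Hypothetical sketch: one set of running statistics per data source.

    This is an illustrative reading of DBN, not the authors' implementation.
    It normalizes a single feature channel given as a list of floats.
    """

    def __init__(self, sources=("labeled", "unlabeled"), momentum=0.1, eps=1e-5):
        self.momentum = momentum
        self.eps = eps
        # Separate running mean/variance for every data source (assumed design).
        self.stats = {name: {"mean": 0.0, "var": 1.0} for name in sources}

    def forward(self, values, source, training=True):
        stats = self.stats[source]
        if training:
            # Use the statistics of the current batch and update only
            # the running statistics that belong to this source.
            mean = sum(values) / len(values)
            var = sum((v - mean) ** 2 for v in values) / len(values)
            m = self.momentum
            stats["mean"] = (1 - m) * stats["mean"] + m * mean
            stats["var"] = (1 - m) * stats["var"] + m * var
        else:
            # At inference time, fall back to the stored running statistics.
            mean, var = stats["mean"], stats["var"]
        return [(v - mean) / math.sqrt(var + self.eps) for v in values]


# Usage: batches from the two sources update disjoint statistics.
bn = DataSpecificBatchNorm()
labeled_out = bn.forward([1.0, 3.0], source="labeled")
unlabeled_out = bn.forward([10.0, 30.0], source="unlabeled")
```

In this sketch the two calls normalize their batches independently, and the running mean for `"labeled"` is unaffected by the much larger values seen in the `"unlabeled"` batch. A real detector would apply the same idea per channel inside each normalization layer.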