PseCo: Pseudo Labeling and Consistency Training for Semi-Supervised Object Detection

Gang Li,Xiang Li,Yujie Wang,Yichao Wu,Ding Liang,Shanshan Zhang
2022-07-20
Abstract:In this paper, we delve into two key techniques in Semi-Supervised Object Detection (SSOD), namely pseudo labeling and consistency training. We observe that these two techniques currently neglect some important properties of object detection, hindering efficient learning on unlabeled data. Specifically, for pseudo labeling, existing works only focus on the classification score yet fail to guarantee the localization precision of pseudo boxes; For consistency training, the widely adopted random-resize training only considers the label-level consistency but misses the feature-level one, which also plays an important role in ensuring the scale invariance. To address the problems incurred by noisy pseudo boxes, we design Noisy Pseudo box Learning (NPL) that includes Prediction-guided Label Assignment (PLA) and Positive-proposal Consistency Voting (PCV). PLA relies on model predictions to assign labels and makes it robust to even coarse pseudo boxes; while PCV leverages the regression consistency of positive proposals to reflect the localization quality of pseudo boxes. Furthermore, in consistency training, we propose Multi-view Scale-invariant Learning (MSL) that includes mechanisms of both label- and feature-level consistency, where feature consistency is achieved by aligning shifted feature pyramids between two images with identical content but varied scales. On COCO benchmark, our method, termed PSEudo labeling and COnsistency training (PseCo), outperforms the SOTA (Soft Teacher) by 2.0, 1.8, 2.0 points under 1%, 5%, and 10% labelling ratios, respectively. It also significantly improves the learning efficiency for SSOD, e.g., PseCo halves the training time of the SOTA approach but achieves even better performance. Code is available at
Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address two key technical issues in Semi-Supervised Object Detection (SSOD): pseudo labeling and consistency training. Specifically: 1. **The issue of pseudo labeling**: Existing methods only focus on classification scores while ignoring the positional accuracy of pseudo boxes. The paper points out that there is no strong correlation between classification scores and the accuracy of box localization, which leads to poor optimization and low learning efficiency when using inaccurate pseudo boxes for training. 2. **The issue of consistency training**: The widely adopted random scaling training method only considers label-level consistency, neglecting feature-level consistency, which is equally important for ensuring scale invariance. To address these issues, the paper proposes two new methods: - **Noisy Pseudo box Learning (NPL)**: This includes Prediction-guided Label Assignment (PLA) and Positive-proposal Consistency Voting (PCV). The former assigns labels based on model predictions rather than IoU values, while the latter measures the quality of pseudo boxes through positive sample regression consistency and reweights the regression loss accordingly. - **Multi-view Scale-invariant Learning (MSL)**: This combines label-level and feature-level consistency training by aligning the feature pyramids of images with the same content but different scales to achieve feature consistency. With these improvements, the proposed method PseCo significantly outperforms existing state-of-the-art methods on the COCO benchmark, showing notable improvements across different annotation ratios and greatly enhancing learning efficiency. For example, with a 10% annotation ratio, PseCo increases the baseline mAP from 26.9% to 36.1%, surpassing previous methods by at least 2.0%.