A Novel Semi-Supervised Object Detection Approach Via Scale ReBalancing and Global Proposal Contrast Consistency

Bo Liu,Chengrong Yang,Jing Guo,Yun Yang
DOI: https://doi.org/10.1109/tcsvt.2024.3458907
IF: 5.859
2024-01-01
IEEE Transactions on Circuits and Systems for Video Technology
Abstract:Semi-supervised Object Detection (SSOD) is a method that uses a small amount of labeled data and a large amount of unlabeled data to improve the performance of object detection. However, existing SSOD methods face the challenges of scale imbalance and class inconsistency, resulting in large differences in detection results across different scales and classes. To overcome these challenges, we propose a Scale-Rebalanced Global Proposal Contrast Consistency (SGPC) approach, which has the following three advantages: (1) We design a Scale-Rebalanced Input (SRI) structure, which adjusts the distribution of objects of different scales by resampling the input images at low magnification, thereby enhancing the ability of small object detection. (2) We design a Global Proposal Contrast Consistency Loss (GPCC), which can enhance the intra-class compactness and inter-class diversity of Region of Interest (RoI) features, thereby reducing the class inconsistency in pseudo-labels. (3) We adopt a loss blending optimization strategy, which optimizes the localization accuracy of pseudo-labels by combining supervised loss and unsupervised loss. We conduct extensive experiments on multiple datasets, and the results show that SGPC significantly outperforms the latest other methods on the SSOD task. On the PASCAL VOC dataset, SGPC achieves 55.90 mAP, on the MS-COCO dataset, SGPC exceeds the supervised methods by more than 10 mAP at different scales, and we also verify the significant improvement and robustness of SGPC on the small object detection datasets VisDrone-2019 and EDD.
What problem does this paper attempt to address?