Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving

Yuting Hong, Hui Xiao, Huazheng Hao, Xiaojie Qiu, Baochen Yao, Chengbin Peng
2024-09-22
Abstract: With the advancement of autonomous driving, semantic segmentation has achieved remarkable progress. The training of such networks heavily relies on image annotations, which are very expensive to obtain. Semi-supervised learning can utilize both labeled data and unlabeled data with the help of pseudo-labels. However, in many real-world scenarios where classes are imbalanced, majority classes often play a dominant role during training and the learning quality of minority classes can be undermined. To overcome this limitation, we propose a synergistic training framework, including a professional training module to enhance minority class learning and a general training module to learn more comprehensive semantic information. Based on a pixel selection strategy, they can iteratively learn from each other to reduce error accumulation and coupling. In addition, a dual contrastive learning with anchors is proposed to guarantee more distinct decision boundaries. In experiments, our framework demonstrates superior performance compared to state-of-the-art methods on benchmark datasets.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses two problems in semi-supervised semantic segmentation for autonomous driving: class imbalance and model coupling. Specifically:

1. **Class Imbalance Problem**: In many real-world scenarios, the class distribution of a dataset is long-tailed (i.e., some classes have far more samples than others). This imbalance lets the majority classes dominate training, which severely degrades the learning quality of the minority classes.
2. **Model Coupling Problem**: Some existing semi-supervised learning methods tend to couple their models, which leads to more serious error accumulation.

These problems limit model performance. To address them, the authors propose a synergistic training framework (Synergistic Training framework with Professional and General Training, STPG), which includes two modules:

- **Professional Training Module**: Focuses on improving the learning quality of minority classes and reducing error accumulation.
- **General Training Module**: Learns more comprehensive semantic information and avoids model coupling.

In addition, the authors introduce Dual Contrastive Learning with Anchors to sharpen the decision boundaries between classes, so that the model handles minority classes better instead of focusing only on the majority classes. With these contributions, the authors aim to improve performance on benchmark datasets and surpass existing state-of-the-art methods.
### Formula Summary

- **Cross-Entropy Loss Function**:
  \[ L_s=\ell_{ce}(f_{\theta_{Gen}}(A_w(x_l)), y_l)+\ell_{ce}(f_{\theta_{Pro}}(A_w(x_l)), y_l) \]
- **Professional Training Module Loss**:
  \[ L_{Pro}^u = \omega_{Pro}^u\ell_{ce}(p_{Pro}^u,\hat{y}_{Cons}^u+\hat{y}_{Hmis}^u) \]
- **General Training Module Loss**:
  \[ L_{Gen}^u=\omega_{Gen}^u\ell_{ce}(p_{Gen}^u,\hat{y}_{Pro}^u) \]
- **Anchor Contrast Loss**:
  \[ L_{ac}=-\log\frac{\exp(f\cdot v_{\sigma_c}/\tau)}{\exp(f\cdot v_{\sigma_c}/\tau)+\sum_{c'\neq c}\exp(f\cdot v_{\sigma_{c'}}/\tau)} \]
- **Similarity Loss**:
  \[ L_{sim}=\frac{1}{|M_c|}\sum_{i^+\in Q_c}\left(1 - \frac{\langle f, i^+\rangle}{\|f\|_2\cdot\|i^+\|_2}\right) \]

With these losses, the authors address class imbalance and model coupling and improve the performance of semi-supervised semantic segmentation.
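
To make the anchor contrast loss and the similarity loss above more concrete, here is a minimal sketch of both for a single feature vector, in plain Python. It is not the authors' implementation. The function names, the list-based feature representation, and the default temperature `tau=0.5` are assumptions made for illustration. The sketch also assumes that the normalizer \(|M_c|\) equals the number of positive features summed over, which the summary does not state.

```python
import math


def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def anchor_contrast_loss(f, anchors, c, tau=0.5):
    """L_ac for one feature f whose class is c.

    anchors[k] plays the role of the class anchor v_{sigma_k}.
    tau=0.5 is an illustrative default, not a value from the paper.
    """
    logits = [dot(f, v) / tau for v in anchors]
    m = max(logits)  # subtract the max before exponentiating, for stability
    log_denominator = m + math.log(sum(math.exp(z - m) for z in logits))
    # -log(exp(logit_c) / sum_k exp(logit_k)) = log_denominator - logit_c
    return log_denominator - logits[c]


def similarity_loss(f, positives):
    """L_sim: mean of (1 - cosine similarity) between f and each positive.

    Assumption: the normalizer |M_c| is taken as len(positives).
    """
    norm_f = math.sqrt(dot(f, f))
    total = 0.0
    for p in positives:
        cosine = dot(f, p) / (norm_f * math.sqrt(dot(p, p)))
        total += 1.0 - cosine
    return total / len(positives)


# Toy 2-D example: the feature coincides with the anchor of class 0.
feature = [1.0, 0.0]
class_anchors = [[1.0, 0.0], [0.0, 1.0]]

loss_correct = anchor_contrast_loss(feature, class_anchors, c=0, tau=1.0)
loss_wrong = anchor_contrast_loss(feature, class_anchors, c=1, tau=1.0)
loss_sim = similarity_loss(feature, [[2.0, 0.0], [0.0, 3.0]])
```

In this toy case `loss_correct` is smaller than `loss_wrong`, because the feature lies on the class-0 anchor, and `loss_sim` averages one perfectly aligned positive (contributing 0) with one orthogonal positive (contributing 1). In practice both losses would be computed in batches over pixel features in a deep learning framework; the scalar form here only mirrors the formulas term by term.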