Abstract:With the advancement of autonomous driving, semantic segmentation has achieved remarkable progress. The training of such networks heavily relies on image annotations, which are very expensive to obtain. Semi-supervised learning can utilize both labeled data and unlabeled data with the help of pseudo-labels. However, in many real-world scenarios where classes are imbalanced, majority classes often play a dominant role during training and the learning quality of minority classes can be undermined. To overcome this limitation, we propose a synergistic training framework, including a professional training module to enhance minority class learning and a general training module to learn more comprehensive semantic information. Based on a pixel selection strategy, they can iteratively learn from each other to reduce error accumulation and coupling. In addition, a dual contrastive learning with anchors is proposed to guarantee more distinct decision boundaries. In experiments, our framework demonstrates superior performance compared to state-of-the-art methods on benchmark datasets.

What problem does this paper attempt to address?

This paper attempts to solve the problems of class imbalance and model coupling in the semi - supervised semantic segmentation task in the field of autonomous driving. Specifically: 1. **Class Imbalance Problem**: In many real - world scenarios, the class distribution in the dataset is often a long - tailed distribution (i.e., the number of samples in some classes is much larger than that in other classes). This imbalance will cause the majority classes to dominate during the training process, and the learning quality of the minority classes will be severely affected. 2. **Model Coupling Problem**: Some existing semi - supervised learning methods are prone to cause coupling between models, thus leading to more serious error accumulation problems. These problems limit the performance improvement of the model. To solve the above problems, the author proposes a synergistic training framework (Synergistic Training framework with Professional and General Training, STPG), which includes two modules: - **Professional Training Module**: Focuses on improving the learning quality of minority classes and reducing error accumulation. - **General Training Module**: Learns more comprehensive semantic information and avoids model coupling. In addition, the author also introduces Dual Contrastive Learning with Anchors to enhance the decision boundaries between different classes, ensuring that the model not only focuses on the majority classes but also can better handle the minority classes. Through these innovations, the author hopes to significantly improve the model performance on the benchmark dataset and surpass the existing state - of - the - art techniques. ### Formula Summary - **Cross - Entropy Loss Function**: \[ L_s=\ell_{ce}(f_{\theta_{Gen}}(A_w(x_l)), y_l)+\ell_{ce}(f_{\theta_{Pro}}(A_w(x_l)), y_l) \] - **Professional Training Module Loss**: \[ L_{Pro}^u = \omega_{Pro}^u\ell_{ce}(p_{Pro}^u,\hat{y}_{Cons}^u+\hat{y}_{Hmis}^u) \] - **General Training Module Loss**: \[ L_{Gen}^u=\omega_{Gen}^u\ell_{ce}(p_{Gen}^u,\hat{y}_{Pro}^u) \] - **Anchor Contrast Loss**: \[ L_{ac}=-\log\frac{\exp(f\cdot v_{\sigma_c}/\tau)}{\exp(f\cdot v_{\sigma_c}/\tau)+\sum_{c'\neq c}\exp(f\cdot v_{\sigma_{c'}}/\tau)} \] - **Similarity Loss**: \[ L_{sim}=\frac{1}{|M_c|}\sum_{i^+\in Q_c}\left(1 - \frac{\langle f, i^+\rangle}{\|f\|_2\cdot\|i^+\|_2}\right) \] Through these formulas and methods, the author effectively solves the problems of class imbalance and model coupling and improves the performance of semi - supervised semantic segmentation.

Exploiting Minority Pseudo-Labels for Semi-Supervised Semantic Segmentation in Autonomous Driving

In Defense Of Multi-Source Omni-Supervised Efficient Convnet For Robust Semantic Segmentation In Heterogeneous Unseen Domains

Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels

Semi-Supervised Semantic Segmentation via Gentle Teaching Assistant

Realizing Pixel-Level Semantic Learning in Complex Driving Scenes Based on Only One Annotated Pixel Per Class

Image Understands Point Cloud: Weakly Supervised 3D Semantic Segmentation via Association Learning

Synergy-Guided Regional Supervision of Pseudo Labels for Semi-Supervised Medical Image Segmentation

Learning Pseudo Labels for Semi-and-weakly Supervised Semantic Segmentation

Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation

Adversarial Dual-Student With Differentiable Spatial Warping for Semi-Supervised Semantic Segmentation

Regularizing Proxies with Multi-Adversarial Training for Unsupervised Domain-Adaptive Semantic Segmentation.

Enhanced Pseudo-Label Generation with Self-supervised Training for Weakly-supervised Semantic Segmentation

Semi-Supervised Semantic Segmentation Via Adaptive Equalization Learning

Improving Synthetic to Realistic Semantic Segmentation with Parallel Generative Ensembles for Autonomous Urban Driving

Semi-supervised 3D Object Detection with Proficient Teachers.

Conservative-Progressive Collaborative Learning for Semi-supervised Semantic Segmentation

Semi-supervised Semantic Segmentation via Strong-Weak Dual-Branch Network

Adaptive Affinity Loss and Erroneous Pseudo-Label Refinement for Weakly Supervised Semantic Segmentation

Bidirectional Self-Training with Multiple Anisotropic Prototypes for Domain Adaptive Semantic Segmentation

SS-ADA: A Semi-Supervised Active Domain Adaptation Framework for Semantic Segmentation