Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation

Björn Michele, Alexandre Boulch, Tuan-Hung Vu, Gilles Puy, Renaud Marlet, Nicolas Courty
2024-09-07
Abstract: We tackle the challenging problem of source-free unsupervised domain adaptation (SFUDA) for 3D semantic segmentation. It amounts to performing domain adaptation on an unlabeled target domain without any access to source data; the available information is a model trained to achieve good performance on the source domain. A common issue with existing SFUDA approaches is that performance degrades after some training time, which is a byproduct of an under-constrained and ill-posed problem. We discuss two strategies to alleviate this issue. First, we propose a sensible way to regularize the learning problem. Second, we introduce a novel criterion based on agreement with a reference model. It is used (1) to stop the training when appropriate and (2) as a validator to select hyperparameters without any knowledge of the target domain. Our contributions are easy to implement and readily applicable to all SFUDA methods, ensuring stable improvements over all baselines. We validate our findings on various 3D lidar settings, achieving state-of-the-art performance. The project repository (with code) is: http://github.com/valeoai/TTYD
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper addresses the **performance degradation of source-free unsupervised domain adaptation (SFUDA) for 3D semantic segmentation**. Specifically, the authors study how to adapt a model trained only on the source domain to unlabeled target-domain data, without accessing the source data.

#### Background and Challenges

1. **Source-free Setting**: Traditional unsupervised domain adaptation (UDA) requires data from both the source domain and the target domain during training. In practice, the source-domain data may be inaccessible for data-privacy or development-cycle reasons. Moreover, jointly retraining an already trained model on source-domain and target-domain data consumes a large amount of resources.
2. **Performance Degradation**: Existing SFUDA methods degrade during training: past a certain point, training longer lowers performance instead of improving it. This is a byproduct of the under-constrained and ill-posed nature of the problem.

#### Solutions

To address these challenges, the authors propose the following strategies:

1. **Regularizing the Learning Problem**: A regularization term stabilizes the learning process and prevents the model from overfitting the target-domain data.
2. **Agreement Criterion Based on a Reference Model**:
   - **Stopping Training**: A new unsupervised stopping criterion determines when to stop training, so that performance degradation is avoided.
   - **Hyperparameter Selection**: The same criterion serves as a validator to select hyperparameters without any target-domain labels.
3. **Self-training Module**: The preliminarily adapted model generates pseudo-labels, which are then used for self-training to further improve performance.

#### Formula Summary

- **Discriminative Loss**:
  \[ L_{\text{discrim}}(P)=\frac{1}{|P|} \sum_{p \in P} H(f[\theta_t](P)_p) \]
  where \(H\) is the entropy function, \(P\) is the point cloud, \(\theta_t\) denotes the parameters of the model being adapted to the target domain, and \(f[\theta_t](P)_p\) is the class prediction of the model at point \(p\).
- **Distribution Similarity Loss**:
  \[ L_{\text{simsrc}}(P)=\text{KL}(D(P) \| D_s), \quad \text{where } D(P)=\frac{1}{|P|} \sum_{p \in P} f[\theta_t](P)_p \]
  Here \(D_s\) is the class distribution of the source domain, and \(\text{KL}\) is the Kullback-Leibler divergence.
- **Total Loss**:
  \[ L(P)=\max(0, L_{\text{discrim}}(P)-\lambda)+\max(0, L_{\text{simsrc}}(P)-\lambda) \]
  where \(\lambda\) is a threshold, set to 0.02.

#### Experimental Results

The authors ran experiments on multiple 3D lidar datasets, which confirmed the effectiveness of the proposed method and yielded state-of-the-art performance.

In conclusion, this paper tackles the performance degradation common to SFUDA methods by introducing a regularized training objective and an agreement-based criterion, providing a more stable and robust solution for 3D semantic segmentation.
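The three formulas in the summary can be made concrete with a small, dependency-free sketch. It is only an illustration under stated assumptions, not the authors' implementation: the model outputs are assumed to be per-point softmax probabilities, entropy and KL use the natural logarithm, and all function names as well as the `eps` guard are hypothetical choices made for this example.

```python
import math

LAMBDA = 0.02  # threshold value quoted in the formula summary


def entropy(probs):
    """Shannon entropy (natural log, an assumption) of one probability vector."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def discriminative_loss(point_probs):
    """L_discrim(P): mean entropy of the per-point predictions."""
    return sum(entropy(p) for p in point_probs) / len(point_probs)


def mean_distribution(point_probs):
    """D(P): predicted class distribution averaged over all points."""
    n = len(point_probs)
    num_classes = len(point_probs[0])
    return [sum(p[c] for p in point_probs) / n for c in range(num_classes)]


def kl_divergence(d, d_ref, eps=1e-12):
    """KL(d || d_ref); eps avoids log(0) and is an assumption of this sketch."""
    return sum(a * math.log((a + eps) / (b + eps))
               for a, b in zip(d, d_ref) if a > 0.0)


def similarity_loss(point_probs, source_dist):
    """L_simsrc(P): KL between D(P) and the source class distribution D_s."""
    return kl_divergence(mean_distribution(point_probs), source_dist)


def total_loss(point_probs, source_dist, lam=LAMBDA):
    """L(P): each term is hinged, so it contributes only above the threshold."""
    discrim = max(0.0, discriminative_loss(point_probs) - lam)
    simsrc = max(0.0, similarity_loss(point_probs, source_dist) - lam)
    return discrim + simsrc


if __name__ == "__main__":
    # Two points, two classes: confident predictions whose average matches D_s.
    confident = [[1.0, 0.0], [0.0, 1.0]]
    print(total_loss(confident, [0.5, 0.5]))
```

In this toy case both terms fall below the threshold, so the hinged loss vanishes and gives no further gradient. Under the formula as summarized, this means the objective stops pushing the model once predictions are confident enough and their average is close enough to the source class distribution.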