Simplifying Source-Free Domain Adaptation for Object Detection: Effective Self-Training Strategies and Performance Insights

Yan Hao, Florent Forest, Olga Fink
2024-07-10
Abstract: This paper focuses on source-free domain adaptation for object detection in computer vision. This task is challenging and of great practical interest, due to the cost of obtaining annotated data sets for every new domain. Recent research has proposed various solutions for Source-Free Object Detection (SFOD), most being variations of teacher-student architectures with diverse feature alignment, regularization and pseudo-label selection strategies. Our work investigates simpler approaches and their performance compared to more complex SFOD methods in several adaptation scenarios. We highlight the importance of batch normalization layers in the detector backbone, and show that adapting only the batch statistics is a strong baseline for SFOD. We propose a simple extension of a Mean Teacher with strong-weak augmentation in the source-free setting, Source-Free Unbiased Teacher (SF-UT), and show that it actually outperforms most of the previous SFOD methods. Additionally, we showcase that an even simpler strategy consisting in training on a fixed set of pseudo-labels can achieve similar performance to the more complex teacher-student mutual learning, while being computationally efficient and mitigating the major issue of teacher-student collapse. We conduct experiments on several adaptation tasks using benchmark driving datasets including (Foggy)Cityscapes, Sim10k and KITTI, and achieve a notable improvement of 4.7% AP50 on Cityscapes→Foggy-Cityscapes compared with the latest state-of-the-art in SFOD. Source code is available at https://github.com/EPFL-IMOS/simple-SFOD.
Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
This paper addresses source-free domain adaptation (SFDA) for object detection: transferring a detector pre-trained on a source domain to a target domain without access to the source-domain data, in order to improve its performance on the target domain. The task is of great practical significance because obtaining annotated data sets for each new domain is costly. The paper also explores simpler self-training strategies and compares their performance with existing, more complex methods across several domain-adaptation scenarios. The main contributions of the paper include:
1. Emphasizing the importance of batch normalization (BN) layers and showing that simply adapting the batch statistics (i.e., using the AdaBN technique) can serve as a strong baseline for source-free domain adaptation.
2. Proposing a simple extension of the Unbiased Teacher (UT) framework to the source-free setting, called Source-Free Unbiased Teacher (SF-UT).
3. Exploring multiple self-training configurations, including different exponential moving average (EMA) update rates for the teacher, the use of weak-strong augmentation, and the adaptation of batch statistics.
4. Introducing a new lightweight strategy that combines AdaBN with training on a fixed set of pseudo-labels (AdaBN + Fixed SF-FixMatch), which achieves performance comparable to the more complex Unbiased Teacher method while avoiding the collapse of the teacher-student architecture.
Through these studies, the paper aims to simplify source-free domain adaptation and to provide a computationally efficient solution that mitigates teacher-student collapse.
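The three building blocks named in this summary (adapting batch statistics, the EMA teacher update, and confidence-based pseudo-label selection) can be illustrated with a minimal, framework-free sketch. It is not the authors' implementation: the function names, the data layout, and the default values `keep_rate=0.999` and `threshold=0.8` are assumptions chosen for illustration, and a real detector would apply these steps to network layers and bounding-box predictions.

```python
from statistics import fmean, pvariance


def adapt_bn_statistics(target_batches):
    """AdaBN-style step: re-estimate per-channel mean and variance
    from unlabeled target-domain activations (no gradient updates).

    target_batches: list of batches, each a list of per-sample
    feature vectors with one float per channel (hypothetical layout).
    """
    samples = [sample for batch in target_batches for sample in batch]
    channels = list(zip(*samples))  # transpose to per-channel values
    means = [fmean(values) for values in channels]
    variances = [pvariance(values) for values in channels]
    return means, variances


def ema_update(teacher, student, keep_rate=0.999):
    """Mean-Teacher-style update of the teacher weights:
    teacher <- keep_rate * teacher + (1 - keep_rate) * student.
    Weights are plain dicts of floats here (an assumption).
    """
    return {
        name: keep_rate * teacher[name] + (1.0 - keep_rate) * student[name]
        for name in teacher
    }


def select_pseudo_labels(detections, threshold=0.8):
    """Keep only detections whose confidence reaches the threshold.
    The threshold value is illustrative, not taken from the paper.
    """
    return [det for det in detections if det["score"] >= threshold]


if __name__ == "__main__":
    # Two target batches, two channels per sample.
    batches = [[[1.0, 10.0], [3.0, 10.0]], [[5.0, 10.0], [7.0, 10.0]]]
    print(adapt_bn_statistics(batches))

    # One EMA step with a deliberately small keep_rate for visibility.
    print(ema_update({"w": 1.0}, {"w": 0.0}, keep_rate=0.9))

    # Pseudo-label filtering on hypothetical detections.
    dets = [{"label": "car", "score": 0.95}, {"label": "bus", "score": 0.5}]
    print(select_pseudo_labels(dets))
```

Under these assumptions, the fixed pseudo-label strategy would correspond to calling `select_pseudo_labels` once on the predictions of the BN-adapted source model and training on that fixed set, whereas a teacher-student scheme would regenerate the labels from a teacher that is refreshed with `ema_update` after each student step.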