Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions

Kaiwen Wang,Yinzhe Shen,Martin Lauer
2024-03-23
Abstract:Existing object detectors encounter challenges in handling domain shifts between training and real-world data, particularly under poor visibility conditions like fog and night. Cutting-edge cross-domain object detection methods use teacher-student frameworks and compel teacher and student models to produce consistent predictions under weak and strong augmentations, respectively. In this paper, we reveal that manually crafted augmentations are insufficient for optimal teaching and present a simple yet effective framework named Adversarial Defense Teacher (ADT), leveraging adversarial defense to enhance teaching quality. Specifically, we employ adversarial attacks, encouraging the model to generalize on subtly perturbed inputs that effectively deceive the model. To address small objects under poor visibility conditions, we propose a Zoom-in Zoom-out strategy, which zooms-in images for better pseudo-labels and zooms-out images and pseudo-labels to learn refined features. Our results demonstrate that ADT achieves superior performance, reaching 54.5% mAP on Foggy Cityscapes, surpassing the previous state-of-the-art by 2.6% mAP.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenges encountered by existing object detectors in handling domain transfer between training data and actual data under poor visibility conditions (such as foggy days and at night). Specifically, the paper points out that current methods perform poorly under these conditions because manually - designed data augmentation methods are not sufficient to optimize the teaching effect. To solve this problem, the paper proposes a new framework - Adversarial Defense Teacher (ADT), which improves the quality of teacher - student mutual learning by introducing adversarial defense. In addition, in order to better detect small objects, the paper also proposes a Zoom - in Zoom - out strategy to improve the detection ability of small objects under poor visibility conditions. ### Main Contributions 1. **Introduction of Adversarial Defense Teacher (ADT)**: This is a method based on the teacher - student framework, which enhances the quality of mutual learning through adversarial defense. Specifically, by making small but effective perturbations (adversarial attacks) to the input data, the model makes highly inconsistent predictions on these perturbed data, thereby improving the effect of mutual learning. 2. **Proposal of Zoom - in Zoom - out Strategy**: This strategy improves the pseudo - label recall rate of small objects by magnifying the image, and then shrinks the image and pseudo - labels so that the student model can learn from more refined features. 3. **Experimental Verification**: The effectiveness of this framework has been verified through extensive experiments. ADT has achieved 54.5% mAP on the Foggy Cityscapes dataset, which is 2.6% higher than the previous state - of - the - art method. ### Method Overview - **Weak - strong Augmentation**: The teacher model generates high - confidence pseudo - labels on weakly - augmented data, while the student model is trained on strongly - augmented data. - **Adversarial Attack**: Conduct adversarial attacks on strongly - augmented data to generate adversarial samples. These samples are almost imperceptible to human vision but can significantly affect the model's prediction. - **Zoom - in Zoom - out Strategy**: Use magnified images in the teacher model to generate better pseudo - labels, and then use shrunk images and pseudo - labels in the student model to extract more refined features. ### Experimental Results - **Foggy Cityscapes**: ADT outperforms the existing state - of - the - art methods on both the "0.02" split and the "all" split, increasing mAP by 0.8% and 2.6% respectively. - **BDD100K**: In the day - to - night adaptation task, ADT also performs well, further verifying its effectiveness in different scenarios. ### Conclusion The ADT framework proposed in the paper effectively solves the domain transfer problem of object detection under poor visibility conditions through adversarial defense and the Zoom - in Zoom - out strategy, significantly improving the robustness and performance of the model.