Abstract: Existing object detectors encounter challenges in handling domain shifts between training and real-world data, particularly under poor visibility conditions like fog and night. Cutting-edge cross-domain object detection methods use teacher-student frameworks and compel teacher and student models to produce consistent predictions under weak and strong augmentations, respectively. In this paper, we reveal that manually crafted augmentations are insufficient for optimal teaching and present a simple yet effective framework named Adversarial Defense Teacher (ADT), leveraging adversarial defense to enhance teaching quality. Specifically, we employ adversarial attacks, encouraging the model to generalize on subtly perturbed inputs that effectively deceive the model. To address small objects under poor visibility conditions, we propose a Zoom-in Zoom-out strategy, which zooms in on images for better pseudo-labels and zooms out images and pseudo-labels to learn refined features. Our results demonstrate that ADT achieves superior performance, reaching 54.5% mAP on Foggy Cityscapes, surpassing the previous state-of-the-art by 2.6% mAP.
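
To make "subtly perturbed inputs that effectively deceive the model" concrete, here is a minimal sketch of a one-step sign-gradient (FGSM-style) perturbation on a toy logistic model. The abstract does not say which attack ADT uses, so the choice of attack, the toy model, and all names (`predict`, `fgsm_perturb`, `epsilon`) are assumptions made for illustration only.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def predict(weights, bias, x):
    """Probability of the positive class under a toy logistic model."""
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)) + bias)


def loss_gradient_wrt_input(weights, bias, x, label):
    """Gradient of the binary cross-entropy loss with respect to the input x."""
    p = predict(weights, bias, x)
    return [(p - label) * w for w in weights]


def sign(v):
    return (v > 0) - (v < 0)


def fgsm_perturb(weights, bias, x, label, epsilon):
    """Move each input value by at most epsilon in the direction that raises the loss."""
    grad = loss_gradient_wrt_input(weights, bias, x, label)
    return [xi + epsilon * sign(g) for xi, g in zip(x, grad)]


# Hypothetical toy values; `label` stands in for a teacher pseudo-label.
weights = [2.0, -1.0, 0.5]
bias = 0.0
x = [0.2, 0.1, 0.4]
label = 1

x_adv = fgsm_perturb(weights, bias, x, label, epsilon=0.05)
p_clean = predict(weights, bias, x)
p_adv = predict(weights, bias, x_adv)
print(f"clean confidence: {p_clean:.3f}, adversarial confidence: {p_adv:.3f}")
```

The perturbation is bounded by `epsilon` per input value, yet it lowers the model's confidence in the pseudo-label. In a defense setting, the model would then be trained to keep its prediction on `x_adv` consistent with that label.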

What problem does this paper attempt to address?

This paper addresses the difficulty existing object detectors have with the domain shift between training data and real-world data under poor visibility conditions, such as fog and night. Specifically, it argues that current teacher-student methods fall short under these conditions because manually designed data augmentations are not sufficient for optimal teaching. To solve this, the paper proposes a new framework, Adversarial Defense Teacher (ADT), which improves the quality of teacher-student mutual learning by introducing adversarial defense. To better detect small objects under poor visibility, it also proposes a Zoom-in Zoom-out strategy.

### Main Contributions

1. **Introduction of Adversarial Defense Teacher (ADT)**: A method built on the teacher-student framework that enhances the quality of mutual learning through adversarial defense. Small but effective perturbations (adversarial attacks) are applied to the input so that the model's predictions on the perturbed data become highly inconsistent; training the model to generalize on these inputs improves mutual learning.
2. **Proposal of the Zoom-in Zoom-out Strategy**: This strategy improves the pseudo-label recall of small objects by magnifying the image, then shrinks the image and pseudo-labels so that the student model learns more refined features.
3. **Experimental Verification**: The framework is validated through extensive experiments. ADT achieves 54.5% mAP on the Foggy Cityscapes dataset, 2.6% mAP higher than the previous state-of-the-art method.

### Method Overview

- **Weak-strong Augmentation**: The teacher model generates high-confidence pseudo-labels on weakly augmented data, while the student model is trained on strongly augmented data.
- **Adversarial Attack**: Adversarial attacks are applied to strongly augmented data to generate adversarial samples. The perturbations are almost imperceptible to human vision but can significantly change the model's predictions.
- **Zoom-in Zoom-out Strategy**: The teacher model receives magnified images to generate better pseudo-labels; the student model then receives shrunk images and correspondingly shrunk pseudo-labels to extract more refined features.

### Experimental Results

- **Foggy Cityscapes**: ADT outperforms the existing state-of-the-art methods on both the "0.02" split and the "all" split, increasing mAP by 0.8% and 2.6% respectively.
- **BDD100K**: In the day-to-night adaptation task, ADT also performs well, further supporting its effectiveness in different scenarios.

### Conclusion

The ADT framework addresses the domain shift problem of object detection under poor visibility conditions through adversarial defense and the Zoom-in Zoom-out strategy, improving the robustness and performance of the model.
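
The Zoom-in Zoom-out step from the Method Overview amounts to a coordinate rescaling of pseudo-label boxes: boxes predicted on a magnified image must be shrunk to match the smaller image the student sees. The sketch below shows that bookkeeping only. The scale factors, the `(x1, y1, x2, y2)` box format, and the function names are assumptions for illustration and are not taken from the paper.

```python
def scale_boxes(boxes, factor):
    """Scale (x1, y1, x2, y2) boxes by a single factor along both axes."""
    return [tuple(coord * factor for coord in box) for box in boxes]


def zoom_in_to_zoom_out(teacher_boxes, zoom_in, zoom_out):
    """Map boxes from zoomed-in image coordinates to zoomed-out image coordinates.

    zoom_in and zoom_out are both expressed relative to the original image size.
    """
    return scale_boxes(teacher_boxes, zoom_out / zoom_in)


# Hypothetical example: the teacher sees the image at 2x, the student at 0.5x.
teacher_boxes = [(100.0, 40.0, 140.0, 80.0)]  # pseudo-labels on the 2x image
student_boxes = zoom_in_to_zoom_out(teacher_boxes, zoom_in=2.0, zoom_out=0.5)
print(student_boxes)  # [(25.0, 10.0, 35.0, 20.0)]
```

A 40-pixel-wide box on the magnified image becomes a 10-pixel-wide target for the student, which is how pseudo-labels found at high resolution can supervise learning on small objects.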

Adversarial Defense Teacher for Cross-Domain Object Detection under Poor Visibility Conditions

Cross-Domain Adaptive Teacher for Object Detection

Adversarial Attacked Teacher for Unsupervised Domain Adaptive Object Detection

Joint Feature-Level And Pixel-Level Domain Adaption For Object Detection In The Wild

Style-Guided Adversarial Teacher for Cross-Domain Object Detection

Robust and Accurate Object Detection via Adversarial Learning

A Step-Wise Domain Adaptation Detection Transformer for Object Detection under Poor Visibility Conditions

Understanding Object Detection Through An Adversarial Lens

Domain Adaptive Object Detection for Autonomous Driving under Foggy Weather

Masked Retraining Teacher-Student Framework for Domain Adaptive Object Detection

Domain Adaptive Multitask Model for Object Detection in Foggy Weather Conditions

Playing Against Deep-Neural-Network-Based Object Detectors: A Novel Bidirectional Adversarial Attack Approach

Cross-Domain Object Detection through Consistent and Contrastive Teacher with Fourier Transform

Transferable Adversarial Attacks for Object Detection Using Object-Aware Significant Feature Distortion

CAT: Exploiting Inter-Class Dynamics for Domain Adaptive Object Detection

Cooperative Students: Navigating Unsupervised Domain Adaptation in Nighttime Object Detection

Rethinking Weak-to-Strong Augmentation in Source-Free Domain Adaptive Object Detection

Contrastive Mean Teacher for Domain Adaptive Object Detectors

Multi-adversarial Faster-RCNN with Paradigm Teacher for Unrestricted Object Detection

Adversarial Attack and Defense of YOLO Detectors in Autonomous Driving Scenarios

Partial Alignment for Object Detection in the Wild