Abstract: Object detectors frequently suffer significant performance degradation when there is a domain gap between the collected data (source domain) and the data from real-world applications (target domain). To address this problem, numerous unsupervised domain adaptive detectors have been proposed, leveraging carefully designed feature alignment techniques. However, these techniques primarily align instance-level features in a class-agnostic manner, overlooking the differences between features extracted from different categories, which results in only limited improvement. Furthermore, the scope of current alignment modules is often restricted to a limited batch of images, so they fail to capture dataset-level cues, which severely constrains the detector's ability to generalize to the target domain. To address these limitations, we introduce a strong DETR-based detector named Domain Adaptive detection TRansformer (DATR) for unsupervised domain adaptation of object detection. First, we propose the Class-wise Prototypes Alignment (CPA) module, which aligns cross-domain features in a class-aware manner by bridging the gap between the object detection task and the domain adaptation task. Second, the Dataset-level Alignment Scheme (DAS) uses contrastive learning to explicitly guide the detector toward a global representation and to enhance the inter-class distinguishability of instance-level features across the entire dataset, which spans both domains. Finally, DATR incorporates a mean-teacher based self-training framework, using pseudo-labels generated by the teacher model to further mitigate domain bias. Extensive experimental results demonstrate the superior performance and generalization capabilities of the proposed DATR in multiple domain adaptation scenarios. Code is released at
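Two ingredients named in the abstract, class-wise prototypes and a mean-teacher update, can be illustrated with a minimal sketch. This is not the DATR implementation: the function names `class_prototypes` and `ema_update`, the plain-list feature vectors, and the momentum value are all assumptions made for illustration. A prototype is assumed here to be the per-class mean of instance features, and the teacher is assumed to track the student through an exponential moving average (EMA), which is the usual mean-teacher formulation.

```python
from collections import defaultdict


def class_prototypes(features, labels):
    """Average the feature vectors that share a class label.

    Hypothetical helper: `features` is a list of equal-length float lists,
    `labels` is a list of class names aligned with `features`.
    """
    sums = {}
    counts = defaultdict(int)
    for vec, label in zip(features, labels):
        if label not in sums:
            sums[label] = [0.0] * len(vec)
        sums[label] = [s + v for s, v in zip(sums[label], vec)]
        counts[label] += 1
    # One mean vector (prototype) per class.
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def ema_update(teacher, student, momentum=0.999):
    """Move each teacher weight toward the student weight by (1 - momentum).

    Hypothetical helper: weights are stored as {name: float} dictionaries.
    """
    return {
        name: momentum * teacher[name] + (1.0 - momentum) * student[name]
        for name in teacher
    }


# Usage: two "car" instances and one "person" instance with 2-D features.
protos = class_prototypes(
    [[1.0, 3.0], [3.0, 5.0], [0.0, 2.0]],
    ["car", "car", "person"],
)
new_teacher = ema_update({"w": 1.0}, {"w": 0.0}, momentum=0.5)
```

In a class-aware alignment scheme, prototypes like these would be computed separately for the source and target domains and pulled together class by class, rather than aligning all instance features as one undifferentiated set.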
