Abstract:Cross domain object detection learns an object detector for an unlabeled target domain by transferring knowledge from an annotated source domain. Promising results have been achieved via Mean Teacher, however, pseudo labeling which is the bottleneck of mutual learning remains to be further explored. In this study, we find that confidence misalignment of the predictions, including category-level overconfidence, instance-level task confidence inconsistency, and image-level confidence misfocusing, leading to the injection of noisy pseudo label in the training process, will bring suboptimal performance on the target domain. To tackle this issue, we present a novel general framework termed Multi-Granularity Confidence Alignment Mean Teacher (MGCAMT) for cross domain object detection, which alleviates confidence misalignment across category-, instance-, and image-levels simultaneously to obtain high quality pseudo supervision for better teacher-student learning. Specifically, to align confidence with accuracy at category level, we propose Classification Confidence Alignment (CCA) to model category uncertainty based on Evidential Deep Learning (EDL) and filter out the category incorrect labels via an uncertainty-aware selection strategy. Furthermore, to mitigate the instance-level misalignment between classification and localization, we design Task Confidence Alignment (TCA) to enhance the interaction between the two task branches and allow each classification feature to adaptively locate the optimal feature for the regression. Finally, we develop imagery Focusing Confidence Alignment (FCA) adopting another way of pseudo label learning, i.e., we use the original outputs from the Mean Teacher network for supervised learning without label assignment to concentrate on holistic information in the target image. These three procedures benefit from each other from a cooperative learning perspective.

CMT: Co-training Mean-Teacher for Unsupervised Domain Adaptation on 3D Object Detection

Contrastive Mean Teacher for Domain Adaptive Object Detectors

CTS: Sim-to-Real Unsupervised Domain Adaptation on 3D Detection

Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency

Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection

MS3D: Leveraging Multiple Detectors for Unsupervised Domain Adaptation in 3D Object Detection

Cross-Domain Adaptive Teacher for Object Detection

STAL3D: Unsupervised Domain Adaptation for 3D Object Detection via Collaborating Self-Training and Adversarial Learning

ST3D: Self-training for Unsupervised Domain Adaptation on 3D Object Detection

CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection

Cross Domain Object Detection via Multi-Granularity Confidence Alignment based Mean Teacher

MDT3D: Multi-Dataset Training for LiDAR 3D Object Detection Generalization

MA-ST3D: Motion Associated Self-Training for Unsupervised Domain Adaptation on 3D Object Detection

Exploring Object Relation in Mean Teacher for Cross-Domain Detection

Multi-adversarial Faster-RCNN with Paradigm Teacher for Unrestricted Object Detection

Monocular 3D Object Detection via Feature Domain Adaptation

Bi3D: Bi-domain Active Learning for Cross-domain 3D Object Detection

Domain Adaptation in 3D Object Detection with Gradual Batch Alternation Training

Multimodal 3D Object Detection on Unseen Domains

UADA3D: Unsupervised Adversarial Domain Adaptation for 3D Object Detection with Sparse LiDAR and Large Domain Gaps

Source-free domain adaptive object detection based on pseudo-supervised mean teacher