Frequency-based pseudo-domain generation for domain generalizable object detection

Siqi Zhang,Lu Zhang,Zhi-Yong Liu
DOI: https://doi.org/10.1016/j.neucom.2023.126265
IF: 6
2023-04-30
Neurocomputing
Abstract:Domain generalizable object detection (DGOD) aims to train a detector that performs well on multiple unseen target domains, which is crucial for deploying the detector in practice. Recent methods for DGOD typically inherit the idea from domain adaptation to align or disentangle features, but these methods struggle to handle unknown target distributions. In this paper, we propose a unified framework to tackle the DGOD task from a novel pseudo-domain generation perspective. Our framework comprises two stages: distribution diversification and domain-invariant feature learning. In the distribution diversification stage, we design a Frequency-based Pseudo-domain Generator (FPG) to construct the pseudo domain via excavating latent style information and enhancing semantic information in frequency space. The generated pseudo domain can provide diverse training distributions, which enhances generalization performance. In the domain-invariant feature learning stage, we introduce Rotation Prediction and Semantic Consistency (RPSC) learning, including an auxiliary self-supervised task rotation prediction to encourage generalized feature learning and a semantic consistency loss to enforce the detector to be invariant of domain shifts. Extensive experiments are conducted on various object detection benchmarks, demonstrating the superiority of our approach over state-of-the-art methods in both single-source and multi-source settings.
computer science, artificial intelligence
What problem does this paper attempt to address?