DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection

Haochen Li,Rui Zhang,Hantao Yao,Xin Zhang,Yifan Hao,Xinkai Song,Xiaqing Li,Yongwei Zhao,Ling Li,Yunji Chen
2024-10-12
Abstract:Domain adaptive object detection (DAOD) aims to generalize detectors trained on an annotated source domain to an unlabelled target domain. As the visual-language models (VLMs) can provide essential general knowledge on unseen images, freezing the visual encoder and inserting a domain-agnostic adapter can learn domain-invariant knowledge for DAOD. However, the domain-agnostic adapter is inevitably biased to the source domain. It discards some beneficial knowledge discriminative on the unlabelled domain, i.e., domain-specific knowledge of the target domain. To solve the issue, we propose a novel Domain-Aware Adapter (DA-Ada) tailored for the DAOD task. The key point is exploiting domain-specific knowledge between the essential general knowledge and domain-invariant knowledge. DA-Ada consists of the Domain-Invariant Adapter (DIA) for learning domain-invariant knowledge and the Domain-Specific Adapter (DSA) for injecting the domain-specific knowledge from the information discarded by the visual encoder. Comprehensive experiments over multiple DAOD tasks show that DA-Ada can efficiently infer a domain-aware visual encoder for boosting domain adaptive object detection. Our code is available at <a class="link-external link-https" href="https://github.com/Therock90421/DA-Ada" rel="external noopener nofollow">this https URL</a>.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve the key problems in **Domain Adaptive Object Detection (DAOD)**. Specifically, the author focuses on how to improve the generalization ability of the object detection model in the unlabeled target domain. Traditional DAOD methods are usually biased towards the source domain, resulting in poor performance on the target domain. In addition, although existing methods can extract domain - invariant knowledge, they often ignore or lose domain - specific knowledge that is beneficial to the target domain. To solve these problems, the author proposes a new method named **Domain - Aware Adapter (DA - Ada)**. The main contributions of this method are as follows: 1. **Introduce Domain - Aware Adapter**: By simultaneously learning domain - invariant knowledge and domain - specific knowledge, improve the generalization ability of the model on the target domain. 2. **Design Domain - Invariant Adapter (DIA) and Domain - Specific Adapter (DSA)**: DIA is used to extract domain - invariant features, while DSA recovers domain - specific knowledge from the discarded features. 3. **Propose Visual - guided Textual Adapter (VTA)**: Embed visual features into the text encoder to enhance the discrimination ability of the detection head. Through these innovations, DA - Ada can significantly outperform existing methods in multiple DAOD benchmark tests, especially in scenarios such as cross - weather, cross - view, simulation - to - real, and cross - style. ### Formula summary To better understand the working principle of DA - Ada, the following are the key formulas involved in the paper: 1. **Update formula of the DA - Ada module**: \[ h_0 = S(x); \quad h_i = A_i(h_{i - 1}, F_i(h_{i - 1})) \] where \(S\) represents the stem layer, \(A_i\) is the \(i\)-th DA - Ada module, and \(F_i\) is the \(i\)-th block. 2. **Extraction of domain - invariant features and domain - specific features**: \[ h^I_i = A^I_i(h_{i - 1})+F_i(h_{i - 1}); \quad h^S_i = A^S_i(h_{i - 1}-h^I_i) \] 3. **Fusion of domain - invariant features and domain - specific features**: \[ h_i = h^I_i + h^I_i\cdot h^S_i \] where \(\cdot\) represents the element - level Hadamard product. 4. **Dimensionality reduction and elevation operations of the Domain - Invariant Adapter (DIA)**: \[ h^E_i = E_I(h_{i - 1}); \quad h^L_i = C_D(h^E_i); \quad A^I_i(h_{i - 1})=C_U(h^L_i) \] 5. **Feature recovery of the Domain - Specific Adapter (DSA)**: \[ h^D_i = E_S(h_{i - 1}-h^I_i); \quad h^{L'}_i = C'_D(h^D_i); \quad h^S_i = A^S_i(h_{i - 1}-h^I_i)=C'_U(h^{L'}_i) \] 6. **Optimization objective function**: \[ L = L_{\text{det}}+\lambda_{\text{dia}}L_{\text{dia}}+\lambda_{\text{dita}}L_{\text{dita}}+\lambda_{\text{dec}}L_{\text{dec}}+L_{\text