Universal Domain Adaptive Object Detection via Dual Probabilistic Alignment

Yuanfan Zheng, Jinlin Wu, Wuyang Li, Zhen Chen
2024-12-16
Abstract: Domain Adaptive Object Detection (DAOD) transfers knowledge from a labeled source domain to an unannotated target domain under closed-set assumption. Universal DAOD (UniDAOD) extends DAOD to handle open-set, partial-set, and closed-set domain adaptation. In this paper, we first unveil two issues: domain-private category alignment is crucial for global-level features, and the domain probability heterogeneity of features across different levels. To address these issues, we propose a novel Dual Probabilistic Alignment (DPA) framework to model domain probability as Gaussian distribution, enabling the heterogeneity domain distribution sampling and measurement. The DPA consists of three tailored modules: the Global-level Domain Private Alignment (GDPA), the Instance-level Domain Shared Alignment (IDSA), and the Private Class Constraint (PCC). GDPA utilizes the global-level sampling to mine domain-private category samples and calculate alignment weight through a cumulative distribution function to address the global-level private category alignment. IDSA utilizes instance-level sampling to mine domain-shared category samples and calculates alignment weight through Gaussian distribution to conduct the domain-shared category domain alignment to address the feature heterogeneity. The PCC aggregates domain-private category centroids between feature and probability spaces to mitigate negative transfer. Extensive experiments demonstrate that our DPA outperforms state-of-the-art UniDAOD and DAOD methods across various datasets and scenarios, including open, partial, and closed sets. Codes are available at https://github.com/zyfone/DPA.
Computer Vision and Pattern Recognition, Artificial Intelligence, Machine Learning
### What problems does this paper attempt to solve?

This paper aims to solve two key problems in **Universal Domain Adaptive Object Detection (UniDAOD)**:

1. **Neglect of global feature alignment for private classes**:
   - Existing UniDAOD methods focus mainly on aligning shared classes and ignore the alignment of private classes at the global feature level. The authors' experiments show that global features are more inclined to align private classes, while instance features are more inclined to align shared classes. Ignoring private-class alignment at the global level therefore leads to suboptimal domain alignment.
2. **Heterogeneity of different feature levels**:
   - Global features and instance features differ significantly in representation, so their domain probability distributions differ as well. Existing methods handle this by setting different thresholds and entropy functions, but these approaches require manual parameter tuning, are highly complex, and are difficult to adapt to detection tasks.

To solve these problems, the authors propose a new **Dual Probabilistic Alignment (DPA) framework**. The DPA framework consists of three modules:

- **Global-level Domain Private Alignment (GDPA)**: Aligns private classes at the global feature level.
- **Instance-level Domain Shared Alignment (IDSA)**: Aligns shared classes at the instance feature level.
- **Private Class Constraint (PCC)**: Reduces the negative transfer caused by aligning private classes.

With these modules, the DPA framework handles domain-adaptive object detection in open-set, partial-set, and closed-set scenarios and outperforms existing methods on multiple datasets.

### Summary of mathematical formulas

1. **Error risk decomposition**:
   \[ \epsilon(h)=\int_{x}|h(x)-\psi(x)|P_{x,\psi}dx = \int_{x_c}F_{h,\psi}P_{x,\psi}dx+\int_{x_{c^*}}F_{h,\psi}P_{x,\psi}dx=\epsilon_c(h)+\epsilon_{c^*}(h) \]
   where \(F_{h,\psi}=|h(x)-\psi(x)|\), and \(\epsilon_c(h)\) and \(\epsilon_{c^*}(h)\) are the error risks of the shared classes and the private classes, respectively.
2. **Upper bound of error risk**:
   \[ \epsilon_t^c(h)\leq \underbrace{\epsilon_s(h)}_{L_{det}}-\underbrace{\epsilon_t^{c^*}(h)}_{L_{PCC}}+\underbrace{d_H(D_s^c,D_t^c)}_{L_{IDSA}}+\underbrace{d_H(D_s^{c^*},D_t^{c^*})}_{L_{GDPA}}+\text{other terms} \]
3. **Global-level sampling**:
   \[ \Omega_{neg}^g = \{i\mid \|x_i - C\|_2>d\}, \quad \Omega_{pos}^g = \{i\mid \|x_i - C\|_2\leq d\} \]
   where \(C\) is the dynamic feature center and \(d\) is the learnable radius.
4. **Alignment weight computed from the cumulative distribution function (CDF)**:
   \[ \Phi(z)=\frac{1}{2} \left[