Tackling Dimensional Collapse toward Comprehensive Universal Domain Adaptation

Hung-Chieh Fang, Po-Yi Lu, Hsuan-Tien Lin
2025-02-11
Abstract: Universal Domain Adaptation (UniDA) addresses unsupervised domain adaptation where target classes may differ arbitrarily from source ones, except for a shared subset. An important approach, partial domain matching (PDM), aligns only shared classes but struggles in extreme cases where many source classes are absent in the target domain, underperforming the most naive baseline that trains on only source data. In this work, we identify that the failure of PDM for extreme UniDA stems from dimensional collapse (DC) in target representations. To address target DC, we propose to jointly leverage the alignment and uniformity techniques in modern self-supervised learning (SSL) on the unlabeled target data to preserve the intrinsic structure of the learned representations. Our experimental results confirm that SSL consistently advances PDM and delivers new state-of-the-art results across a broader benchmark of UniDA scenarios with different portions of shared classes, representing a crucial step toward truly comprehensive UniDA.
Machine Learning
What problem does this paper attempt to address?
The problem this paper addresses is how to perform unsupervised domain adaptation in Universal Domain Adaptation (UniDA), where the target classes may differ arbitrarily from the source classes. In extreme cases, where the source domain contains many source-private classes and the target domain shares only a few classes with it, existing Partial Domain Matching (PDM) methods perform poorly, even worse than a simple baseline trained only on source data.

### Specific description of the problem

1. **Universal Domain Adaptation (UniDA)**
   - UniDA handles settings where the source and target domains have different class sets: the target classes may be entirely different from the source classes, apart from a shared subset.
   - The goal is to classify each target sample as one of the shared classes or as an unseen class.
2. **Limitations of existing methods**
   - Existing PDM methods avoid negative transfer by aligning only the shared classes, but they perform poorly in extreme cases with many source-private classes and only a few shared classes.
   - The paper attributes this failure to dimensional collapse (DC): the target representations collapse into a low-dimensional subspace, so the representation space loses diversity. This degrades the accuracy of the importance weight function and ultimately weakens PDM.

### Main contributions of the paper

1. **Identify and analyze the dimensional collapse problem in extreme UniDA**
   - Through detailed ablation studies and analysis, the paper shows that a high proportion of source-private classes degrades the quality of the target representations, which in turn hurts the performance of PDM methods.
2. **Propose a new method to solve dimensional collapse**
   - The paper applies modern Self-Supervised Learning (SSL) techniques, combining an alignment loss and a uniformity loss, to preserve the intrinsic structure of the target representations and prevent dimensional collapse.
   - The experimental results show that SSL consistently improves PDM in extreme UniDA scenarios and reaches a new state of the art on a broader benchmark.

### Formula summary

- **Alignment Loss**
  \[ L_{\text{Align}}(\theta_f)=\mathbb{E}_{x\sim p}\left\|\theta_f(T(x))-\theta_f(T'(x))\right\|_2^2 \]
  where \(T\) and \(T'\) are independent random augmentation functions.
- **Uniformity Loss**
  \[ L_{\text{Uniform}}(\theta_f)=\log\mathbb{E}_{x,x'\sim p}\left[e^{-t\left\|\theta_f(x)-\theta_f(x')\right\|_2^2}\right] \]
  where \(t\) is a fixed hyperparameter.

With these two losses, the paper addresses the dimensional collapse problem in extreme UniDA and offers new directions and tools for future research.
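
To make the alignment and uniformity losses in the formula summary concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes the encoder outputs are already available as arrays and are L2-normalized onto the unit sphere; the summary does not state this. The function names `l2_normalize`, `alignment_loss`, and `uniformity_loss`, the value `t=2.0`, and the synthetic data are all illustrative choices.

```python
import numpy as np


def l2_normalize(z, eps=1e-12):
    """Project each row onto the unit sphere (an assumption of this sketch)."""
    norms = np.linalg.norm(z, axis=1, keepdims=True)
    return z / np.maximum(norms, eps)


def alignment_loss(z_a, z_b):
    """Mean squared L2 distance between embeddings of two augmented views.

    z_a[i] and z_b[i] stand in for theta_f(T(x_i)) and theta_f(T'(x_i)).
    """
    return float(np.mean(np.sum((z_a - z_b) ** 2, axis=1)))


def uniformity_loss(z, t=2.0):
    """Log of the mean Gaussian potential over all distinct pairs of rows."""
    sq_norms = np.sum(z ** 2, axis=1)
    sq_dists = sq_norms[:, None] + sq_norms[None, :] - 2.0 * (z @ z.T)
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against tiny negative values
    upper = np.triu_indices(z.shape[0], k=1)  # each unordered pair once
    return float(np.log(np.mean(np.exp(-t * sq_dists[upper]))))


if __name__ == "__main__":
    rng = np.random.default_rng(0)

    # Hypothetical embeddings that use all 16 dimensions.
    spread = l2_normalize(rng.normal(size=(256, 16)))

    # Hypothetical collapsed embeddings: only 2 of 16 dimensions are non-zero.
    low_dim = np.pad(rng.normal(size=(256, 2)), ((0, 0), (0, 14)))
    collapsed = l2_normalize(low_dim)

    # Two noisy "views" of the same samples for the alignment term.
    view_a = l2_normalize(spread + 0.05 * rng.normal(size=spread.shape))
    view_b = l2_normalize(spread + 0.05 * rng.normal(size=spread.shape))

    print("alignment :", alignment_loss(view_a, view_b))
    print("uniformity (spread)   :", uniformity_loss(spread))
    print("uniformity (collapsed):", uniformity_loss(collapsed))
```

In this toy setup the embeddings confined to a two-dimensional subspace receive a higher uniformity loss than the embeddings spread over all dimensions, so minimizing the uniformity term pushes against that kind of collapse. The alignment term keeps the two augmented views of a sample close together.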