On $f$-Divergence Principled Domain Adaptation: An Improved Framework

Ziqiao Wang,Yongyi Mao
2024-10-27
Abstract:Unsupervised domain adaptation (UDA) plays a crucial role in addressing distribution shifts in machine learning. In this work, we improve the theoretical foundations of UDA proposed in Acuna et al. (2021) by refining their $f$-divergence-based discrepancy and additionally introducing a new measure, $f$-domain discrepancy ($f$-DD). By removing the absolute value function and incorporating a scaling parameter, $f$-DD obtains novel target error and sample complexity bounds, allowing us to recover previous KL-based results and bridging the gap between algorithms and theory presented in Acuna et al. (2021). Using a localization technique, we also develop a fast-rate generalization bound. Empirical results demonstrate the superior performance of $f$-DD-based learning algorithms over previous works in popular UDA benchmarks.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the distribution shift problem in Unsupervised Domain Adaptation (UDA). Specifically, the paper aims to improve the existing UDA theoretical framework, especially the domain adaptation methods based on f - divergence. The main objectives include: 1. **Improve the theoretical basis**: By improving the f - divergence - based discrepancy measure proposed by Acuna et al. (2021), the paper introduces a new measure - f - domain discrepancy (f - DD). This new measure obtains new target error and sample complexity bounds by removing the absolute value function and introducing a scaling parameter. 2. **Restore previous results**: By using the localization technique, the paper develops fast - rate generalization bounds and is able to restore previous results based on KL - divergence, thus bridging the gap between theory and algorithms. 3. **Improve algorithm performance**: The paper experimentally verifies that the f - DD - based learning algorithm outperforms previous work in popular UDA benchmark tests. ### Specific problems and solutions 1. **Limitations of f - divergence**: - **Weakness of variational representation**: The existing variational representation of f - divergence has weaknesses and cannot restore the KL - divergence representation of Donsker and Varadhan. - **Impact of the absolute value function**: Existing methods rely on the absolute value function when deriving the upper bound of the target error, which may lead to over - estimation. - **Inconsistency between theory and algorithms**: There is a gap between existing methods in theory and practical algorithms. 2. **Improvement measures**: - **New variational representation of f - divergence**: The paper adopts a more advanced variational representation of f - divergence, removing the absolute value function and introducing a scaling parameter. - **f - domain discrepancy (f - DD)**: A new measure f - DD is defined. By removing the absolute value function and introducing a scaling parameter, new upper bounds of the target error and sample complexity bounds are obtained. - **Localization technique**: The localization technique is used to further optimize the convergence rate of f - DD, obtaining fast - rate target error bounds. 3. **Experimental verification**: - The paper conducts experiments on two benchmark datasets, Office - 31 and Office - Home. The results show that the f - DD - based algorithm outperforms existing UDA methods on multiple tasks, especially achieving the best performance when using Jeffreys - DD. ### Main contributions 1. **Theoretical contributions**: - A new variational representation of f - divergence is proposed, improving the existing f - divergence measure. - f - domain discrepancy (f - DD) is introduced, and its effectiveness and superiority are proven through theoretical analysis. 2. **Algorithm contributions**: - A domain adaptation algorithm based on f - DD is designed, and its superior performance on multiple UDA benchmarks is experimentally verified. 3. **Experimental contributions**: - Extensive experiments are carried out on the Office - 31 and Office - Home datasets, showing the performance advantages of the f - DD - based algorithm on different tasks. Through these improvements and innovations, the paper provides a more solid theoretical basis and a more effective algorithm framework for unsupervised domain adaptation.