Domain Generalization by Learning and Removing Domain-specific Features

Yu Ding,Lei Wang,Bin Liang,Shuming Liang,Yang Wang,Fang Chen
DOI: https://doi.org/10.48550/arXiv.2212.07101
2022-12-14
Abstract:Deep Neural Networks (DNNs) suffer from domain shift when the test dataset follows a distribution different from the training dataset. Domain generalization aims to tackle this issue by learning a model that can generalize to unseen domains. In this paper, we propose a new approach that aims to explicitly remove domain-specific features for domain generalization. Following this approach, we propose a novel framework called Learning and Removing Domain-specific features for Generalization (LRDG) that learns a domain-invariant model by tactically removing domain-specific features from the input images. Specifically, we design a classifier to effectively learn the domain-specific features for each source domain, respectively. We then develop an encoder-decoder network to map each input image into a new image space where the learned domain-specific features are removed. With the images output by the encoder-decoder network, another classifier is designed to learn the domain-invariant features to conduct image classification. Extensive experiments demonstrate that our framework achieves superior performance compared with state-of-the-art methods.
Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
This paper aims to solve the domain shift problem encountered by deep neural networks (DNNs) when the distribution of the test data set is different from that of the training data set. Specifically, the goal of domain generalization is to perform well in unseen target domains. This paper proposes a new method to achieve domain generalization by explicitly removing domain - specific features. To this end, the author designs a new framework named "Learning and Removing Domain - specific features for Generalization" (LRDG), which achieves the goal through the following steps: 1. **Learning domain - specific features**: First, a domain - specific classifier is designed for each source domain respectively. These classifiers can effectively learn domain - specific features from their respective domains. Each domain - specific classifier is designed to be able to effectively classify categories only within a specific source domain, and cannot classify categories in any other source domain. 2. **Removing domain - specific features**: Next, an encoder - decoder network is used to map the input image to a new image space, in which the previously learned domain - specific features are removed. In this way, the encoder - decoder network actively removes the domain - specific features in the input image and retains the domain - invariant features. 3. **Learning domain - invariant features**: Finally, the image processed by the encoder - decoder network is input into a domain - invariant classifier, which is used to learn domain - invariant features and perform image classification. Through this series of steps, the LRDG framework can not only effectively reduce the distribution differences between source domains, but also enable the model to better adapt to unseen target domains, thereby improving the performance of domain generalization. The experimental results show that this framework has achieved better performance than existing methods on multiple benchmark data sets.