Abstract: Open-set domain generalization addresses a real-world challenge: training a model to generalize across unseen domains (domain generalization) while also detecting samples from unknown classes not encountered during training (open-set recognition). However, most existing approaches tackle these issues separately, limiting their practical applicability. To overcome this limitation, we propose a unified framework for open-set domain generalization by introducing Feature-space Semantic Invariance (FSI). FSI maintains semantic consistency across different domains within the feature space, enabling more accurate detection of OOD instances in unseen domains. Additionally, we adopt a generative model to produce synthetic data with novel domain styles or class labels, enhancing model robustness. Initial experiments show that our method improves AUROC by 9.1% to 18.9% on ColoredMNIST, while also significantly increasing in-distribution classification accuracy.
What problem does this paper attempt to address?
This paper addresses two core challenges of open-set domain generalization (OODG): domain generalization (DG) and open-set recognition (OSR). Specifically, it aims to train a model that classifies correctly in unseen domains while also detecting samples from unknown classes never encountered during training. Both tasks matter in real-world deployments, but most existing methods treat them as independent problems, which limits their practical applicability.
To overcome this limitation, the authors propose a framework that solves both problems in a unified way by introducing Feature-space Semantic Invariance (FSI). FSI maintains semantic consistency across domains in the feature space, which improves the detection of out-of-distribution (OOD) instances and strengthens generalization to unseen domains. The authors also use a generative model to produce synthetic data with novel domain styles or class labels, further improving model robustness.
### Main contributions:
1. **Introducing Feature-space Semantic Invariance (FSI)**:
- FSI enables the model to learn high-quality domain-invariant features by aligning semantic features in the feature space, which enhances its generalization to unseen domains.
- Formally, FSI is defined as:
\[
g(x_e, \theta_g) = g(x_{e'}, \theta_g) \quad \text{almost everywhere}
\]
where \( x_{e'} = G(x_e, e') \), \( x_e \sim P(X_e) \), \( x_{e'} \sim P(X_{e'}) \), for all \( e, e' \in E \).
2. **Generating synthetic OOD data**:
- Synthetic OOD data generated from in-distribution (ID) samples helps establish a clearer decision boundary between ID and OOD instances, which significantly improves the model's robustness to OOD inputs.
3. **Experimental results**:
- Preliminary experiments show that this method improves the AUROC on the ColoredMNIST dataset by 9.1% to 18.9% and also significantly improves the accuracy of ID classification.
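To make the reported metric concrete, here is a minimal sketch of how AUROC can be computed for OOD detection. It assumes a scoring convention in which a higher score means "more likely OOD"; this section does not specify the paper's scoring function, so the scores below are purely illustrative. AUROC equals the probability that a randomly chosen OOD sample receives a higher score than a randomly chosen ID sample, with ties counted as one half.

```python
def auroc(id_scores, ood_scores):
    """AUROC for OOD detection via pairwise comparison.

    Assumption (not from the paper): higher score = more likely OOD.
    Returns the fraction of (OOD, ID) pairs in which the OOD sample
    scores higher, counting ties as half a correctly ordered pair.
    """
    if not id_scores or not ood_scores:
        raise ValueError("both score lists must be non-empty")
    wins = 0.0
    for s_ood in ood_scores:
        for s_id in id_scores:
            if s_ood > s_id:
                wins += 1.0
            elif s_ood == s_id:
                wins += 0.5
    return wins / (len(id_scores) * len(ood_scores))


# Illustrative scores only: 8 of the 9 (OOD, ID) pairs are ordered correctly.
example_id = [0.1, 0.2, 0.4]
example_ood = [0.3, 0.8, 0.9]
example_auroc = auroc(example_id, example_ood)  # 8/9
```

A detector that separates ID from OOD perfectly scores 1.0, and one that assigns identical scores to everything scores 0.5, which is the chance level against which improvements are usually read.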
### Method overview:
- **Problem setting**:
- Given a dataset \( D \), consider a set of domains \( E=\{e\}_{e = 1}^E \), where each domain \( e \) corresponds to a data subset \( D_e=\{(x_i^e, y_i^e)\}_{i = 1}^{|D_e|} \) with domain-specific data variations.
- Split \( D \) by domain into training domains \( E_{tr}\subset E \) and test domains \( E_{te}=E\setminus E_{tr} \).
- **Network design**:
- The network is defined as \( f := h\circ g \), where \( g:X\times\Theta\rightarrow\mathbb{R}^r \) is a feature extractor and \( h:\mathbb{R}^r\times\Theta\rightarrow\mathbb{R}^K \) is a classifier applied to the extracted features.
- Enforce feature-space semantic invariance: the conditional distribution of the features produced by the extractor \( g \) should remain stable across domains.
- **Regularization term**:
- **Feature regularization \( R_F \)**:
\[
R_F=\mathbb{E}_{(x_e, y)\sim D_{tr}} \left[ d(g(x_e, \theta_g), g(G(x_e, v_{e'}), \theta_g)) \right]
\]
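The feature regularizer \( R_F \) can be illustrated with a small, self-contained sketch. Everything concrete here is an assumption made for illustration: the toy extractor `g` (a weighted sum over color channels), the toy domain transfer `G` (recoloring a pixel, loosely in the spirit of ColoredMNIST), the choice of squared Euclidean distance for \( d \), and the uniform average over target styles \( v_{e'} \). This section does not specify any of these, so this is not the authors' implementation.

```python
def g(x, theta_g):
    """Toy feature extractor (hypothetical): per-pixel weighted channel sum."""
    return [sum(w * c for w, c in zip(theta_g, pixel)) for pixel in x]


def G(x, v):
    """Toy domain transfer (hypothetical): move each pixel's intensity
    into color channel v, leaving the shape (semantics) unchanged."""
    out = []
    for pixel in x:
        recolored = [0.0, 0.0, 0.0]
        recolored[v] = max(pixel)
        out.append(tuple(recolored))
    return out


def d(a, b):
    """Assumed distance: squared Euclidean distance between feature vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def feature_regularizer(batch, theta_g, target_styles):
    """Monte Carlo estimate of R_F: the mean of d(g(x_e), g(G(x_e, v)))
    over all samples in the batch and all target styles v."""
    terms = [
        d(g(x, theta_g), g(G(x, v), theta_g))
        for x in batch
        for v in target_styles
    ]
    return sum(terms) / len(terms)


# One tiny "red" image with three RGB pixels.
batch = [[(1.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.5, 0.0, 0.0)]]

# Color-blind weights give identical features after recoloring: R_F is 0.
r_invariant = feature_regularizer(batch, (1.0, 1.0, 1.0), [1, 2])

# Weights that read only the red channel are penalized: R_F is 1.25.
r_sensitive = feature_regularizer(batch, (1.0, 0.0, 0.0), [1])
```

In this toy setting the penalty vanishes exactly when the extractor ignores the domain-specific factor (color) and keeps the semantic one (intensity pattern), which is the behavior the FSI condition \( g(x_e, \theta_g) = g(x_{e'}, \theta_g) \) asks for.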