Rethinking Domain Generalization: Discriminability and Generalizability

Shaocong Long,Qianyu Zhou,Chenhao Ying,Lizhuang Ma,Yuan Luo
DOI: https://doi.org/10.1109/TCSVT.2024.3422887
2024-07-29
Abstract:Domain generalization(DG) endeavors to develop robust models that possess strong generalizability while preserving excellent discriminability. Nonetheless, pivotal DG techniques tend to improve the feature generalizability by learning domain-invariant representations, inadvertently overlooking the feature discriminability. On the one hand, the simultaneous attainment of generalizability and discriminability of features presents a complex challenge, often entailing inherent contradictions. This challenge becomes particularly pronounced when domain-invariant features manifest reduced discriminability owing to the inclusion of unstable factors, i.e., spurious correlations. On the other hand, prevailing domain-invariant methods can be categorized as category-level alignment, susceptible to discarding indispensable features possessing substantial generalizability and narrowing intra-class variations. To surmount these obstacles, we rethink DG from a new perspective that concurrently imbues features with formidable discriminability and robust generalizability, and present a novel framework, namely, Discriminative Microscopic Distribution Alignment~(DMDA). DMDA incorporates two core components: Selective Channel Pruning~(SCP) and Micro-level Distribution Alignment~(MDA). Concretely, SCP attempts to curtail redundancy within neural networks, prioritizing stable attributes conducive to accurate classification. This approach alleviates the adverse effect of spurious domain invariance and amplifies the feature discriminability. Besides, MDA accentuates micro-level alignment within each class, going beyond mere category-level alignment. Extensive experiments on four benchmark datasets corroborate that DMDA achieves comparable results to state-of-the-art methods in DG, underscoring the efficacy of our method.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the problem of how to simultaneously enhance the discriminability and generalizability of features in Domain Generalization (DG). Specifically: 1. **Limitations of existing methods**: Most current domain generalization techniques improve the generalizability of features by learning domain-invariant representations, but often neglect the discriminability of features, leading to a decline in model performance when handling data from new domains. 2. **Conflict between feature discriminability and generalizability**: Enhancing both the discriminability and generalizability of features is a complex and inherently contradictory challenge. Particularly, when domain-invariant features contain unstable factors (i.e., spurious correlations), the discriminability of features is affected. 3. **Limitations of class alignment**: Existing domain-invariant methods mainly focus on class-level alignment. This coarse-grained approach may discard some important features with significant generalization ability and ignore intra-class variations. To overcome these challenges, the authors propose a new framework—Discriminative Microscopic Distribution Alignment (DMDA), which includes two core components: Selective Channel Pruning (SCP) and Micro-level Distribution Alignment (MDA). These two components work together to enhance the discriminability and generalizability of features. Specifically: - **SCP**: Reduces redundancy by filtering out unstable channels in the features, prioritizing the retention of stable attributes that contribute to accurate classification, thereby improving feature discriminability. - **MDA**: Performs micro-level alignment within each class, going beyond traditional class-level alignment strategies, thus retaining sufficient generalizable features and promoting intra-class variation. Experimental results show that DMDA outperforms the current state-of-the-art methods on four benchmark datasets, validating the effectiveness and superiority of the proposed method.