Unraveling the Key Components of OOD Generalization via Diversification

Harold Benoit, Liangze Jiang, Andrei Atanov, Oğuzhan Fatih Kar, Mattia Rigotti, Amir Zamir
2024-04-20
Abstract: Supervised learning datasets may contain multiple cues that explain the training set equally well, i.e., learning any of them would lead to the correct predictions on the training data. However, many of them can be spurious, i.e., lose their predictive power under a distribution shift and consequently fail to generalize to out-of-distribution (OOD) data. Recently developed "diversification" methods (Lee et al., 2023; Pagliardini et al., 2023) approach this problem by finding multiple diverse hypotheses that rely on different features. This paper aims to study this class of methods and identify the key components contributing to their OOD generalization abilities.
Machine Learning