Sequential Invariant Information Bottleneck

Yichen Zhang,Shujian Yu,Badong Chen
DOI: https://doi.org/10.1109/icassp49357.2023.10094563
2023-01-01
Abstract:Previous approaches to the problem of generalization for out-of-distribution (OOD) data usually assume that data from each environment is available simultaneously, which is unrealistic in real-world applications. In this paper, we develop a new framework termed the sequential invariant information bottleneck (seq-IIB) to improve the generalization ability of learning agents in sequential environments. Our main idea is to combine the merits of the famed Information Bottleneck (IB) principle with the Invariant Risk Minimization (IRM), such that the learning agent can gradually remove spurious features and remain invariant and compact task-relevant information in a sequential manner. Experimental results on three MNIST-like datasets show the effectiveness of our method.
What problem does this paper attempt to address?