Out-of-distribution forgetting: vulnerability of continual learning to intra-class distribution shift

Liangxuan Guo,Yang Chen,Shan Yu
2024-07-04
Abstract:Continual learning (CL) is an important technique to allow artificial neural networks to work in open environments. CL enables a system to learn new tasks without severe interference to its performance on old tasks, i.e., overcome the problems of catastrophic forgetting. In joint learning, it is well known that the out-of-distribution (OOD) problem caused by intentional attacks or environmental perturbations will severely impair the ability of networks to generalize. In this work, we reported a special form of catastrophic forgetting raised by the OOD problem in continual learning settings, and we named it out-of-distribution forgetting (OODF). In continual image classification tasks, we found that for a given category, introducing an intra-class distribution shift significantly impaired the recognition accuracy of CL methods for that category during subsequent learning. Interestingly, this phenomenon is special for CL as the same level of distribution shift had only negligible effects in the joint learning scenario. We verified that CL methods without dedicating subnetworks for individual tasks are all vulnerable to OODF. Moreover, OODF does not depend on any specific way of shifting the distribution, suggesting it is a risk for CL in a wide range of circumstances. Taken together, our work identified an under-attended risk during CL, highlighting the importance of developing approaches that can overcome OODF. Code available: \url{<a class="link-external link-https" href="https://github.com/Hiroid/OODF" rel="external noopener nofollow">this https URL</a>}
Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is a new form of catastrophic forgetting - Out - of - Distribution Forgetting (OODF) caused by intra - class distribution shift during the Continual Learning (CL) process. Specifically, the paper focuses on the fact that in CL settings, when learning new tasks, even introducing a slight distribution change within a specific class can significantly reduce the recognition accuracy of many CL methods for previous tasks. This phenomenon is called OODF, and the paper points out that compared with the joint learning scenario, the same degree of distribution change will lead to a more severe performance degradation in the CL scenario. Moreover, most CL strategies (except for the parameter isolation strategy) are vulnerable to OODF. Therefore, the paper aims to reveal this overlooked risk in CL research and emphasize the importance of developing methods that can overcome OODF.