Out-of-distribution forgetting: vulnerability of continual learning to intra-class distribution shift

Liangxuan Guo,Yang Chen,Shan Yu

2024-07-04

Abstract:Continual learning (CL) is an important technique to allow artificial neural networks to work in open environments. CL enables a system to learn new tasks without severe interference to its performance on old tasks, i.e., overcome the problems of catastrophic forgetting. In joint learning, it is well known that the out-of-distribution (OOD) problem caused by intentional attacks or environmental perturbations will severely impair the ability of networks to generalize. In this work, we reported a special form of catastrophic forgetting raised by the OOD problem in continual learning settings, and we named it out-of-distribution forgetting (OODF). In continual image classification tasks, we found that for a given category, introducing an intra-class distribution shift significantly impaired the recognition accuracy of CL methods for that category during subsequent learning. Interestingly, this phenomenon is special for CL as the same level of distribution shift had only negligible effects in the joint learning scenario. We verified that CL methods without dedicating subnetworks for individual tasks are all vulnerable to OODF. Moreover, OODF does not depend on any specific way of shifting the distribution, suggesting it is a risk for CL in a wide range of circumstances. Taken together, our work identified an under-attended risk during CL, highlighting the importance of developing approaches that can overcome OODF. Code available: \url{<a class="link-external link-https" href="https://github.com/Hiroid/OODF" rel="external noopener nofollow">this https URL</a>}

Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem that this paper attempts to solve is a new form of catastrophic forgetting - Out - of - Distribution Forgetting (OODF) caused by intra - class distribution shift during the Continual Learning (CL) process. Specifically, the paper focuses on the fact that in CL settings, when learning new tasks, even introducing a slight distribution change within a specific class can significantly reduce the recognition accuracy of many CL methods for previous tasks. This phenomenon is called OODF, and the paper points out that compared with the joint learning scenario, the same degree of distribution change will lead to a more severe performance degradation in the CL scenario. Moreover, most CL strategies (except for the parameter isolation strategy) are vulnerable to OODF. Therefore, the paper aims to reveal this overlooked risk in CL research and emphasize the importance of developing methods that can overcome OODF.

Out-of-distribution forgetting: vulnerability of continual learning to intra-class distribution shift

Progressive Learning without Forgetting

Adaptive online continual multi-view learning

Class Relevance Learning For Out-of-distribution Detection

Out-of-Distribution Learning with Human Feedback

Learning by Erasing: Conditional Entropy based Transferable Out-Of-Distribution Detection

Forgetting, Ignorance or Myopia: Revisiting Key Challenges in Online Continual Learning

TARGET: Federated Class-Continual Learning Via Exemplar-Free Distillation

On the Learnability of Out-of-distribution Detection

The Best of Both Worlds: On the Dilemma of Out-of-distribution Detection

Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion

Defeating Catastrophic Forgetting via Enhanced Orthogonal Weights Modification

Maintaining Discrimination and Fairness in Class Incremental Learning

Out-of-Distribution Detection in Long-Tailed Recognition with Calibrated Outlier Class Learning

Rethinking Out-of-Distribution Detection on Imbalanced Data Distribution

Does Continual Learning Equally Forget All Parameters?

Distributionally Robust Memory Evolution With Generalized Divergence for Continual Learning

Certifiable Out-of-Distribution Generalization.

A noisy elephant in the room: Is your out-of-distribution detector robust to label noise?

Online Distillation with Continual Learning for Cyclic Domain Shifts

Adaptive Feature Generation for Online Continual Learning from Imbalanced Data