Catastrophic Forgetting in Deep Learning: A Comprehensive Taxonomy

Everton L. Aleixo,Juan G. Colonna,Marco Cristo,Everlandio Fernandes

2023-12-17

Abstract:Deep Learning models have achieved remarkable performance in tasks such as image classification or generation, often surpassing human accuracy. However, they can struggle to learn new tasks and update their knowledge without access to previous data, leading to a significant loss of accuracy known as Catastrophic Forgetting (CF). This phenomenon was first observed by McCloskey and Cohen in 1989 and remains an active research topic. Incremental learning without forgetting is widely recognized as a crucial aspect in building better AI systems, as it allows models to adapt to new tasks without losing the ability to perform previously learned ones. This article surveys recent studies that tackle CF in modern Deep Learning models that use gradient descent as their learning algorithm. Although several solutions have been proposed, a definitive solution or consensus on assessing CF is yet to be established. The article provides a comprehensive review of recent solutions, proposes a taxonomy to organize them, and identifies research gaps in this area.

Machine Learning,Artificial Intelligence

What problem does this paper attempt to address?

The paper mainly focuses on the issue of Catastrophic Forgetting (CF) in deep learning models and aims to provide a comprehensive classification system to organize recent research findings on this problem. The paper first introduces the phenomenon of catastrophic forgetting, which refers to the tendency of deep learning models to forget previously learned knowledge when learning new tasks, leading to a significant decline in performance on old tasks. This phenomenon was first observed by McCloskey and Cohen in 1989 and remains an active research topic in the field of artificial intelligence. Subsequently, the paper reviews various methods to address catastrophic forgetting and proposes a classification system to organize these methods. The classification system includes four main categories: 1. **Rehearsal**: Using data or data representations from previous tasks to prevent forgetting. 2. **Distance-Based Methods**: Judging tasks or categories by estimating the similarity between samples. 3. **Sub-Networks**: Dividing the model into different sub-models to avoid overlapping knowledge between tasks. 4. **Dynamic Networks**: Allowing the model structure to expand with the addition of new tasks to accommodate more pattern recognition capabilities. The paper also discusses the advantages and limitations of various methods and points out some gaps in current research. The authors emphasize that although many solutions have been proposed, there is still no consensus on the best practices or evaluation standards. Additionally, the paper suggests some future research directions, including exploring the combined use of different methods. In summary, this paper aims to provide a comprehensive review and analytical framework for the problem of catastrophic forgetting to promote further research and development in this field.

Catastrophic Forgetting in Deep Learning: A Comprehensive Taxonomy

Progressive Learning without Forgetting

Measuring Catastrophic Forgetting in Neural Networks

Overcoming Long-Term Catastrophic Forgetting Through Adversarial Neural Pruning and Synaptic Consolidation

Catastrophic Forgetting in the Context of Model Updates

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics

Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting

Continual Learning and Catastrophic Forgetting

A Methodology-Oriented Study of Catastrophic Forgetting in Incremental Deep Neural Networks

Challenging Common Assumptions about Catastrophic Forgetting

Catastrophic forgetting: still a problem for DNNs

Explaining How Deep Neural Networks Forget by Deep Visualization

Overcoming Catastrophic Forgetting by XAI

Catastrophic Importance of Catastrophic Forgetting

An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks

Weight Friction: A Simple Method to Overcome Catastrophic Forgetting and Enable Continual Learning

Dissecting Catastrophic Forgetting in Continual Learning by Deep Visualization

Slowing Down Forgetting in Continual Learning

An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning

EXACFS -- A CIL Method to mitigate Catastrophic Forgetting

Revisiting Catastrophic Forgetting in Large Language Model Tuning