Catastrophic Forgetting in Deep Learning: A Comprehensive Taxonomy

Everton L. Aleixo,Juan G. Colonna,Marco Cristo,Everlandio Fernandes
2023-12-17
Abstract:Deep Learning models have achieved remarkable performance in tasks such as image classification or generation, often surpassing human accuracy. However, they can struggle to learn new tasks and update their knowledge without access to previous data, leading to a significant loss of accuracy known as Catastrophic Forgetting (CF). This phenomenon was first observed by McCloskey and Cohen in 1989 and remains an active research topic. Incremental learning without forgetting is widely recognized as a crucial aspect in building better AI systems, as it allows models to adapt to new tasks without losing the ability to perform previously learned ones. This article surveys recent studies that tackle CF in modern Deep Learning models that use gradient descent as their learning algorithm. Although several solutions have been proposed, a definitive solution or consensus on assessing CF is yet to be established. The article provides a comprehensive review of recent solutions, proposes a taxonomy to organize them, and identifies research gaps in this area.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The paper mainly focuses on the issue of Catastrophic Forgetting (CF) in deep learning models and aims to provide a comprehensive classification system to organize recent research findings on this problem. The paper first introduces the phenomenon of catastrophic forgetting, which refers to the tendency of deep learning models to forget previously learned knowledge when learning new tasks, leading to a significant decline in performance on old tasks. This phenomenon was first observed by McCloskey and Cohen in 1989 and remains an active research topic in the field of artificial intelligence. Subsequently, the paper reviews various methods to address catastrophic forgetting and proposes a classification system to organize these methods. The classification system includes four main categories: 1. **Rehearsal**: Using data or data representations from previous tasks to prevent forgetting. 2. **Distance-Based Methods**: Judging tasks or categories by estimating the similarity between samples. 3. **Sub-Networks**: Dividing the model into different sub-models to avoid overlapping knowledge between tasks. 4. **Dynamic Networks**: Allowing the model structure to expand with the addition of new tasks to accommodate more pattern recognition capabilities. The paper also discusses the advantages and limitations of various methods and points out some gaps in current research. The authors emphasize that although many solutions have been proposed, there is still no consensus on the best practices or evaluation standards. Additionally, the paper suggests some future research directions, including exploring the combined use of different methods. In summary, this paper aims to provide a comprehensive review and analytical framework for the problem of catastrophic forgetting to promote further research and development in this field.