Abstract:In the scenario of class-incremental learning (CIL), deep neural networks have to adapt their model parameters to non-stationary data distributions, e.g., the emergence of new classes over time. However, CIL models are challenged by the well-known catastrophic forgetting phenomenon. Typical methods such as rehearsal-based ones rely on storing exemplars of old classes to mitigate catastrophic forgetting, which limits real-world applications considering memory resources and privacy issues. In this paper, we propose a novel rehearsal-free CIL approach that learns continually via the synergy between two Complementary Learning Subnetworks. Our approach involves jointly optimizing a plastic CNN feature extractor and an analytical feed-forward classifier. The inaccessibility of historical data is tackled by holistically controlling the parameters of a well-trained model, ensuring that the decision boundary learned fits new classes while retaining recognition of previously learned classes. Specifically, the trainable CNN feature extractor provides task-dependent knowledge separately without interference; and the final classifier integrates task-specific knowledge incrementally for decision-making without forgetting. In each CIL session, it accommodates new tasks by attaching a tiny set of declarative parameters to its backbone, in which only one matrix per task or one vector per class is kept for knowledge retention. Extensive experiments on a variety of task sequences show that our method achieves competitive results against state-of-the-art methods, especially in accuracy gain, memory cost, training efficiency, and task-order robustness. Furthermore, to make the non-growing backbone (i.e., a model with limited network capacity) suffice to train on more incoming tasks, a graceful forgetting implementation on previously learned trivial tasks is empirically investigated.

Beyond Not-Forgetting: Continual Learning with Backward Knowledge Transfer

Progressive Learning without Forgetting

Learning After Learning: Positive Backward Transfer in Continual Learning

Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning

Does Continual Learning Equally Forget All Parameters?

Slowing Down Forgetting in Continual Learning

Squeezing More Past Knowledge for Online Class-Incremental Continual Learning

AFEC: Active Forgetting of Negative Transfer in Continual Learning

Forward-Backward Knowledge Distillation for Continual Clustering

CORE: Mitigating Catastrophic Forgetting in Continual Learning through Cognitive Replay

Complementary Learning Subnetworks for Parameter-Efficient Class-Incremental Learning

BNS: Building Network Structures Dynamically for Continual Learning

Bio-inspired, task-free continual learning through activity regularization

Online Continual Learning with Declarative Memory

Mitigating Interference in the Knowledge Continuum through Attention-Guided Incremental Learning

Continual Learning in the Teacher-Student Setup: Impact of Task Similarity

Continual Learning in the Presence of Spurious Correlation

Mitigating Catastrophic Forgetting in Task-Incremental Continual Learning with Adaptive Classification Criterion

Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning

A Unified and General Framework for Continual Learning

CODE-CL: COnceptor-Based Gradient Projection for DEep Continual Learning