Abstract: In class-incremental learning (CIL), deep neural networks must adapt their parameters to non-stationary data distributions, e.g., the emergence of new classes over time. However, CIL models are challenged by the well-known catastrophic forgetting phenomenon. Typical methods, such as rehearsal-based ones, store exemplars of old classes to mitigate catastrophic forgetting, which limits real-world applicability given memory constraints and privacy concerns. In this paper, we propose a novel rehearsal-free CIL approach that learns continually via the synergy between two Complementary Learning Subnetworks. Our approach jointly optimizes a plastic CNN feature extractor and an analytical feed-forward classifier. The inaccessibility of historical data is tackled by holistically controlling the parameters of a well-trained model, ensuring that the learned decision boundary fits new classes while retaining recognition of previously learned classes. Specifically, the trainable CNN feature extractor provides task-dependent knowledge separately and without interference, and the final classifier integrates task-specific knowledge incrementally for decision-making without forgetting. In each CIL session, the model accommodates new tasks by attaching a tiny set of declarative parameters to its backbone, keeping only one matrix per task or one vector per class for knowledge retention. Extensive experiments on a variety of task sequences show that our method achieves competitive results against state-of-the-art methods, especially in accuracy gain, memory cost, training efficiency, and task-order robustness. Furthermore, so that the non-growing backbone (i.e., a model with limited network capacity) can still be trained on more incoming tasks, we empirically investigate graceful forgetting of previously learned trivial tasks.
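The abstract does not spell out how the analytical classifier is computed. As a rough illustration only, the sketch below assumes a closed-form ridge-regression head on top of extracted features; this is an assumption for illustration, not necessarily the authors' formulation. The class name `IncrementalRidgeClassifier`, the `reg` parameter, and the choice of stored statistics (one Gram matrix plus one feature-sum vector per class) are all hypothetical. It shows how a classifier can absorb new classes session by session without revisiting old data, since the stored statistics are additive.

```python
import numpy as np

class IncrementalRidgeClassifier:
    """Hypothetical closed-form head updated per session without old samples."""

    def __init__(self, feat_dim, reg=1e-3):
        # Running regularized Gram matrix: reg * I + sum over sessions of X^T X.
        self.gram = reg * np.eye(feat_dim)
        # One vector per class: the sum of that class's feature vectors,
        # i.e. the matching column of X^T Y for one-hot targets Y.
        self.class_sums = {}

    def fit_session(self, feats, labels):
        # Only the current session's features are needed; both statistics
        # are additive across sessions.
        self.gram += feats.T @ feats
        for c in np.unique(labels):
            key = int(c)
            self.class_sums[key] = self.class_sums.get(key, 0) + feats[labels == c].sum(axis=0)

    def weights(self):
        classes = sorted(self.class_sums)
        cross = np.stack([self.class_sums[c] for c in classes], axis=1)  # shape (d, C)
        # Solve (X^T X + reg * I) W = X^T Y in closed form.
        return classes, np.linalg.solve(self.gram, cross)

    def predict(self, feats):
        classes, W = self.weights()
        return np.asarray(classes)[np.argmax(feats @ W, axis=1)]
```

Under these assumptions, the weights obtained after several sessions match those of a ridge regression fitted jointly on all data, which is one way a classifier can integrate knowledge incrementally without forgetting. The sketch treats the features as fixed across sessions and does not model the plastic CNN feature extractor described in the abstract.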

Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

Progressive Learning without Forgetting

Overcoming Catastrophic Forgetting in Continual Learning by Exploring Eigenvalues of Hessian Matrix

Orthogonal Gradient Descent for Continual Learning

Continual Learning by Asymmetric Loss Approximation with Single-Side Overestimation

Defeating Catastrophic Forgetting via Enhanced Orthogonal Weights Modification

Continual Learning with Guarantees via Weight Interval Constraints

Mixture-of-Variational-Experts for Continual Learning

Does Continual Learning Equally Forget All Parameters?

Online Continual Learning with Declarative Memory

PNSP: Overcoming Catastrophic Forgetting Using Primary Null Space Projection in Continual Learning

Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting

Adaptive online continual multi-view learning

Sparsity and Heterogeneous Dropout for Continual Learning in the Null Space of Neural Activations

Task-aware Orthogonal Sparse Network for Exploring Shared Knowledge in Continual Learning

Layerwise Optimization by Gradient Decomposition for Continual Learning

DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning

Imbalance Mitigation for Continual Learning via Knowledge Decoupling and Dual Enhanced Contrastive Learning

Training Networks in Null Space of Feature Covariance for Continual Learning

Complementary Learning Subnetworks for Parameter-Efficient Class-Incremental Learning

Efficient Non-Exemplar Class-Incremental Learning with Retrospective Feature Synthesis