Abstract: Artificial intelligence and neuroscience have a long and intertwined history. Advancements in neuroscience research have significantly influenced the development of artificial intelligence systems that have the potential to retain knowledge akin to humans. Building upon foundational insights from neuroscience and existing research in adversarial and continual learning, we introduce a novel framework that comprises two key concepts: feature distillation and re-consolidation. The framework distills continual learning (CL) robust features and rehearses them while learning the next task, aiming to replicate the mammalian brain's process of consolidating memories by rehearsing a distilled version of waking experiences. Furthermore, the proposed framework emulates the mammalian brain's mechanism of memory re-consolidation, where novel experiences influence the assimilation of previous experiences, via feature re-consolidation. This process incorporates the CL model's new understanding, acquired after learning the current task, into the CL-robust samples of the previous task(s) to mitigate catastrophic forgetting. The proposed framework, called Robust Rehearsal, circumvents the limitations of existing CL frameworks that rely on the availability of pre-trained Oracle CL models to pre-distill CL-robustified datasets for training subsequent CL models. We conducted extensive experiments on three datasets (CIFAR10, CIFAR100, and a real-world helicopter attitude dataset), demonstrating that CL models trained using Robust Rehearsal outperform their baseline counterparts. In addition, we conducted a series of experiments to assess the impact of changing the memory size and the number of tasks, demonstrating that baseline methods employing Robust Rehearsal outperform methods trained without it.
Lastly, to shed light on the existence of diverse features, we explore how different optimization objectives in joint, continual, and adversarial learning affect feature learning in deep neural networks. Our findings indicate that the optimization objective dictates feature learning, which plays a vital role in model performance. This observation further emphasizes the importance of rehearsing CL-robust samples in alleviating catastrophic forgetting. Our experiments suggest that closely following neuroscience insights can contribute to developing CL approaches that mitigate the long-standing challenge of catastrophic forgetting.

CL-BPUWM: continuous learning with Bayesian parameter updating and weight memory

Progressive Learning without Forgetting

UniGrad-FS: Unified Gradient Projection with Flatter Sharpness for Continual Learning

Overcoming Long-Term Catastrophic Forgetting Through Adversarial Neural Pruning and Synaptic Consolidation

Defeating Catastrophic Forgetting via Enhanced Orthogonal Weights Modification

Bayesian Optimized Continual Learning with Attention Mechanism

Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks

Learning to Modulate Random Weights: Neuromodulation-inspired Neural Networks For Efficient Continual Learning

CLR: Channel-wise Lightweight Reprogramming for Continual Learning

Unlocking Continual Learning Abilities in Language Models

Defying Catastrophic Forgetting via Influence Function

Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning

Bio-inspired, task-free continual learning through activity regularization

Overcoming Catastrophic Forgetting for Continual Learning Via Model Adaptation

Learning After Learning: Positive Backward Transfer in Continual Learning

Dual-CBA: Improving Online Continual Learning via Dual Continual Bias Adaptors from a Bi-level Optimization Perspective

Towards Continual Learning Desiderata via HSIC-Bottleneck Orthogonalization and Equiangular Embedding

RanPAC: Random Projections and Pre-trained Models for Continual Learning

Brain-Inspired Continual Learning: Robust Feature Distillation and Re-Consolidation for Class Incremental Learning

AdaCL: Adaptive Continual Learning

Class Incremental Learning via Semantic Information Mapping and Background Information Calibrating