Abstract:Neuromorphic computing with spiking neural networks is promising for energy-efficient artificial intelligence (AI) applications. However, different from humans who continually learn different tasks in a lifetime, neural network models suffer from catastrophic forgetting. How could neuronal operations solve this problem is an important question for AI and neuroscience. Many previous studies draw inspiration from observed neuroscience phenomena and propose episodic replay or synaptic metaplasticity, but they are not guaranteed to explicitly preserve knowledge for neuron populations. Other works focus on machine learning methods with more mathematical grounding, e.g., orthogonal projection on high dimensional spaces, but there is no neural correspondence for neuromorphic computing. In this work, we develop a new method with neuronal operations based on lateral connections and Hebbian learning, which can protect knowledge by projecting activity traces of neurons into an orthogonal subspace so that synaptic weight update will not interfere with old tasks. We show that Hebbian and anti-Hebbian learning on recurrent lateral connections can effectively extract the principal subspace of neural activities and enable orthogonal projection. This provides new insights into how neural circuits and Hebbian learning can help continual learning, and also how the concept of orthogonal projection can be realized in neuronal systems. Our method is also flexible to utilize arbitrary training methods based on presynaptic activities/traces. Experiments show that our method consistently solves forgetting for spiking neural networks with nearly zero forgetting under various supervised training methods with different error propagation approaches, and outperforms previous approaches under various settings. Our method can pave a solid path for building continual neuromorphic computing systems.

Gradient Projection Memory for Continual Learning

UniGrad-FS: Unified Gradient Projection with Flatter Sharpness for Continual Learning

Progressive Learning without Forgetting

Orthogonal Gradient Descent for Continual Learning

Introducing Common Null Space of Gradients for Gradient Projection Methods in Continual Learning

Restricted Orthogonal Gradient Projection for Continual Learning

Iterative Relaxing Gradient Projection for Continual Learning

Class Gradient Projection for Continual Learning

CODE-CL: COnceptor-Based Gradient Projection for DEep Continual Learning

Rethinking Gradient Projection Continual Learning: Stability / Plasticity Feature Space Decoupling

Sparsity and Heterogeneous Dropout for Continual Learning in the Null Space of Neural Activations

Prompt Gradient Projection for Continual Learning.

Deep Generative Dual Memory Network for Continual Learning

Triple-Memory Networks: A Brain-Inspired Method for Continual Learning

Gradient Correlation Subspace Learning against Catastrophic Forgetting

TRGP: Trust Region Gradient Projection for Continual Learning

Hebbian Learning based Orthogonal Projection for Continual Learning of Spiking Neural Networks

Improving Performance in Continual Learning Tasks using Bio-Inspired Architectures

Gradient Episodic Memory for Continual Learning

Sparsified Subgraph Memory for Continual Graph Representation Learning

Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers