Abstract: We propose a continual learning algorithm for Bayesian neural networks based on variational inference, aiming to overcome several drawbacks of existing methods. In continual learning, storing the network parameters at every step to retain knowledge is costly. This is compounded by the need to mitigate catastrophic forgetting when access to past datasets is limited, which makes it difficult to maintain the correspondence between network parameters and datasets across all sessions. Current variational inference methods that rely on the KL divergence risk catastrophic forgetting during updates of uncertain nodes and from coupled disruptions in certain nodes. We address these challenges with three strategies. First, to reduce the storage needed for dense-layer parameters, we propose a parameter distribution learning method that significantly lowers storage requirements. Second, within the variational continual learning framework, we introduce a regularization term that specifically targets the dynamics and population of the means and variances of the parameters; this term aims to retain the benefits of the KL divergence while addressing its shortcomings. Third, to ensure proper correspondence between network parameters and data, we introduce an importance-weighted Evidence Lower Bound term that captures correlations between data and parameters, which enables storing the bases of the common and distinctive parameter hyperspaces. The proposed method partitions the parameter space into common and distinctive subspaces, with conditions for effective backward and forward knowledge transfer, elucidating the correspondence between network parameters and datasets. Experiments across diverse datasets and various combinations of sequential datasets demonstrate the effectiveness of our method, which outperforms existing approaches.
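For context on the KL-based regularization the abstract refers to, here is a minimal sketch of the standard closed-form KL divergence between two factorized Gaussian parameter distributions, where the posterior from the previous task serves as the prior for the next. This is the generic baseline term, not the regularizer proposed in the paper; the function name `kl_diag_gaussians` and all numeric values are hypothetical and chosen only for illustration.

```python
import math

def kl_diag_gaussians(mu_q, sigma_q, mu_p, sigma_p):
    """KL(q || p) for factorized (diagonal) Gaussians, summed over parameters.

    q is the candidate posterior for the new task; p is the posterior
    learned on the previous task, reused as the prior.
    """
    total = 0.0
    for mq, sq, mp, sp in zip(mu_q, sigma_q, mu_p, sigma_p):
        total += math.log(sp / sq) + (sq ** 2 + (mq - mp) ** 2) / (2.0 * sp ** 2) - 0.5
    return total

# Hypothetical previous-task posterior: one uncertain weight (sigma = 1.0)
# and one confident weight (sigma = 0.1).
prev_mu, prev_sigma = [0.0, 1.0], [1.0, 0.1]

# Hypothetical candidate posterior: both means shifted by 0.5, variances unchanged.
new_mu, new_sigma = [0.5, 1.5], [1.0, 0.1]

# KL of a distribution with itself is zero.
kl_same = kl_diag_gaussians(prev_mu, prev_sigma, prev_mu, prev_sigma)

# Per-parameter penalty for the same mean shift.
per_param = [
    kl_diag_gaussians([mq], [sq], [mp], [sp])
    for mq, sq, mp, sp in zip(new_mu, new_sigma, prev_mu, prev_sigma)
]

print("KL(prev || prev):", kl_same)
print("per-parameter KL after shift:", per_param)
```

In this sketch the same shift in the mean is penalized far more heavily for the low-variance weight than for the high-variance one. Uncertain parameters are therefore comparatively free to move under a plain KL penalty, which is the kind of behavior that the abstract's concern about uncertain node updates relates to.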

On Sequential Bayesian Inference for Continual Learning

Progressive Learning without Forgetting

Learning to Continually Learn with the Bayesian Principle

Continual Learning via Sequential Function-Space Variational Inference

Adaptive Progressive Continual Learning

Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning

Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes

Bio-inspired, task-free continual learning through activity regularization

Challenging Common Assumptions about Catastrophic Forgetting

Uncertainty Estimation With Neural Processes for Meta-Continual Learning

Bayesian Optimized Continual Learning with Attention Mechanism

Reinforced Continual Learning

Investigating Plausibility of Biologically Inspired Bayesian Learning in ANNs

Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks

Lifelong Neural Predictive Coding: Learning Cumulatively Online without Forgetting

Variational Density Propagation Continual Learning

On the Convergence of Continual Learning with Adaptive Methods

A Neural Network Model of Continual Learning with Cognitive Control

Online Continual Learning with Declarative Memory

Towards Robust Continual Learning with Bayesian Adaptive Moment Regularization

Understanding Catastrophic Forgetting and Remembering in Continual Learning with Optimal Relevance Mapping