Abstract:Continual learning aims to rapidly and continually learn the current task from a sequence of tasks, using the knowledge obtained in the past, while performing well on prior tasks. A key challenge in this setting is the stability–plasticity dilemma existing in current and previous tasks, i.e., a high-stability network is weak to learn new knowledge in an effort to maintain previous knowledge. Correspondingly, a high-plasticity network can easily forget old tasks while dealing with well on the new task. Compared to other kinds of methods, the methods based on experience replay have shown great advantages to overcome catastrophic forgetting. One common limitation of this method is the data imbalance between the previous and current tasks, which would further aggravate forgetting. Moreover, how to effectively address the stability–plasticity dilemma in this setting is also an urgent problem to be solved. In this paper, we overcome these challenges by proposing a novel framework called Meta-learning update via Multi-scale Knowledge Distillation and Data Augmentation (MMKDDA). Specifically, we apply multi-scale knowledge distillation to grasp the evolution of long-range and short-range spatial relationships at different feature levels to alleviate the problem of data imbalance. Besides, our method mixes the samples from the episodic memory and current task in the online continual training procedure, thus alleviating the side influence due to the change of probability distribution. Moreover, we optimize our model via the meta-learning update by resorting to the number of tasks seen previously, which is helpful to keep a better balance between stability and plasticity. Finally, our extensive experiments on four benchmark datasets show the effectiveness of the proposed MMKDDA framework against other popular baselines, and ablation studies are also conducted to further analyze the role of each component in our framework.

Continual Learning in Predictive Autoscaling

An Adaptive Continual Learning Method for Nonstationary Industrial Time Series Prediction

M EMORY R EPLAY WITH D ATA C OMPRESSION FOR C ONTINUAL L EARNING

A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud

A Complementary Continual Learning Framework Using Incremental Samples for Remaining Useful Life Prediction of Machinery

ADER: Adaptively Distilled Exemplar Replay Towards Continual Learning for Session-based Recommendation

A Benchmark and Empirical Analysis for Replay Strategies in Continual Learning

Squeezing More Past Knowledge for Online Class-Incremental Continual Learning

Adaptive Memory Replay for Continual Learning

A Unified Replay-based Continuous Learning Framework for Spatio-Temporal Prediction on Streaming Data

A Predictive Autoscaler for Elastic Batch Jobs

A Continual Learning Method for Reducing Class Interference Based on Replay

Improving Replay Sample Selection and Storage for Less Forgetting in Continual Learning

Continual Learning with Diffusion-based Generative Replay for Industrial Streaming Data

Pseudo Replay-based Class Continual Learning for Online New Category Anomaly Detection in Advanced Manufacturing

Continual Recognition with Adaptive Memory Update.

Deep Learning-Based Autoscaling Using Bidirectional Long Short-Term Memory for Kubernetes

Online Continual Learning Via the Meta-learning Update with Multi-scale Knowledge Distillation and Data Augmentation

Continual Learning via Manifold Expansion Replay

Replay-enhanced Continual Reinforcement Learning

Distribution-Level Memory Recall for Continual Learning: Preserving Knowledge and Avoiding Confusion