A Simple Lifelong Learning Approach

Joshua T. Vogelstein, Jayanta Dey, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Weiwei Yang, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

2024-06-12

Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain performance on old tasks given new tasks. But striving to avoid forgetting sets the goal unnecessarily low. The goal of lifelong learning should be to use data to improve performance on both future tasks (forward transfer) and past tasks (backward transfer). In this paper, we show that a simple approach -- representation ensembling -- demonstrates both forward and backward transfer in a variety of simulated and benchmark data scenarios, including tabular, vision (CIFAR-100, 5-dataset, Split Mini-Imagenet, and Food1k), and speech (spoken digit), in contrast to various reference algorithms, which typically failed to transfer either forward or backward, or both. Moreover, our proposed approach can flexibly operate with or without a computational budget.

Artificial Intelligence, Machine Learning

What problem does this paper attempt to address?

### The Problem the Paper Aims to Solve

This paper addresses a core issue in lifelong learning: how to keep learning new tasks while improving performance on both past and future tasks, rather than merely preserving performance on past ones. Specifically:

1. **Avoiding Forgetting**: Typical transfer learning algorithms lose performance on old tasks after training on new ones, a phenomenon known as "catastrophic forgetting." Many existing continual learning methods try to maintain performance on old tasks while learning new ones, but the paper argues that merely avoiding forgetting sets the goal unnecessarily low.
2. **Forward and Backward Transfer**: The paper proposes a simple method, representation ensembling, that uses data to improve performance on past tasks (backward transfer) as well as on future tasks (forward transfer). In contrast, the reference algorithms typically failed to transfer forward, backward, or both.
3. **Effectiveness in Low-Sample Environments**: Effective knowledge transfer is particularly important when sample sizes are small. Many existing lifelong learning algorithms perform poorly in such conditions, which limits what artificial intelligence systems can do.

In summary, the main goal of the paper is to propose a general lifelong learning method that achieves both forward and backward transfer across various data scenarios and remains effective in low-sample environments. The paper pursues this goal through representation ensembling and evaluates the method on simulated and benchmark data spanning tabular, vision, and speech tasks.
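To make the distinction between forward and backward transfer concrete, here is a minimal sketch of one way the two quantities could be computed for a single task. This page does not state the paper's formula, so the log-ratio-of-errors definition below is an assumption, and the function name `log_transfer` and all error values are hypothetical, chosen only for illustration.

```python
import math


def log_transfer(error_before: float, error_after: float) -> float:
    """Log ratio of two error rates on the same task.

    Assumed convention (not taken from this page): a positive value means
    the extra data lowered the error, zero means no change, and a negative
    value means the extra data hurt (forgetting, in the backward case).
    """
    if error_before <= 0 or error_after <= 0:
        raise ValueError("error rates must be positive")
    return math.log(error_before / error_after)


# Hypothetical error rates on one task, made up for illustration only.
err_alone = 0.40          # learner saw only this task's data
err_with_earlier = 0.32   # learner also saw data from tasks that came before it
err_with_all = 0.25       # learner additionally saw data from tasks that came after it

# Forward transfer: did earlier tasks help with this task?
forward = log_transfer(err_alone, err_with_earlier)

# Backward transfer: did later tasks help with this (by then past) task?
backward = log_transfer(err_with_earlier, err_with_all)

print(f"forward transfer:  {forward:.3f}")
print(f"backward transfer: {backward:.3f}")
```

Under this assumed convention, an algorithm that merely avoids forgetting would score a backward transfer of about zero, while the goal described above asks for a value above zero.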

A Conceptual Framework for Lifelong Learning

A Deep Learning Framework for Lifelong Machine Learning

Lifelong Learning Via Progressive Distillation And Retrospection

Anti-retroactive Interference for Lifelong Learning

Streaming LifeLong Learning With Any-Time Inference

Learning with Recoverable Forgetting

Lifelong learning with selective attention over seen classes and memorized instances

Learning After Learning: Positive Backward Transfer in Continual Learning

Robustness-preserving Lifelong Learning via Dataset Condensation

Continual Learning via Online Leverage Score Sampling

Asymmetric collaborative network: transferable lifelong learning for remote sensing images

Active Task Selection for Lifelong Machine Learning

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

Learn to Grow: A Continual Structure Learning Framework for Overcoming Catastrophic Forgetting

Life-long Learning Based on Dynamic Combination Model

Schematic Memory Persistence and Transience for Efficient and Robust Continual Learning

Drift to Remember

AdaER: An Adaptive Experience Replay Approach for Continual Lifelong Learning

Lifelong Learning With Cycle Memory Networks

Lifelong Neural Predictive Coding: Learning Cumulatively Online without Forgetting