A Simple Lifelong Learning Approach

Joshua T. Vogelstein,Jayanta Dey,Hayden S. Helm,Will LeVine,Ronak D. Mehta,Tyler M. Tomita,Haoyin Xu,Ali Geisa,Qingyang Wang,Gido M. van de Ven,Chenyu Gao,Weiwei Yang,Bryan Tower,Jonathan Larson,Christopher M. White,Carey E. Priebe
2024-06-12
Abstract:In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain performance on old tasks given new tasks. But striving to avoid forgetting sets the goal unnecessarily low. The goal of lifelong learning should be to use data to improve performance on both future tasks (forward transfer) and past tasks (backward transfer). In this paper, we show that a simple approach -- representation ensembling -- demonstrates both forward and backward transfer in a variety of simulated and benchmark data scenarios, including tabular, vision (CIFAR-100, 5-dataset, Split Mini-Imagenet, and Food1k), and speech (spoken digit), in contrast to various reference algorithms, which typically failed to transfer either forward or backward, or both. Moreover, our proposed approach can flexibly operate with or without a computational budget.
Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
### The Problem the Paper Aims to Solve This paper aims to address a core issue in lifelong learning, which is how to continuously learn new tasks while not only maintaining performance on past tasks but also improving performance on future tasks. Specifically: 1. **Avoiding Forgetting**: Traditional machine learning systems significantly degrade in performance on old tasks when training on new tasks, a phenomenon known as "catastrophic forgetting." Many existing continual learning methods attempt to maintain performance on old tasks while learning new ones, but they often only achieve the goal of avoiding forgetting without further improving performance. 2. **Forward and Backward Transfer**: The paper proposes a simple method—representation ensembling—that not only maintains good performance on old tasks while learning new ones (backward transfer) but also improves performance on future tasks while learning new ones (forward transfer). In contrast, other reference algorithms usually achieve only one type of transfer or fail to achieve both. 3. **Effectiveness in Low-Sample Environments**: Effective knowledge transfer is particularly important in scenarios with small sample sizes. Many existing lifelong learning algorithms perform poorly in such conditions, which is a key barrier to the capabilities of artificial intelligence. In summary, the main goal of the paper is to propose a general lifelong learning method that can achieve both forward and backward transfer across various data scenarios and remain effective in low-sample environments. This goal is achieved through representation ensembling, and the method has been validated on multiple benchmark datasets.