Abstract:Continual learning (CL) aims to incrementally learn different tasks (such as classification) in a non-stationary data stream without forgetting old ones. Most CL works focus on tackling catastrophic forgetting under a learning-from-scratch paradigm. However, with the increasing prominence of foundation models, pre-trained models equipped with informative representations have become available for various downstream requirements. Several CL methods based on pre-trained models have been explored, either utilizing pre-extracted features directly (which makes bridging distribution gaps challenging) or incorporating adaptors (which may be subject to forgetting). In this paper, we propose a concise and effective approach for CL with pre-trained models. Given that forgetting occurs during parameter updating, we contemplate an alternative approach that exploits training-free random projectors and class-prototype accumulation, which thus bypasses the issue. Specifically, we inject a frozen Random Projection layer with nonlinear activation between the pre-trained model's feature representations and output head, which captures interactions between features with expanded dimensionality, providing enhanced linear separability for class-prototype-based CL. We also demonstrate the importance of decorrelating the class-prototypes to reduce the distribution disparity when using pre-trained representations. These techniques prove to be effective and circumvent the problem of forgetting for both class- and domain-incremental continual learning. Compared to previous methods applied to pre-trained ViT-B/16 models, we reduce final error rates by between 20% and 62% on seven class-incremental benchmarks, despite not using any rehearsal memory. We conclude that the full potential of pre-trained models for simple, effective, and fast CL has not hitherto been fully tapped. Code is at <a class="link-external link-http" href="http://github.com/RanPAC/RanPAC" rel="external noopener nofollow">this http URL</a>.

ICL-TSVD: Bridging Theory and Practice in Continual Learning with Pre-trained Models

RanPAC: Random Projections and Pre-trained Models for Continual Learning

SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model

Don't Stop Learning: Towards Continual Learning for the CLIP Model

The Ideal Continual Learner: An Agent That Never Forgets

SLCA++: Unleash the Power of Sequential Fine-tuning for Continual Learning with Pre-training

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

TS-ACL: A Time Series Analytic Continual Learning Framework for Privacy-Preserving and Class-Incremental Pattern Recognition

Continual Learning Using a Kernel-Based Method Over Foundation Models

Class Incremental Learning with Pre-trained Vision-Language Models

Do Pre-trained Models Benefit Equally in Continual Learning?

Achieving Forgetting Prevention and Knowledge Transfer in Continual Learning

CLiMB: A Continual Learning Benchmark for Vision-and-Language Tasks

Calibration of Continual Learning Models

Advancing Cross-domain Discriminability in Continual Learning of Vision-Language Models

Advancing Cross-domain Discriminability in Continual Learning of Vison-Language Models

Towards stable training of parallel continual learning

Enhancing Visual Continual Learning with Language-Guided Supervision

Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning

Continual learning with task specialist

Mitigating Catastrophic Forgetting in Task-Incremental Continual Learning with Adaptive Classification Criterion