PromptFusion: Decoupling Stability and Plasticity for Continual Learning

Haoran Chen,Zuxuan Wu,Xintong Han,Menglin Jia,Yu-Gang Jiang
2024-07-10
Abstract:Current research on continual learning mainly focuses on relieving catastrophic forgetting, and most of their success is at the cost of limiting the performance of newly incoming tasks. Such a trade-off is referred to as the stability-plasticity dilemma and is a more general and challenging problem for continual learning. However, the inherent conflict between these two concepts makes it seemingly impossible to devise a satisfactory solution to both of them simultaneously. Therefore, we ask, "is it possible to divide them into two separate problems to conquer them independently?". To this end, we propose a prompt-tuning-based method termed PromptFusion to enable the decoupling of stability and plasticity. Specifically, PromptFusion consists of a carefully designed \stab module that deals with catastrophic forgetting and a \boo module to learn new knowledge concurrently. Furthermore, to address the computational overhead brought by the additional architecture, we propose PromptFusion-Lite which improves PromptFusion by dynamically determining whether to activate both modules for each input image. Extensive experiments show that both PromptFusion and PromptFusion-Lite achieve promising results on popular continual learning datasets for class-incremental and domain-incremental settings. Especially on Split-Imagenet-R, one of the most challenging datasets for class-incremental learning, our method can exceed state-of-the-art prompt-based methods by more than 5\% in accuracy, with PromptFusion-Lite using 14.8\% less computational resources than PromptFusion.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the problem of how to balance the trade-off between stability and plasticity in Continual Learning, known as the stability-plasticity dilemma. In continual learning, when a model encounters new data, it ideally should maintain good performance on previous tasks (stability) while effectively learning new tasks (plasticity). However, most existing methods sacrifice the ability to learn new tasks while mitigating catastrophic forgetting, leading to a trade-off between the two. To tackle this challenge, the paper proposes a prompt tuning-based method called PromptFusion, which decouples stability and plasticity into two independent problems to handle them separately. Specifically, PromptFusion consists of two modules: a Stabilizer module to address catastrophic forgetting and a Booster module to simultaneously learn new knowledge. Additionally, to reduce the computational overhead brought by extra architecture, the paper also introduces a lightweight version of PromptFusion—PromptFusion-Lite, which further optimizes the use of computational resources by dynamically deciding whether to activate the two modules. Through extensive experiments, the researchers validated the effectiveness of PromptFusion and its lightweight version on multiple continual learning benchmark datasets, particularly excelling in class-incremental learning and domain-incremental learning settings. For instance, on the Split-Imagenet-R, the most challenging class-incremental learning dataset, PromptFusion achieved an average accuracy of 78.7% under memory-free conditions, outperforming the current state-of-the-art method CODA-Prompt by 5.3%. Meanwhile, PromptFusion-Lite maintained competitiveness while reducing computational resources by 14.8%.