Disentangled Representations for Continual Learning: Overcoming Forgetting and Facilitating Knowledge Transfer

Zhaopeng Xu,Qi Qin,Bing Liu,Dongyan Zhao
DOI: https://doi.org/10.1007/978-3-031-70359-1_9
2024-01-01
Abstract:Achieving knowledge transfer is a new challenge in continual learning, which requires the model to not only overcome catastrophic forgetting but also make full use of the knowledge from multiple tasks to solve a particular task. In this paper, we propose to disentangle representations in continual learning into task-shared and task-specific representations, using shared and task-specific encoders to obtain the corresponding disentangled representations, respectively. However, forgetting persists in the shared encoder, and task encoders are unable to transfer knowledge to each other. In order to overcome forgetting in the shared encoder, we introduce the Fisher mask to limit the update of important parameters of old tasks. Further, a multi-knowledge distillation method is proposed to promote the shared encoder to consolidate the knowledge of all tasks by interacting with task encoders. To facilitate knowledge transfer among task encoders, we select and fuse knowledge from the task encoder of each old task to apply to the new task. Experiments show that our proposed method achieves performance beyond state-of-the-art baselines using a smaller network without replay data.
What problem does this paper attempt to address?