An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning

Yun Luo, Zhen Yang, Fandong Meng, Yafu Li, Jie Zhou, Yue Zhang
DOI: https://doi.org/10.48550/arxiv.2308.08747
2023-01-01
Abstract: Catastrophic forgetting (CF) is a phenomenon that occurs in machine learning when a model forgets previously learned information while acquiring new knowledge. As large language models (LLMs) have demonstrated remarkable performance, it is intriguing to investigate whether CF exists during the continual instruction tuning of LLMs. This study empirically evaluates the forgetting phenomenon in LLMs' knowledge during continual instruction tuning from the perspectives of domain knowledge, reasoning, and reading comprehension. The experiments reveal that catastrophic forgetting is generally observed in LLMs ranging from 1b to 7b parameters. Moreover, as the model scale increases, the severity of forgetting intensifies. Comparing the decoder-only model BLOOMZ with the encoder-decoder model mT0, BLOOMZ exhibits less forgetting and retains more knowledge. Interestingly, we also observe that LLMs can mitigate language biases, such as gender bias, during continual fine-tuning. Furthermore, our findings indicate that ALPACA maintains more knowledge and capacity compared to LLAMA during continual fine-tuning, suggesting that general instruction tuning can help alleviate the forgetting phenomenon in LLMs during subsequent fine-tuning processes.