Joint data augmentation and knowledge distillation for few-shot continual relation extraction
Zhongcheng Wei,Yunping Zhang,Bin Lian,Yongjian Fan,Jijun Zhao
DOI: https://doi.org/10.1007/s10489-024-05327-y
IF: 5.3
2024-03-01
Applied Intelligence
Abstract:Few-shot continual relation extraction (CRE) aims to perpetually learn new relations through a limited set of training samples. Its primary challenges include few-shot problems and catastrophic forgetting of old relations. Through empirical research on the existing CRE works, we observe that the cause of catastrophic forgetting is not only an increase in the number of new classes but confusion between similar relations. To address the above issues, we propose a joint data augmentation and knowledge distillation method for few-shot continual relation extraction (JDAKD). Specifically, JDAKD is designed to learn more accurate and robust relationship representations via a similar class-adversarial enhancement mechanism. Furthermore, a novel distillation structure is implemented in which the base model and the model from the previous stage serve as complementary teacher models to guide the learning process. Additionally, a generative adversarial network is employed to augment the data, effectively mitigating the few-shot problem. Extensive experiments conducted on the FewRel and TACRED datasets demonstrate that our proposed JDAKD model outperforms several competitive baseline methods. Notably, in the last task, JDAKD achieves remarkable accuracy improvements, surpassing the second-best model, SCKD, by 4.43% and 3.1%, respectively.
computer science, artificial intelligence