Defying Forgetting in Continual Relation Extraction Via Batch Spectral Norm Regularization

Rundong Gao,Wenkai Yang,Xu Sun
DOI: https://doi.org/10.1109/ijcnn60899.2024.10651110
2024-01-01
Abstract:Continual relation extraction (CRE) aims at incrementally training the model with new relations without forgetting the old ones. Recently, various methods, relying on the stored data, have been proposed and achieved outstanding performance. However, the practicability of storing data from previous tasks is limited by the storage space or privacy issues. Therefore, in this paper, we study overcoming the catastrophic forgetting in continual relation extraction under the memory-free setting, which means that no exemplars from old relations can be stored. Under the memory-free setting, we first empirically find that the commonly used linear trainable classifier leads to the severe catastrophic forgetting and the nearest-class-mean (NCM) classifier is a simple but more suitable substitute. In addition, we propose a simple yet effective loss term, named Batch Spectral Norm Regularization, to improve the robustness of the NCM classifier to the semantic drift in the embedding space when training the model on the current data. We perform extensive experiments on the two commonly used datasets, TACRED and FewRel. Experimental results show that our method can consistently bring improvement in the absence of the memory.
What problem does this paper attempt to address?