Knn-Tl: K-Nearest-neighbor Transfer Learning for Low-Resource Neural Machine Translation.

Shudong Liu,Xuebo Liu,Derek F. Wong,Zhaocong Li,Wenxiang Jiao,Lidia S. Chao,Min Zhang
DOI: https://doi.org/10.18653/v1/2023.acl-long.105
2023-01-01
Abstract:Transfer learning has been shown to be an effective technique for enhancing the performance of low-resource neural machine translation (NMT). This is typically achieved through either fine-tuning a child model with a pre-trained parent model, or by utilizing the output of the parent model during the training of the child model. However, these methods do not make use of the parent knowledge during the child inference, which may limit the translation performance. In this paper, we propose a k-Nearest-Neighbor Transfer Learning (kNN-TL) approach for low-resource NMT, which leverages the parent knowledge throughout the entire developing process of the child model. Our approach includes a parent-child representation alignment method, which ensures consistency in the output representations between the two models, and a child-aware datastore construction method that improves inference efficiency by selectively distilling the parent datastore based on relevance to the child model. Experimental results on four low-resource translation tasks show that kNN-TL outperforms strong baselines. Extensive analyses further demonstrate the effectiveness of our approach. Code and scripts are freely available at https://github.com/NLP2CT/kNN-TL.
What problem does this paper attempt to address?