Pluggable Neural Machine Translation Models Via Memory-augmented Adapters

Yuzhuang Xu,Shuo Wang,Peng Li,Xuebo Liu,Xiaolong Wang,Weidong Liu,Yang Liu
DOI: https://doi.org/10.48550/arxiv.2307.06029
2023-01-01
Abstract:Although neural machine translation (NMT) models perform well in the generaldomain, it remains rather challenging to control their generation behavior tosatisfy the requirement of different users. Given the expensive training costand the data scarcity challenge of learning a new model from scratch for eachuser requirement, we propose a memory-augmented adapter to steer pretrained NMTmodels in a pluggable manner. Specifically, we construct a multi-granularmemory based on the user-provided text samples and propose a new adapterarchitecture to combine the model representations and the retrieved results. Wealso propose a training strategy using memory dropout to reduce spuriousdependencies between the NMT model and the memory. We validate our approach onboth style- and domain-specific experiments and the results indicate that ourmethod can outperform several representative pluggable baselines.
What problem does this paper attempt to address?