An Ensemble Strategy with Gradient Conflict for Multi-Domain Neural Machine Translation

Zhibo Man,Yujie Zhang,Yu Li,Yuanmeng Chen,Yufeng Chen,Jinan Xu
DOI: https://doi.org/10.1145/3638248
IF: 1.471
2024-01-01
ACM Transactions on Asian and Low-Resource Language Information Processing
Abstract:Multi-domain neural machine translation aims to construct a unified neural machine translation model to translate sentences across various domains. Nevertheless, previous studies have one limitation is the incapacity to acquire both domain-general and domain-specific representations concurrently. To this end, we propose an ensemble strategy with gradient conflict for multi-domain neural machine translation that automatically learns model parameters by identifying both domain-shared and domain-specific features. Specifically, our approach consists of (1) a parameter-sharing framework, where the parameters of all the layers are originally shared and equivalent to each domain, and (2) ensemble strategy, in which we design an Extra Ensemble strategy via a piecewise condition function to learn direction and distance-based gradient conflict. In addition, we give a detailed theoretical analysis of the gradient conflict to further validate the effectiveness of our approach. Experimental results on two multi-domain datasets show the superior performance of our proposed model compared to previous work.
What problem does this paper attempt to address?