Document-Level Machine Translation with Effective Batch-Level Context Representation

Kang Zhong,Jie Zhang,Wu Guo
DOI: https://doi.org/10.1109/ijcnn60899.2024.10651489
2024-01-01
Abstract:It is critical to provide inter-sentential context for document-level neural machine translation (DocNMT) to achieve higher-quality translations. As the document-level information is naturally preserved in mini-batches in case sentences are not shuffled, in this work we propose an effective batch-level context representation (EBCR) for DocNMT by leveraging structural contextual clues in the mini-batches. The EBCR is a plug-in module that is added to each encoder layer of the conventional Transformer model and can condense the inter-sentential contextual information within the mini-batch and reinforce the inter-sentential local context through gating operation. The proposed method is evaluated on three English-German document translation datasets, and results show that our model can present the wide-range context more effectively than existing methods.
What problem does this paper attempt to address?