Point, Disambiguate and Copy: Incorporating Bilingual Dictionaries for Neural Machine Translation.

Tong Zhang,Long Zhang,Wei Ye,Bo Li,Jinan Sun,Xiaoyu Zhu,Wen Zhao,Shikun Zhang
DOI: https://doi.org/10.18653/v1/2021.acl-long.307
2021-01-01
Abstract:This paper proposes a sophisticated neural architecture to incorporate bilingual dictionaries into Neural Machine Translation (NMT) models. By introducing three novel components: Pointer, Disambiguator, and Copier, our method PDC achieves the following merits inherently compared with previous efforts: (1) Pointer leverages the semantic information from bilingual dictionaries, for the first time, to better locate source words whose translation in dictionaries can potentially be used; (2) Disambiguator synthesizes contextual information from the source view and the target view, both of which contribute to distinguishing the proper translation of a specific source word from multiple candidates in dictionaries; (3) Copier systematically connects Pointer and Disambiguator based on a hierarchical copy mechanism seamlessly integrated with Transformer, thereby building an end-to-end architecture that could avoid error propagation problems in alternative pipeline methods. The experimental results on Chinese-English and English-Japanese benchmarks demonstrate the PDC's overall superiority and effectiveness of each component.
What problem does this paper attempt to address?