Neural Machine Translation Based on Multi-sub-word Sequence Fusion

LIU Wanyue,AISHAN Wumaier,LI Zhe,HAN Yue,ZHANG Daren,YI Nian
DOI: https://doi.org/10.3969/j.issn.1003-0077.2023.02.009
2023-01-01
Abstract:In neural machine translation, BPE(Byte Pair Encoding) is apopular method to segment sub-word sequence to solve the problem of rare words and out-of-vocabulary words. However, BPE can only segment words into unique subword sequences. Facing rich languages, there are many different combinations of the same word. single sub-word sequence will prevent the model from better learning different combination characteristics of word. Instead of relying on this single sub-word sequence, this paper proposes a method of tagging and fusing multiple subword sequences. Different parameters of BPE are applied to segment the same training data to obtain different subword sequences, with corresponding tags assigned. Experiments show the proposed method has improved more than 0.5 BLEU score for both morphological-rich and noninflectional language pairs. In addition, the less overlap there is between the different subword sequences, the better the translation quality that can be achieved.
What problem does this paper attempt to address?