Learning Adaptive Segmentation Policy for Simultaneous Translation

Ruiqing Zhang, Chuanqiang Zhang, Zhongjun He, Hua Wu, Haifeng Wang
DOI: https://doi.org/10.18653/v1/2020.emnlp-main.178
2020-01-01
Abstract: Balancing accuracy and latency is a great challenge for simultaneous translation. To achieve high accuracy, the model usually needs to wait for more streaming text before translation, which results in increased latency. However, keeping low latency would probably hurt accuracy. Therefore, it is essential to segment the ASR output into appropriate units for translation. Inspired by human interpreters, we propose a novel adaptive segmentation policy for simultaneous translation. The policy learns to segment the source text by considering possible translations produced by the translation model, maintaining consistency between the segmentation and translation. Experimental results on Chinese-English and German-English translation show that our method achieves a better accuracy-latency trade-off than recently proposed state-of-the-art methods.