LIT Team's System Description for Japanese-Chinese Machine Translation Task in IWSLT 2020.

Yimeng Zhuang, Yuan Zhang, Lijie Wang
DOI: https://doi.org/10.18653/v1/2020.iwslt-1.12
2020-01-01
Abstract: This paper describes the LIT Team's submission to the IWSLT 2020 open domain translation task, focusing primarily on the Japanese-to-Chinese translation direction. Our system is based on the organizers' baseline system, and we concentrate on improving that Transformer baseline through elaborate data preprocessing. We obtain significant improvements, and this paper aims to share our data processing experience from this translation task. We also investigate large-scale back-translation on a monolingual corpus. In addition, we try shared and exclusive word embeddings and compare different token granularities, such as the sub-word level. Our Japanese-to-Chinese translation system achieves a BLEU score of 34.0 and ranks 2nd among all participating systems.