Self-Supervised Quality Estimation for Machine Translation.

Yuanhang Zheng,Zhixing Tan,Meng Zhang,Mieradilijiang Maimaiti,Huanbo Luan,Maosong Sun,Qun Liu,Yang Liu
DOI: https://doi.org/10.18653/v1/2021.emnlp-main.267
2021-01-01
Abstract:Quality estimation (QE) of machine translation (MT) aims to evaluate the quality of machine-translated sentences without references and is important in practical applications of MT. Training QE models require massive parallel data with hand-crafted quality annotations, which are time-consuming and laborintensive to obtain. To address the issue of the absence of annotated training data, previous studies attempt to develop unsupervised QE methods. However, very few of them can be applied to both sentence- and word-level QE tasks, and they may suffer from noises in the synthetic data. To reduce the negative impact of noises, we propose a self-supervised method for both sentence- and word-level QE, which performs quality estimation by recovering the masked target words. Experimental results show that our method outperforms previous unsupervised methods on several QE tasks in different language pairs and domains.(1)
What problem does this paper attempt to address?