Machine Translation Models are Zero-Shot Detectors of Translation Direction

Michelle Wastl,Jannis Vamvas,Rico Sennrich
2024-05-23
Abstract:Detecting the translation direction of parallel text has applications for machine translation training and evaluation, but also has forensic applications such as resolving plagiarism or forgery allegations. In this work, we explore an unsupervised approach to translation direction detection based on the simple hypothesis that $p(\text{translation}|\text{original})>p(\text{original}|\text{translation})$, motivated by the well-known simplification effect in translationese or machine-translationese. In experiments with massively multilingual machine translation models across 20 translation directions, we confirm the effectiveness of the approach for high-resource language pairs, achieving document-level accuracies of 82--96% for NMT-produced translations, and 60--81% for human translations, depending on the model used. Code and demo are available at
Computation and Language
What problem does this paper attempt to address?
The paper aims to address the problem of translation direction detection. Specifically, it explores an unsupervised method based on neural machine translation (NMT) models to automatically determine the original translation direction of parallel texts. This method is significant for machine translation training, evaluation, and forensic applications such as resolving plagiarism or forgery accusations. The main contributions of the paper include: 1. **Proposing a simple and unsupervised learning method**: This method identifies the translation direction based on the translation probabilities of the NMT model. It assumes that the original text is more difficult to translate than its translated version, i.e., \(p(\text{translation}|\text{original}) > p(\text{original}|\text{translation})\). 2. **Demonstrating the effectiveness of the method**: Experiments show that this method can effectively detect the original translation direction of texts generated by neural machine translation and is also somewhat applicable to human translations. 3. **Conducting qualitative analysis**: The authors performed a qualitative analysis of the detection performance and applied the method to a real forensic case to verify whether an English book was a forgery, thereby supporting specific hypotheses about the book's authenticity. 4. **Exploring the issue of directional bias**: The paper discusses the potential directional bias in multilingual translation models and proposes a method to quantify the bias. Experimental results show that for texts generated by neural machine translation systems, the detection accuracy of this method is relatively high; for human translations, the accuracy is slightly lower but still significant; and for outputs from early non-neural network translation systems, the accuracy is below random levels, possibly due to the lower quality of texts generated by these systems. Additionally, the study demonstrates the performance differences of different models across various language pairs and finds that document-level translation direction detection accuracy is generally higher than sentence-level. Finally, the paper validates the practical application value of the method through a specific case study.