Creative and Context-Aware Translation of East Asian Idioms with GPT-4

Kenan Tang,Peiyang Song,Yao Qin,Xifeng Yan
2024-10-02
Abstract:As a type of figurative language, an East Asian idiom condenses rich cultural background into only a few characters. Translating such idioms is challenging for human translators, who often resort to choosing a context-aware translation from an existing list of candidates. However, compiling a dictionary of candidate translations demands much time and creativity even for expert translators. To alleviate such burden, we evaluate if GPT-4 can help generate high-quality translations. Based on automatic evaluations of faithfulness and creativity, we first identify Pareto-optimal prompting strategies that can outperform translation engines from Google and DeepL. Then, at a low cost, our context-aware translations can achieve far more high-quality translations per idiom than the human baseline. We open-source all code and data to facilitate further research.
Computation and Language
What problem does this paper attempt to address?
This paper aims to address the challenges in the translation of East Asian idioms, especially the high - quality and creative translation of idioms in different contexts. Traditionally, idiom translation depends on candidate translations in a fixed list, which are often divorced from the specific context, resulting in translation results that may require a great deal of rewriting to adapt to a particular context. In addition, constructing such a translation list is time - consuming and requires a high degree of creativity, even for professional translators. To overcome these problems, the author evaluated the potential of GPT - 4 in generating high - quality idiom translations. By automatically evaluating the fidelity and creativity of translations, the author identified Pareto - optimal prompting strategies that can outperform commercial translation engines such as Google and DeepL. The research results show that, based on these strategies, GPT - 4 can generate more high - quality context - related translations for each idiom at a lower cost, far exceeding the human baseline level. Specifically, the main contributions of the paper include: 1. **Identifying Pareto - optimal prompting strategies**: By evaluating the effects of different prompting strategies, the author found the best strategies that can optimize both translation fidelity and creativity simultaneously. 2. **Generating high - quality translations**: Using these strategies, GPT - 4 can generate multiple high - quality context - related translations for each idiom, significantly improving the diversity and quality of translations. 3. **Open - sourcing code and data**: To promote further research, the author open - sourced all code and data. In conclusion, this paper provides an efficient and high - quality method for translating East Asian idioms by leveraging the powerful capabilities of GPT - 4, which helps to reduce the burden on translators and improve the overall quality of translations.