Towards Making the Most of ChatGPT for Machine Translation

Keqin Peng,Liang Ding,Qihuang Zhong,Li Shen,Xuebo Liu,Min Zhang,Yuanxin Ouyang,Dacheng Tao

2023-10-20

Abstract:ChatGPT shows remarkable capabilities for machine translation (MT). Several prior studies have shown that it achieves comparable results to commercial systems for high-resource languages, but lags behind in complex tasks, e.g., low-resource and distant-language-pairs translation. However, they usually adopt simple prompts which can not fully elicit the capability of ChatGPT. In this paper, we aim to further mine ChatGPT's translation ability by revisiting several aspects: temperature, task information, and domain information, and correspondingly propose an optimal temperature setting and two (simple but effective) prompts: Task-Specific Prompts (TSP) and Domain-Specific Prompts (DSP). We show that: 1) The performance of ChatGPT depends largely on temperature, and a lower temperature usually can achieve better performance; 2) Emphasizing the task information can further improve ChatGPT's performance, particularly in complex MT tasks; 3) Introducing domain information can elicit ChatGPT's generalization ability and improve its performance in the specific domain; 4) ChatGPT tends to generate hallucinations for non-English-centric MT tasks, which can be partially addressed by our proposed prompts but still need to be highlighted for the MT/NLP community. We also explore the effects of advanced in-context learning strategies and find a (negative but interesting) observation: the powerful chain-of-thought prompt leads to word-by-word translation behavior, thus bringing significant translation degradation.

Computation and Language

What problem does this paper attempt to address?

The paper aims to explore how to better tap into the potential of ChatGPT in machine translation tasks. Specifically, the study focuses on the following aspects: 1. **Temperature Setting**: The study finds that different temperature settings have a significant impact on ChatGPT's translation performance, with lower temperatures generally yielding better results. 2. **Task Information**: By proposing task-specific prompts (TSP) that emphasize the information of the translation task, ChatGPT's performance in complex translation tasks can be further improved. 3. **Domain Information**: Introducing domain-specific prompts (DSP) can enhance ChatGPT's translation quality in specific domains and narrow the gap with commercial systems. The paper also explores several advanced in-context learning strategies, including few-shot in-context learning (ICL) and chain-of-thought (CoT). The results show that few-shot learning can significantly improve ChatGPT's performance, while chain-of-thought may lead to word-by-word translation issues, thereby reducing translation quality. Overall, the study aims to improve ChatGPT's performance in machine translation tasks by optimizing temperature settings and improving prompt methods.

Towards Making the Most of ChatGPT for Machine Translation

How to Design Translation Prompts for ChatGPT: An Empirical Study

Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine

Gradable ChatGPT Translation Evaluation

Prompting ChatGPT for Translation: A Comparative Analysis of Translation Brief and Persona Prompts

Optimizing Machine Translation through Prompt Engineering: An Investigation into ChatGPT's Customizability

Comparing Chinese-English MT Performance Involving ChatGPT and MT Providers and the Efficacy of AI mediated Post-Editing

Linguistically Informed ChatGPT Prompts to Enhance Japanese-Chinese Machine Translation: A Case Study on Attributive Clauses

Is ChatGPT Reliable in Scoring Learner's Translation Quality?

Guiding ChatGPT for Better Code Generation: an Empirical Study

Pushing the Limits of ChatGPT on NLP Tasks

How does ChatGPT Compare with Conventional Neural Machine Translation Systems in Performing a Chinese to English Translation Task?

ChatGPT Performance Evaluation on Chinese Language and Risk Measures

Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT

What is the Best Way for ChatGPT to Translate Poetry?

Exploring the effectiveness of ChatGPT-based feedback compared with teacher feedback and self-feedback: Evidence from Chinese to English translation

Distinguishing Translations by Human, NMT, and ChatGPT: A Linguistic and Statistical Approach

Don't Trust ChatGPT when Your Question is not in English: A Study of Multilingual Abilities and Types of LLMs

Evaluation on ChatGPT for Chinese Language Understanding

Research on Methods to Enhance Machine Translation Quality Between Low-Resource Languages and Chinese Based on ChatGPT

Improving ChatGPT Prompt for Code Generation