LLMs Are Zero-Shot Context-Aware Simultaneous Translators

Roman Koshkin,Katsuhito Sudoh,Satoshi Nakamura
2024-06-25
Abstract:The advent of transformers has fueled progress in machine translation. More recently large language models (LLMs) have come to the spotlight thanks to their generality and strong performance in a wide range of language tasks, including translation. Here we show that open-source LLMs perform on par with or better than some state-of-the-art baselines in simultaneous machine translation (SiMT) tasks, zero-shot. We also demonstrate that injection of minimal background information, which is easy with an LLM, brings further performance gains, especially on challenging technical subject-matter. This highlights LLMs' potential for building next generation of massively multilingual, context-aware and terminologically accurate SiMT systems that require no resource-intensive training or fine-tuning.
Computation and Language
What problem does this paper attempt to address?
The paper aims to address several key issues in Simultaneous Machine Translation (SiMT). The main goal is to achieve zero-shot simultaneous translation tasks using off-the-shelf large language models (LLMs) without resource-intensive training or fine-tuning, and to consider contextual information during translation, particularly to improve translation quality in highly technical domains. Specifically, the paper demonstrates the following points: 1. **Zero-shot Simultaneous Translation**: The study shows that instruction-tuned LLMs, without specialized training, can perform simultaneous translation tasks with quality and latency metrics comparable to or even surpassing some state-of-the-art systems, without the need for complex segmentation strategies. 2. **Context-aware Translation**: By injecting a small amount of background information, LLMs can significantly improve translation quality, especially when dealing with technical terms and content. 3. **Response Priming**: A simple yet effective improvement method is proposed—by inserting part of the target text into the "assistant" section of the prompt rather than the "user" section, effectively constraining the model's generation space and avoiding unnecessary annotations or explanatory text. In summary, the paper aims to improve the performance of simultaneous translation systems by leveraging the powerful reasoning and contextual learning capabilities of LLMs, especially when handling highly specialized scenarios.