Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models

Victor Agostinelli,Max Wild,Matthew Raffel,Kazi Ahmed Asif Fuad,Lizhong Chen
2024-07-04
Abstract:Large language models (LLMs) with billions of parameters and pretrained on massive amounts of data are now capable of near or better than state-of-the-art performance in a variety of downstream natural language processing tasks. Neural machine translation (NMT) is one such task that LLMs have been applied to with great success. However, little research has focused on applying LLMs to the more difficult subset of NMT called simultaneous translation (SimulMT), where translation begins before the entire source context is available to the model. In this paper, we address key challenges facing LLMs fine-tuned for SimulMT, validate classical SimulMT concepts and practices in the context of LLMs, explore adapting LLMs that are fine-tuned for NMT to the task of SimulMT, and introduce Simul-LLM, the first open-source fine-tuning and evaluation pipeline development framework for LLMs focused on SimulMT.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The paper aims to address the challenges of applying large language models (LLMs) in simultaneous translation (SimulMT) tasks. Specifically, the paper addresses the following issues: 1. **Adapting to Dynamic Input**: Traditional LLMs are pre-trained and fine-tuned under the assumption that the input is complete and static, whereas in simultaneous translation, the input is provided incrementally. Therefore, the study explores how to adapt LLMs to this dynamically changing input environment. 2. **Exploration of Fine-Tuning Strategies**: The paper investigates different methods of directly applying LLMs used for neural machine translation (NMT) to simultaneous translation, including adjusting the NMT model during inference and directly fine-tuning LLMs for simultaneous translation. 3. **Validation of Classic Concepts**: The paper validates whether some classic concepts and practices in the traditional simultaneous translation field are applicable to LLMs, such as whether a higher fine-tuning wait value (wait-k) can improve generalizability. 4. **Development of an Open-Source Framework**: To promote research and development of LLMs in the field of simultaneous translation, the paper proposes an open-source fine-tuning and evaluation framework called Simul-LLM. This framework supports multiple LLMs and can seamlessly integrate with the existing simultaneous translation evaluation tool SimulEval. Through these efforts, the paper aims to advance the application and development of LLMs in the field of simultaneous translation, addressing the main challenges currently faced by LLMs in this application.