Abstract: Translation-tailored large language models (LLMs) exhibit remarkable translation capabilities, even competing with commercial translation systems trained with supervision. However, off-target translation remains an unsolved problem, especially for low-resource languages, which hinders the development of accurate LLM-based translation models. To mitigate the off-target translation problem and enhance the translation performance of LLMs, recent works have either designed advanced prompting strategies to highlight the functionality of translation instructions or exploited the in-context learning ability of LLMs by feeding few-shot demonstrations. However, these methods do not fundamentally improve an LLM's ability to follow translation instructions, especially the language direction information. In this work, we design a two-stage fine-tuning algorithm to improve the instruction-following ability of LLMs, especially with respect to the translation direction. Specifically, we first tune LLMs with the maximum likelihood estimation loss on the translation dataset to elicit basic translation capabilities. In the second stage, we construct instruction-conflicting samples by randomly replacing the translation direction within the instruction with a wrong one, and then introduce an extra unlikelihood loss to learn from those samples. Experiments on IWSLT and WMT benchmarks with the LLaMA model, spanning 16 zero-shot directions, show that, compared to a competitive baseline (translation-finetuned LLaMA), our method effectively reduces the off-target translation ratio (-53.3\% on average), thus improving translation quality by an average of +5.7 SacreBLEU and +16.4 BLEURT. Analysis shows that our method preserves the model's general task performance on AlpacaEval. Code and models will be released at \url{
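
The second-stage objective described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the instruction template, the language list, the per-token averaging, and the weight `alpha` are all assumptions made for illustration, and the token probabilities are passed in as plain lists rather than computed by a model.

```python
import math
import random

# Hypothetical language set and instruction template (not taken from the paper).
LANGS = ["English", "German", "Dutch", "Italian"]
TEMPLATE = "Translate the following text from {src} to {tgt}."


def make_conflicting_instruction(src, tgt, rng):
    """Build an instruction whose target language is deliberately wrong.

    Assumption: the wrong language is drawn uniformly from the languages
    that are neither the true source nor the true target.
    """
    wrong = rng.choice([lang for lang in LANGS if lang not in (src, tgt)])
    return TEMPLATE.format(src=src, tgt=wrong)


def mle_loss(token_probs):
    """Mean negative log-likelihood of the reference tokens."""
    return -sum(math.log(p) for p in token_probs) / len(token_probs)


def unlikelihood_loss(token_probs, eps=1e-8):
    """Mean of -log(1 - p): penalises probability mass that the model still
    places on the reference tokens when the instruction is conflicting."""
    return -sum(math.log(max(1.0 - p, eps)) for p in token_probs) / len(token_probs)


def stage2_loss(probs_correct, probs_conflicting, alpha=1.0):
    """MLE on the correct sample plus a weighted unlikelihood term on the
    instruction-conflicting sample (alpha is a hypothetical mixing weight)."""
    return mle_loss(probs_correct) + alpha * unlikelihood_loss(probs_conflicting)
```

In this sketch, `probs_correct` would hold the model's probabilities for the reference tokens under the true instruction, and `probs_conflicting` the probabilities for the same tokens under the output of `make_conflicting_instruction`. The unlikelihood term grows as the model keeps producing the original target language despite the changed instruction.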

Locally Training the Log-Linear Model for SMT

Discriminative Training for Log-Linear Based SMT: Global or Local Methods

Softmax-margin Training for Statistical Machine Translation

Transductive Minimum Error Rate Training for Statistical Machine Translation

Expected Error Minimization with Ultraconservative Update for SMT

Adaptive development data selection for log-linear model in statistical machine translation

Coordinate System Selection for Minimum Error Rate Training in Statistical Machine Translation

A simple discriminative training method for machine translation with large-scale features

Building Accurate Translation-Tailored LLMs with Language Aware Instruction Tuning

Maximum Rank Correlation Training for Statistical Machine Translation

Training MT Model Using Structural SVM

Instruction-Tuned LLMs Succeed in Document-Level MT Without Fine-Tuning -- But BLEU Turns a Blind Eye

Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing

Non-linear Learning for Statistical Machine Translation

Enhancing Document-level Translation of Large Language Model via Translation Mixed-instructions

Tencent AI Lab - Shanghai Jiao Tong University Low-Resource Translation System for the WMT22 Translation Task

SemMT: A Semantic-based Testing Approach for Machine Translation Systems

LANDeRMT: Detecting and Routing Language-Aware Neurons for Selectively Finetuning LLMs to Machine Translation

Discriminative Training of 150 Million Translation Parameters and Its Application to Pruning

LMTuner: An user-friendly and highly-integrable Training Framework for fine-tuning Large Language Models

Improvement Comparison of Different Lattice-based Discriminative Training Methods in Chinese-monolingual and Chinese-English-bilingual Speech Recognition