General2Specialized LLMs Translation for E-commerce

Kaidi Chen,Ben Chen,Dehong Gao,Huangyu Dai,Wen Jiang,Wei Ning,Shanqing Yu,Libin Yang,Xiaoyan Cai
2024-04-06
Abstract:Existing Neural Machine Translation (NMT) models mainly handle translation in the general domain, while overlooking domains with special writing formulas, such as e-commerce and legal documents. Taking e-commerce as an example, the texts usually include amounts of domain-related words and have more grammar problems, which leads to inferior performances of current NMT methods. To address these problems, we collect two domain-related resources, including a set of term pairs (aligned Chinese-English bilingual terms) and a parallel corpus annotated for the e-commerce domain. Furthermore, we propose a two-step fine-tuning paradigm (named G2ST) with self-contrastive semantic enhancement to transfer one general NMT model to the specialized NMT model for e-commerce. The paradigm can be used for the NMT models based on Large language models (LLMs). Extensive evaluations on real e-commerce titles demonstrate the superior translation quality and robustness of our G2ST approach, as compared with state-of-the-art NMT models such as LLaMA, Qwen, GPT-3.5, and even GPT-4.
Computation and Language,Artificial Intelligence
What problem does this paper attempt to address?
The focus of this paper is machine translation in the field of e-commerce. Existing Neural Machine Translation (NMT) models perform well in translating general domain texts, but their performance decreases when dealing with texts written in specific formats, such as e-commerce texts. This is because e-commerce texts often contain a large number of domain-specific vocabulary and have more grammar issues. In this paper, the authors collected two resources related to e-commerce: a set of term pairs (aligned bilingual terms in Chinese and English) and an annotated e-commerce parallel corpus. Then, they proposed a two-step fine-tuning paradigm (G2ST) that transforms a general NMT model into a specialized e-commerce NMT model using self-contrastive semantic augmentation. This approach can be applied to NMT models based on Large Language Models (LLMs). The experiments demonstrate that the G2ST method outperforms state-of-the-art NMT models, such as LLaMA, Qwen, GPT-3.5, and GPT-4, in terms of translation quality and robustness on real e-commerce titles.