Translate Meanings, Not Just Words: IdiomKB's Role in Optimizing Idiomatic Translation with Language Models

Shuang Li,Jiangjie Chen,Siyu Yuan,Xinyi Wu,Hao Yang,Shimin Tao,Yanghua Xiao
2023-12-25
Abstract:To translate well, machine translation (MT) systems and general-purposed language models (LMs) need a deep understanding of both source and target languages and cultures. Therefore, idioms, with their non-compositional nature, pose particular challenges for Transformer-based systems, as literal translations often miss the intended meaning. Traditional methods, which replace idioms using existing knowledge bases (KBs), often lack scale and context awareness. Addressing these challenges, our approach prioritizes context awareness and scalability, allowing for offline storage of idioms in a manageable KB size. This ensures efficient serving with smaller models and provides a more comprehensive understanding of idiomatic expressions. We introduce a multilingual idiom KB (IdiomKB) developed using large LMs to address this. This KB facilitates better translation by smaller models, such as BLOOMZ (7.1B), Alpaca (7B), and InstructGPT (6.7B), by retrieving idioms' figurative meanings. We present a novel, GPT-4-powered metric for human-aligned evaluation, demonstrating that IdiomKB considerably boosts model performance. Human evaluations further validate our KB's quality.
Computation and Language
What problem does this paper attempt to address?
The paper attempts to address the issue of accurately translating idioms in machine translation (MT) systems due to their non-compositional nature. Specifically, traditional machine translation models and general language models often translate idioms based on their literal meanings, which usually fails to convey the true meaning of the idioms. Additionally, existing idiom knowledge bases (KBs) are small in scale and lack multilingual support, resulting in limited effectiveness in practical applications. To solve these problems, the authors propose a new approach: constructing a multilingual idiom knowledge base (IDIOM KB) that utilizes large language models (LLMs) to generate the metaphorical meanings of idioms, thereby improving the performance of smaller models in idiom translation tasks. This approach not only enhances translation quality but also reduces reliance on large-scale models, allowing smaller models to excel in idiom translation. Furthermore, the paper introduces a new evaluation metric based on GPT-4 to more accurately assess the quality of idiom translations.