CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models

Meiqi Chen,Fandong Meng,Yingxue Zhang,Yan Zhang,Jie Zhou

2024-10-28

Abstract:Large language models (LLMs) have shown great promise in machine translation, but they still struggle with contextually dependent terms, such as new or domain-specific words. This leads to inconsistencies and errors that are difficult to address. Existing solutions often depend on manual identification of such terms, which is impractical given the complexity and evolving nature of language. While Retrieval-Augmented Generation (RAG) could provide some assistance, its application to translation is limited by issues such as hallucinations from information overload. In this paper, we propose CRAT, a novel multi-agent translation framework that leverages RAG and causality-enhanced self-reflection to address these challenges. This framework consists of several specialized agents: the Unknown Terms Identification agent detects unknown terms within the context, the Knowledge Graph (KG) Constructor agent extracts relevant internal knowledge about these terms and retrieves bilingual information from external sources, the Causality-enhanced Judge agent validates the accuracy of the information, and the Translator agent incorporates the refined information into the final output. This automated process allows for more precise and consistent handling of key terms during translation. Our results show that CRAT significantly improves translation accuracy, particularly in handling context-sensitive terms and emerging vocabulary.

Computation and Language

What problem does this paper attempt to address?

### The Problem the Paper Attempts to Solve This paper aims to address the issues of inconsistency and errors in handling context-dependent terms encountered by large language models (LLMs) in machine translation. Specifically: 1. **Handling of Context-Dependent Terms**: LLMs tend to exhibit inconsistencies and errors when dealing with new words or domain-specific terms. These terms may have different meanings in different contexts, leading to inaccurate translation results. 2. **Limitations of Manual Term Identification**: Existing solutions often rely on manual identification of these terms, but this approach is impractical in real-world applications. The complexity and ever-evolving nature of language make manual identification difficult to achieve. 3. **Hallucinations Caused by Information Overload**: Although retrieval-augmented generation (RAG) can provide some assistance, its application in translation is limited by the hallucination problem caused by information overload. Excessive external information can interfere with accurate translation, leading to errors. To address these issues, the paper proposes CRAT (Causally Reinforced Reflective and Retrieval-Augmented Translation Multi-Agent Framework). This framework improves translation consistency and accuracy by automatically identifying unknown terms, extracting internal and external knowledge, verifying the accuracy of information, and ultimately generating precise translations.

CRAT: A Multi-Agent Framework for Causality-Enhanced Reflective and Retrieval-Augmented Translation with Large Language Models

ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents

Retrieval-Augmented Machine Translation with Unstructured Knowledge

Exploring Human-Like Translation Strategy with Large Language Models

MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents

(Perhaps) Beyond Human Translation: Harnessing Multi-Agent Collaboration for Translating Ultra-Long Literary Texts

TRANSAGENT: An LLM-Based Multi-Agent System for Code Translation

CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models

ClinicalAgent: Clinical Trial Multi-Agent System with Large Language Model-based Reasoning

Refining Translations with LLMs: A Constraint-Aware Iterative Prompting Approach

Cross Attention Augmented Transducer Networks for Simultaneous Translation.

DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory

RAP: Retrieval-Augmented Planning with Contextual Memory for Multimodal LLM Agents

Corrective Retrieval Augmented Generation

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

XAgents: A Framework for Interpretable Rule-Based Multi-Agents Cooperation

AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant

Chain of Agents: Large Language Models Collaborating on Long-Context Tasks

From LLM to Conversational Agent: A Memory Enhanced Architecture with Fine-Tuning of Large Language Models