1+1>2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators?

Yue Huang,Chenrui Fan,Yuan Li,Siyuan Wu,Tianyi Zhou,Xiangliang Zhang,Lichao Sun
2024-06-21
Abstract:Large Language Models (LLMs) have garnered significant attention due to their remarkable ability to process information across various languages. Despite their capabilities, they exhibit inconsistencies in handling identical queries in different languages, presenting challenges for further advancement. This paper introduces a method to enhance the multilingual performance of LLMs by aggregating knowledge from diverse languages. This approach incorporates a low-resource knowledge detector specific to a language, a language selection process, and mechanisms for answer replacement and integration. Our experiments demonstrate notable performance improvements, particularly in reducing language performance disparity. An ablation study confirms that each component of our method significantly contributes to these enhancements. This research highlights the inherent potential of LLMs to harmonize multilingual capabilities and offers valuable insights for further exploration.
Computation and Language
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the consistency issues that large language models (LLMs) encounter when handling queries in different languages. Specifically, although LLMs perform excellently in processing multiple languages, their responses to queries with the same meaning in different languages are often inconsistent, leading to performance disparities and fairness issues. To solve this problem, the authors propose a new method that enhances the multilingual performance of LLMs by integrating knowledge from different languages. This method includes the following key steps: 1. **Low-Resource Knowledge Detector**: Used to identify whether the user's query contains low-resource knowledge in a specific language. 2. **Target Language Selection**: If low-resource knowledge is detected, the most suitable language for handling the query is selected. 3. **Answer Replacement and Integration**: The original query is translated into the target language, and the answer generated based on the target language is used for replacement or integration, finally translating the answer back into the user's original language. Experimental results show that this method can significantly improve the performance of LLMs in multilingual tasks, especially by reducing the performance gap between different languages, thereby enhancing the fairness of downstream applications. Additionally, ablation studies confirm the importance of each component in improving overall performance.