Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Cheng, Tuo Zhao, Jing Gao
Abstract: Retrieval-augmented Large Language Models (LLMs) offer substantial benefits in enhancing performance across knowledge-intensive scenarios. However, these methods often face challenges with complex inputs and encounter difficulties due to noisy knowledge retrieval, notably hindering model effectiveness. To address this issue, we introduce BlendFilter, a novel approach that elevates retrieval-augmented LLMs by integrating query generation blending with knowledge filtering. BlendFilter proposes the blending process through its query generation method, which integrates both external and internal knowledge augmentation with the original query, ensuring comprehensive information gathering. Additionally, our distinctive knowledge filtering module capitalizes on the intrinsic capabilities of the LLM, effectively eliminating extraneous data. We conduct extensive experiments on three open-domain question answering benchmarks, and the findings clearly indicate that our innovative BlendFilter surpasses state-of-the-art baselines significantly.
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve
This paper aims to address the challenges faced by Retrieval-Augmented Large Language Models (LLMs) when dealing with complex inputs and the issue of noisy information in the retrieved knowledge.
#### Specific Problems:
1. **Handling Complex Inputs**:
- Current retrieval-augmented methods struggle with complex inputs. A simple query usually surfaces the relevant information on its own, but a multifaceted query, used as-is, may not cover every key piece of information, so the retriever can miss some of the relevant documents.
2. **Filtering Noisy Knowledge**:
- The retrieved knowledge documents may contain irrelevant or misleading information. Typically, the top K documents returned by the retriever are used for augmentation, but these documents may not all be relevant to the task. Including such noisy information in the augmented query can lead to inaccurate outputs from the LLM.
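
Both problems can be seen in a small, self-contained sketch. It is not taken from the paper: the word-overlap scorer stands in for a real retriever, and the corpus, the question, and every function name are made up for illustration.

```python
from typing import List, Tuple


def tokenize(text: str) -> set:
    """Lowercase the text, drop '?' and '.', and split on whitespace."""
    return set(text.lower().replace("?", "").replace(".", "").split())


def overlap_score(query: str, doc: str) -> int:
    """Toy relevance score: count of distinct words shared by query and document."""
    return len(tokenize(query) & tokenize(doc))


def top_k(query: str, corpus: List[str], k: int) -> List[Tuple[int, str]]:
    """Return the k highest-scoring (score, document) pairs."""
    scored = [(overlap_score(query, doc), doc) for doc in corpus]
    # sorted() is stable, so documents with equal scores keep corpus order.
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]


# Hypothetical corpus and multi-part question, made up for illustration.
CORPUS = [
    "Marie Curie won the Nobel Prize in Physics in 1903.",
    "Marie Curie was born in Warsaw.",
    "The Nobel Prize ceremony is held in Stockholm.",
    "Warsaw is the capital of Poland.",
]
QUERY = "Where was the scientist who won the 1903 Nobel Prize in Physics born?"

for score, doc in top_k(QUERY, CORPUS, k=2):
    print(score, doc)
```

In this toy setup the Stockholm sentence outranks the Warsaw sentence that actually holds the answer, so a top-2 cutoff both admits a distractor and misses the needed document. Raising the cutoff to 3 recovers the answer but keeps the distractor.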
### Solution
To address the above issues, the paper proposes **BlendFilter**, a novel framework that enhances the performance of retrieval-augmented LLMs through the integration of Query Generation Blending and Knowledge Filtering.
#### Core Components:
1. **Query Generation Blending Module**:
- Enriches the original query through several augmentation strategies and combines the results into a composite set of queries, so that complex inputs are covered more fully. The augmentation draws on both external and internal knowledge sources.
2. **Knowledge Filtering Module**:
- Aims to eliminate irrelevant knowledge from the retrieved documents without requiring additional language models, utilizing the LLM's own filtering capabilities.
3. **Answer Generation Module**:
- In the final stage, the LLM combines the filtered knowledge with the original query to generate the final answer.
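
The three modules above can be sketched as a small pipeline. This is a minimal illustration under stated assumptions, not the authors' implementation: the prompts, the yes/no relevance check, the choice to form augmented queries by appending a draft answer to the original query, and all function names are hypothetical. The LLM and retriever are passed in as plain callables, and the toy stubs at the bottom exist only to make the sketch runnable.

```python
from typing import Callable, List

LLM = Callable[[str], str]                   # prompt -> completion
Retriever = Callable[[str, int], List[str]]  # (query, k) -> documents


def blend_queries(query: str, llm: LLM, retrieve: Retriever, k: int) -> List[str]:
    """Return the original, externally augmented, and internally augmented queries."""
    # External augmentation (assumed form): draft an answer from retrieved documents.
    docs = retrieve(query, k)
    external_draft = llm("Context:\n" + "\n".join(docs) + "\nQuestion: " + query)
    # Internal augmentation (assumed form): draft an answer from the LLM alone.
    internal_draft = llm("Question: " + query)
    return [query, query + " " + external_draft, query + " " + internal_draft]


def filter_knowledge(query: str, docs: List[str], llm: LLM) -> List[str]:
    """Keep only the documents that the same LLM judges relevant (assumed yes/no prompt)."""
    kept = []
    for doc in docs:
        verdict = llm(
            "Is this document relevant to the question? Answer yes or no.\n"
            f"Question: {query}\nDocument: {doc}"
        )
        if verdict.strip().lower().startswith("yes"):
            kept.append(doc)
    return kept


def blendfilter_answer(query: str, llm: LLM, retrieve: Retriever, k: int = 5) -> str:
    """Retrieve with every blended query, filter, deduplicate, then answer."""
    knowledge: List[str] = []
    for blended in blend_queries(query, llm, retrieve, k):
        for doc in filter_knowledge(query, retrieve(blended, k), llm):
            if doc not in knowledge:
                knowledge.append(doc)
    # The final prompt pairs the filtered knowledge with the original query.
    return llm("Context:\n" + "\n".join(knowledge) + "\nQuestion: " + query)


# Toy stand-ins so the sketch runs without a real model or index.
TOY_DOCS = [
    "Paris is the capital of France.",
    "The Seine flows through Paris.",
    "Bananas are rich in potassium.",
]


def toy_retrieve(query: str, k: int) -> List[str]:
    """Stub retriever: ignores the query and returns the first k documents."""
    return TOY_DOCS[:k]


def toy_llm(prompt: str) -> str:
    """Stub LLM with canned replies keyed on the prompt prefix."""
    if prompt.startswith("Is this document relevant"):
        return "no" if "Bananas" in prompt else "yes"
    if prompt.startswith("Context:"):
        return "Paris"
    return "Paris, France"


print(blendfilter_answer("What is the capital of France?", toy_llm, toy_retrieve, k=3))
```

Passing the model and retriever as callables keeps the sketch independent of any particular backbone or index. Swapping in a real LLM client or a BM25 or ColBERT v2 retriever would only require matching the two call signatures.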
### Main Contributions:
1. **Introduction of a New Query Generation Blending Method**:
- Unlike existing works that rely on a single source, this method enriches queries using multiple knowledge sources, thereby covering relevant knowledge more comprehensively.
2. **Proposal of a New Effective Knowledge Filtering Module**:
- According to the authors, this is the first work to use the LLM itself as a filter for eliminating irrelevant knowledge.
3. **Extensive Experimental Validation**:
- The authors run extensive experiments on three open-domain question-answering benchmarks, which show that the proposed BlendFilter model significantly outperforms baseline models.
### Experimental Results:
- **Performance Comparison**:
- BlendFilter improves performance significantly across backbone models and datasets. With GPT-3.5-turbo-Instruct, Vicuna 1.5-13b, and Qwen-7b as backbones, the reported average improvements are 9.7%, 7.4%, and 14.2%, respectively.
- **Generalization Across Different Retrievers**:
- BlendFilter performs well with both ColBERT v2 and BM25 retrievers, with particularly notable performance improvements on BM25.
- **Retrieval Effectiveness**:
- The Knowledge Filtering Module effectively eliminates irrelevant documents, improving retrieval accuracy.
- **Importance of Different Queries**:
- The original, externally augmented, and internally augmented queries all play important roles in answer generation. Internally augmented queries are particularly crucial with the sparse retriever BM25.
- **Impact of the Number of Retrieved Documents**:
- Appropriately increasing the number of retrieved knowledge documents can improve model performance, but too many documents can introduce noisy information, leading to performance degradation.
- **Impact of Sampling Temperature**:
- Sampling temperature has some effect on model performance, but BlendFilter performs well across the temperature settings tested.