An Adaptive Framework for Generating Systematic Explanatory Answer in Online Q&A Platforms

Ziyang Chen, Xiaobin Wang, Yong Jiang, Jinzhi Liao, Pengjun Xie, Fei Huang, Xiang Zhao
2024-10-23
Abstract: Question Answering (QA) systems face challenges in handling complex questions that require multi-domain knowledge synthesis. The naive RAG models, although effective in information retrieval, struggle with complex questions that require comprehensive and in-depth answers. The pioneering task is defined as explanatory answer generation, which entails handling identified challenges such as the requirement for comprehensive information and logical coherence within the generated context. To address these issues, we refer to systematic thinking theory and propose SynthRAG, an innovative framework designed to enhance QA performance. SynthRAG improves on conventional models by employing adaptive outlines for dynamic content structuring, generating systematic information to ensure detailed coverage, and producing customized answers tailored to specific user inquiries. This structured approach guarantees logical coherence and thorough integration of information, yielding responses that are both insightful and methodically organized. Empirical evaluations underscore SynthRAG's effectiveness, demonstrating its superiority in handling complex questions, overcoming the limitations of naive RAG models, and significantly improving answer quality and depth. Furthermore, an online deployment on the Zhihu platform revealed that SynthRAG's answers achieved notable user engagement, with each response averaging 5.73 upvotes and surpassing the performance of 79.8% of human contributors, highlighting the practical relevance and impact of the proposed framework. Our code is available at https://github.com/czy1999/SynthRAG.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses the difficulty that existing question-answering (QA) systems have with complex questions, especially those requiring the integration of multi-domain knowledge and in-depth explanation. Although traditional Retrieval-Augmented Generation (RAG) models perform well at information retrieval, they fall short when generating comprehensive and logically coherent answers. To address this, the authors propose a new task, explanatory answer generation, and design the SynthRAG framework to improve the performance of QA systems.

### Specific manifestations of the problem

1. **Incomplete information coverage**: Traditional RAG methods rely on keywords and question embeddings to retrieve relevant information. They tend to overlook information that is relevant but not explicitly mentioned in the question, so coverage is incomplete.
2. **Weak logical organization**: Traditional RAG methods usually retrieve fragmented information, and the generated answers simply aggregate these isolated fragments, so they lack logical coherence and depth.

### Solutions of the SynthRAG framework

The SynthRAG framework addresses these problems in three main steps:

1. **Adaptive Outline Generation**:
   - Learn the optimal outline structures for different types of questions from historical data.
   - Dynamically adjust the outline to ensure comprehensive coverage of key information and logical coherence.
2. **Systematic Information Generation**:
   - Conduct hierarchical information retrieval for each subsection of the outline to ensure detailed and consistent information.
   - Generate content in parallel while keeping each part coherent and logically consistent.
3. **Customized Answer Generation**:
   - Use high-quality historical answers as references to help the model better understand the question as a whole.
   - Refine the generated content and eliminate redundancy, so that the final answer has a clear structure, relevant content, and logical coherence.

### Main contributions of the paper

- **Systematically analyze the limitations of LLMs on questions requiring comprehensive and in-depth answers**, especially the deficiencies in information integration and answer depth.
- **Propose the SynthRAG framework**, an innovative Retrieval-Augmented Generation framework that can effectively integrate information fragments and provide a comprehensive and robust knowledge base.
- **Introduce a holistic perspective** that promotes high-level associative thinking and reasoning, addressing the key problem of fragmented information in traditional RAG models.
- **Demonstrate the method's advantages over existing methods through a series of human-based and LLM-based evaluations**, including an online deployment in which SynthRAG's answers achieved significant user engagement.

In conclusion, this paper aims to improve the ability of QA systems to handle complex questions through the SynthRAG framework and to provide more comprehensive, in-depth, and logically coherent answers.
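
The three SynthRAG steps summarized above (adaptive outline generation, systematic information generation, customized answer generation) can be illustrated with a toy Python sketch. This is not the authors' implementation. Every name here (`OUTLINE_TEMPLATES`, `generate_outline`, `retrieve`, `draft_sections`, `refine`, `answer`) is hypothetical. The hard-coded templates stand in for outlines learned from historical data, keyword overlap stands in for hierarchical retrieval, and plain string joining stands in for LLM generation. The step that uses historical answers as references is omitted.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Assumption: outlines are looked up by the first word of the question.
# The paper learns outline structures from historical data instead.
OUTLINE_TEMPLATES = {
    "why": ["Background", "Causes", "Implications"],
    "how": ["Background", "Steps", "Pitfalls"],
}
DEFAULT_OUTLINE = ["Background", "Analysis", "Summary"]


def tokenize(text):
    """Lowercase a string and return its set of alphabetic tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))


def generate_outline(question):
    """Step 1 (toy): choose an outline that depends on the question type."""
    words = question.lower().split()
    key = words[0] if words else ""
    return list(OUTLINE_TEMPLATES.get(key, DEFAULT_OUTLINE))


def retrieve(question, section, corpus, k=1):
    """Step 2 (toy): rank passages by overlap with question + section title."""
    query = tokenize(question) | tokenize(section)
    ranked = sorted(corpus, key=lambda doc: len(query & tokenize(doc)), reverse=True)
    return ranked[:k]


def draft_sections(question, outline, corpus):
    """Step 2 (toy): gather material for every outline section in parallel."""
    with ThreadPoolExecutor() as pool:
        passages = pool.map(lambda s: retrieve(question, s, corpus), outline)
        # pool.map yields results in outline order, so zip keeps them aligned.
        return list(zip(outline, passages))


def refine(drafts):
    """Step 3 (toy): drop passages already used and assemble the answer."""
    seen, lines = set(), []
    for section, passages in drafts:
        fresh = [p for p in passages if p not in seen]
        seen.update(fresh)
        if fresh:  # skip sections left empty after removing redundancy
            lines.append(f"{section}: " + " ".join(fresh))
    return "\n".join(lines)


def answer(question, corpus):
    """Run the three toy steps end to end."""
    outline = generate_outline(question)
    return refine(draft_sections(question, outline, corpus))
```

Calling `answer(question, corpus)` with a list of passage strings returns one line per outline section that received non-redundant material. Because retrieval is conditioned on the section title as well as the question, passages that the question alone would rank low can still be selected, which is the intuition behind outline-driven retrieval improving coverage.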