Enhancing Large Language Models with Pseudo- and Multisource-Knowledge Graphs for Open-ended Question Answering

Jiaxiang Liu, Tong Zhou, Yubo Chen, Kang Liu, Jun Zhao
2024-02-15
Abstract: Mitigating the hallucinations of Large Language Models (LLMs) and enhancing these models is a crucial task. Although some existing methods employ model self-enhancement techniques, they fall short of effectively addressing hallucinations that involve unknown facts. Existing Knowledge Graph (KG) enhancement approaches fail to address generalization across different KG sources and the enhancement of open-ended question answering at the same time. To tackle these limitations, we propose a framework that combines Pseudo-Graph Generation and Atomic Knowledge Verification. Pseudo-Graph Generation makes it possible to enhance an LLM with a KG in an open-ended question-answering setting. Atomic Knowledge Verification uses atomic-level knowledge querying and verification to achieve generalizability across different KG sources. Compared to the baseline, this approach yields a minimum improvement of 11.5 in ROUGE-L score on open-ended questions. For precise questions, we observe a minimum accuracy improvement of 7.5. Moreover, we demonstrate that this framework generalizes across different KG sources. In summary, our results pave the way for enhancing LLMs by incorporating Pseudo- and Multisource-KGs, particularly in the context of open-ended questions.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
The paper primarily addresses the issues of hallucinations and lack of domain-specific knowledge in large language models (LLMs) when handling open-ended question answering. Specifically, the paper attempts to solve the following key problems:

1. **Mitigating Hallucination Issues**: LLMs may generate incorrect information or uncertain knowledge when dealing with complex questions, a phenomenon known as "hallucinations." Existing methods, although capable of improving this issue through model self-enhancement techniques (such as Chain-of-Thought, CoT prompts), still cannot fundamentally solve the hallucination problem.
2. **Enhancing LLMs with Knowledge Graphs**: Knowledge Graphs (KGs) are highly valuable in LLM tasks due to their structured knowledge, high accuracy, and timely updated information. However, existing methods have limitations when utilizing knowledge graphs for enhancement, such as the inability to effectively handle knowledge graphs from different sources and limited enhancement effects for open-ended question answering.

To address the above challenges, the paper proposes a method that combines Pseudo-Graph Generation and Atomic Knowledge Verification. This method aims to leverage the capabilities of LLMs to generate pseudo-triples related to the question and further use these pseudo-triples to query the knowledge graph to obtain actual triple information. Additionally, an atomic knowledge verification step is proposed to ensure that the method maintains good generalization capabilities across different knowledge graph sources.

The main contributions of this method include:

- **Pseudo-Graph Generation**: Utilizing LLMs to generate pseudo-triples related to the question, allowing the framework to use knowledge graphs as a means of knowledge enhancement for LLMs, particularly suitable for open-ended question answering.
- **Atomic Knowledge Verification**: Using atomic-level knowledge for querying and verification, ensuring good generalization capabilities across different knowledge graph sources.
- **Empirical Results**: Experiments conducted on the Natural Questions dataset show that this method not only performs well on this dataset but also performs well on other existing datasets such as QALD-10 and SimpleQuestions.

In summary, this paper proposes a new method to address the limitations of large language models in open-ended question answering and demonstrates its effectiveness and generalization capabilities through experiments.
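
The flow described above (generate pseudo-triples, use them to query a knowledge graph, then verify each one against the retrieved facts) can be illustrated with a toy sketch. This is not the authors' implementation, and every name in it is hypothetical. The LLM call is replaced by a hard-coded stub (`generate_pseudo_graph`), the knowledge graph is a three-triple in-memory list, retrieval uses plain word-overlap (Jaccard) similarity, and verification is a simple subject-and-relation match, whereas the paper describes verification at a higher level than this rule.

```python
from typing import List, NamedTuple, Optional, Set


class Triple(NamedTuple):
    subject: str
    relation: str
    obj: str


def tokens(triple: Triple) -> Set[str]:
    """Lowercased word set of a triple, used for a crude lexical similarity."""
    return set(" ".join(triple).lower().split())


def jaccard(a: Triple, b: Triple) -> float:
    """Word-overlap similarity between two triples (assumed retrieval score)."""
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def generate_pseudo_graph(question: str) -> List[Triple]:
    """Stand-in for the LLM call: returns hard-coded guesses, some wrong."""
    return [
        Triple("Marie Curie", "born in", "Paris"),           # wrong object
        Triple("Marie Curie", "field", "physics"),           # correct
        Triple("Marie Curie", "spouse", "Albert Einstein"),  # unsupported
    ]


def retrieve(pseudo: Triple, kg: List[Triple], k: int = 2) -> List[Triple]:
    """Return the k knowledge-graph triples most similar to a pseudo-triple."""
    return sorted(kg, key=lambda t: jaccard(pseudo, t), reverse=True)[:k]


def verify(pseudo: Triple, evidence: List[Triple]) -> Optional[Triple]:
    """Check one pseudo-triple against retrieved facts.

    Assumed rule: if a retrieved fact shares subject and relation, trust the
    knowledge graph's version; otherwise drop the pseudo-triple.
    """
    for fact in evidence:
        if (fact.subject, fact.relation) == (pseudo.subject, pseudo.relation):
            return fact
    return None


def verified_graph(question: str, kg: List[Triple]) -> List[Triple]:
    """Pseudo-graph generation followed by per-triple verification."""
    checked = (verify(p, retrieve(p, kg)) for p in generate_pseudo_graph(question))
    return [t for t in checked if t is not None]


# Tiny illustrative knowledge graph (made-up storage format, real-world facts).
TOY_KG = [
    Triple("Marie Curie", "born in", "Warsaw"),
    Triple("Marie Curie", "field", "physics"),
    Triple("Pierre Curie", "born in", "Paris"),
]

if __name__ == "__main__":
    for triple in verified_graph("Tell me about Marie Curie.", TOY_KG):
        print(triple)
```

In this sketch the wrong birthplace is replaced by the knowledge graph's value, the correct triple passes through unchanged, and the unsupported spouse triple is dropped. Because each pseudo-triple is checked on its own, swapping `TOY_KG` for a different source only changes what `retrieve` searches over, which is the intuition behind verifying at the level of individual triples.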