Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, Jinjie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

2024-10-23

Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fine-tuning. Previous work has relied on manual annotation and knowledge distillation from teacher LLMs, which are time-consuming and not accurate enough. In this paper, we introduce a novel framework for enhancing LLMs' planning capabilities by using planning data derived from knowledge graphs (KGs). LLMs fine-tuned with this data have improved planning capabilities, better equipping them to handle complex QA tasks that involve retrieval. Evaluations on multiple datasets, including our newly proposed benchmark, highlight the effectiveness of our framework and the benefits of KG-derived planning data.

Computation and Language, Artificial Intelligence

What problem does this paper attempt to address?

This paper addresses how to improve the performance of large language models (LLMs) on complex question-answering tasks, especially for smaller LLMs that struggle to decompose complex questions. Existing methods typically obtain planning supervision through manual annotation or distillation from teacher LLMs, which is time-consuming and not accurate enough. The paper instead proposes a framework that strengthens the planning capabilities of LLMs by fine-tuning them on planning data derived from knowledge graphs (KGs).

The main contributions of the paper are:

1. **Introducing the LPKG framework**: The framework uses planning data extracted from knowledge graphs to enhance the planning capabilities of LLMs.
2. **Constructing the CLQA-Wiki benchmark dataset**: A comprehensive and challenging benchmark for complex logical question answering that covers multi-hop, comparison, intersection, and union questions.
3. **Experimental validation**: On multiple conventional complex question-answering benchmarks, the LPKG framework outperforms existing baseline methods, and the KG-sourced planning data significantly improves the planning capabilities of LLMs.

Through these contributions, the paper aims to provide a more efficient and accurate way to improve LLM performance on complex question-answering tasks.
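To make the idea of "planning data derived from a knowledge graph" more concrete, here is a minimal sketch of how a two-hop pattern grounded in a KG could yield a question together with a step-wise plan. It is an illustration under stated assumptions, not the paper's actual pipeline: the toy triples, the function names `ground_two_hop` and `chain_to_example`, the template-based question wording, and the `#1`/`#2` placeholder notation are all hypothetical.

```python
from typing import Dict, List, Tuple

Triple = Tuple[str, str, str]

# Toy knowledge graph as (head, relation, tail) triples. Hypothetical data,
# chosen only so that exactly one two-hop chain exists.
TRIPLES: List[Triple] = [
    ("Inception", "director", "Christopher Nolan"),
    ("Christopher Nolan", "place of birth", "London"),
    ("Titanic", "director", "James Cameron"),
]


def ground_two_hop(triples: List[Triple]) -> List[Tuple[str, str, str, str, str]]:
    """Find every chain (a, r1, b), (b, r2, c) in the toy KG."""
    # Index outgoing edges by head entity for the second hop.
    by_head: Dict[str, List[Tuple[str, str]]] = {}
    for head, rel, tail in triples:
        by_head.setdefault(head, []).append((rel, tail))

    chains = []
    for head, r1, mid in triples:
        for r2, tail in by_head.get(mid, []):
            chains.append((head, r1, mid, r2, tail))
    return chains


def chain_to_example(chain: Tuple[str, str, str, str, str]) -> dict:
    """Turn one grounded chain into a (question, plan, answer) record.

    The question is produced by a crude template (an assumption made here
    for brevity); the plan decomposes it into two retrieval-style steps.
    """
    head, r1, _mid, r2, tail = chain
    return {
        "question": f"What is the {r2} of the {r1} of {head}?",
        "plan": [
            f"Step 1: find the {r1} of {head} -> #1",
            f"Step 2: find the {r2} of #1 -> #2 (final answer)",
        ],
        "answer": tail,
    }


examples = [chain_to_example(c) for c in ground_two_hop(TRIPLES)]
for ex in examples:
    print(ex["question"])
    for step in ex["plan"]:
        print("  ", step)
```

Because the plan follows directly from the KG structure, each record pairs a complex question with a decomposition that is correct by construction, which is the kind of supervision the summary contrasts with manual annotation and teacher-model distillation. Comparison, intersection, and union questions would need their own patterns, which this sketch does not cover.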
