Conversational Question Answering with Reformulations over Knowledge Graph

Lihui Liu,Blaine Hill,Boxin Du,Fei Wang,Hanghang Tong
2024-03-29
Abstract:Conversational question answering (convQA) over knowledge graphs (KGs) involves answering multi-turn natural language questions about information contained in a KG. State-of-the-art methods of ConvQA often struggle with inexplicit question-answer pairs. These inputs are easy for human beings to understand given a conversation history, but hard for a machine to interpret, which can degrade ConvQA performance. To address this problem, we propose a reinforcement learning (RL) based model, CornNet, which utilizes question reformulations generated by large language models (LLMs) to improve ConvQA performance. CornNet adopts a teacher-student architecture where a teacher model learns question representations using human writing reformulations, and a student model to mimic the teacher model's output via reformulations generated by LLMs. The learned question representation is then used by an RL model to locate the correct answer in a KG. Extensive experimental results show that CornNet outperforms state-of-the-art convQA models.
Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to handle implicit or unclear question - answer pairs in Conversational Question Answering (ConvQA) on knowledge graphs. Specifically, these questions are easy for humans to understand in the context of the conversation history, but difficult for machines to parse, which will reduce the performance of the ConvQA system. To solve this problem, the paper proposes a model based on Reinforcement Learning (RL) - CoRnNet, which uses question restatements generated by Large Language Models (LLMs) to improve the performance of ConvQA. ### Main contributions: 1. **Analysis**: The paper shows that although LLMs are excellent question - restatement tools, their performance still lags behind the human level. 2. **Algorithm**: A model based on RL, CoRnNet, is proposed. This model uses question restatements to improve question - answering performance. Through the teacher - student architecture, it uses the restatements generated by LLMs to approach human - level performance. 3. **Empirical evaluation**: The experimental results show that CoRnNet has a more consistent performance than the existing state - of - the - art conversational question - answering baseline methods on multiple real - world datasets. ### Key points of the solution: - **Question restatement**: Use LLMs to generate high - quality question restatements to enhance the semantic integrity of the questions. - **Teacher - student architecture**: The teacher model is trained with human - written restatements, and the student model uses the restatements generated by LLMs to imitate the output of the teacher model. - **Reinforcement learning**: Navigate through the knowledge graph through the RL model to find the correct answer. The RL model samples actions according to the policy network, guides the search direction and identifies candidate answers. ### Experimental results: - **Overall performance**: On the ConvQuestions and ConvRef datasets, CoRnNet outperforms other baseline methods in both the H@5 and MRR metrics. For example, on the H@5 metric, CoRnNet is 4.5% higher than CONQUER and 20% higher than CONVEX; on the MRR metric, CONVEX has the worst performance, 13.7% lower than CoRnNet. - **Comparison methods**: OAT performs best on the P@1 metric, but is 7.3% lower than CoRnNet on the MRR metric. Focal Entity ranks second on the P@1 metric and third on the MRR metric. In conclusion, this paper significantly improves the performance of the conversational question - answering system in handling implicit questions by introducing question - restatement and reinforcement - learning techniques.