Graph Reasoning for Question Answering with Triplet Retrieval

Shiyang Li, Yifan Gao, Haoming Jiang, Qingyu Yin, Zheng Li, Xifeng Yan, Chao Zhang, Bing Yin
2023-05-30
Abstract: Answering complex questions often requires reasoning over knowledge graphs (KGs). State-of-the-art methods often utilize entities in questions to retrieve local subgraphs, which are then fed into a KG encoder, e.g., a graph neural network (GNN), to model their local structures and integrated into language models for question answering. However, this paradigm constrains retrieved knowledge to local subgraphs and discards more diverse triplets buried in KGs that are disconnected but useful for question answering. In this paper, we propose a simple yet effective method that first retrieves the most relevant triplets from KGs and then reranks them; the reranked triplets are concatenated with questions and fed into language models. Extensive results on both the CommonsenseQA and OpenbookQA datasets show that our method can outperform the state of the art by up to 4.6% absolute accuracy.
Computation and Language
What problem does this paper attempt to address?
This paper addresses how to make better use of knowledge graphs (KGs) when answering complex questions. Existing methods typically use the entities in a question to retrieve local subgraphs, feed them into a KG encoder such as a graph neural network (GNN) to model their local structure, and then integrate the result into a language model for question answering. This restricts the retrieved knowledge to local subgraphs and discards more diverse triplets elsewhere in the KG that are disconnected from those subgraphs but still useful for answering the question. The paper instead proposes to first retrieve the most relevant triplets from the KG, then rerank them, and finally concatenate them with the question as input to the language model. Experiments on the CommonsenseQA and OpenbookQA datasets show that this method outperforms state-of-the-art methods by up to 4.6% absolute accuracy.
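The retrieve, rerank, and concatenate steps described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the paper's implementation: the function names are hypothetical, lexical token overlap stands in for the first-stage retriever, Jaccard similarity stands in for the reranker, and `[SEP]` is only a placeholder separator. The listing does not say which retriever, reranker, or input format the authors actually use.

```python
import re
from typing import List, Tuple

# A KG triplet as (head, relation, tail). Hypothetical representation.
Triplet = Tuple[str, str, str]


def tokenize(text: str) -> List[str]:
    """Lowercase and split on anything that is not a letter or digit."""
    return re.findall(r"[a-z0-9]+", text.lower())


def verbalize(triplet: Triplet) -> str:
    """Turn a triplet into plain text, e.g. ('a', 'is_a', 'b') -> 'a is a b'."""
    return " ".join(part.replace("_", " ") for part in triplet)


def retrieve(question: str, kg: List[Triplet], k: int) -> List[Triplet]:
    """Stage 1 (stand-in): score every triplet in the KG by token overlap.

    The whole KG is scored, so triplets that are not connected to each
    other can still be selected.
    """
    q_tokens = set(tokenize(question))

    def overlap(triplet: Triplet) -> int:
        return len(q_tokens & set(tokenize(verbalize(triplet))))

    return sorted(kg, key=overlap, reverse=True)[:k]


def rerank(question: str, candidates: List[Triplet], k: int) -> List[Triplet]:
    """Stage 2 (stand-in): reorder candidates by Jaccard similarity."""
    q_tokens = set(tokenize(question))

    def jaccard(triplet: Triplet) -> float:
        t_tokens = set(tokenize(verbalize(triplet)))
        union = q_tokens | t_tokens
        return len(q_tokens & t_tokens) / len(union) if union else 0.0

    return sorted(candidates, key=jaccard, reverse=True)[:k]


def build_input(question: str, triplets: List[Triplet]) -> str:
    """Concatenate the question with the verbalized triplets."""
    knowledge = "; ".join(verbalize(t) for t in triplets)
    return f"{question} [SEP] {knowledge}"


# Toy knowledge graph, invented for this example.
toy_kg: List[Triplet] = [
    ("bird", "capable_of", "fly"),
    ("penguin", "is_a", "bird"),
    ("car", "used_for", "driving"),
    ("penguin", "not_capable_of", "fly"),
]

question = "Can a penguin fly?"
candidates = retrieve(question, toy_kg, k=3)
selected = rerank(question, candidates, k=2)
model_input = build_input(question, selected)
print(model_input)
```

The resulting string would then be passed to a language model for answer scoring. In a realistic setting both scoring functions would likely be replaced by learned models, while the two-stage shape of the pipeline stays the same.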