Abstract:Knowledge graph question answering (KGQA) aims to answer natural language questions from structured knowledge graphs (KGs). Traditional KQGA methods are usually limited to single-hop queries and cannot handle complex questions involving multi-hop reasoning well. To overcome this issue, multi-hop KGQA based on reinforcement learning (RL) has been proposed. However, multi-hop KGQA based on RL still faces some challenges. Firstly, due to the insufficient availability of latent environmental information during the reasoning process, the agent finds it challenging to make coherent and correct decisions. Secondly, the agent only receives rewards from the environment upon reaching the answer entity during the exploration, leading to slow or even obstructed learning. To address these shortcomings, we construct multi-perspective information based on the state of the environment, and integrate multi-perspective information with RL framework, thereby creating the Multi-Perspective Information Fusion Reasoning Network (MPIFRN). MPIFRN achieves the goal via three steps. (1) We construct three different views of information, i.e., expectation embedding, instruction-guided embedding, and path-aware embedding. These environmental cues provide more reliable support for decision-making. (2) We still adopt the method of mapping entities and relations into the knowledge graph embedding space to answer multi-hop questions. At each step of reasoning, we use a scoring function to measure the plausibility of each "triple" ¡topic entity, question, candidate entity¿ in the embedding space. (3) Furthermore, we employ the asynchronous advantage actor-critic (A3C) algorithm to guide the agent in selecting the most promising entities and to expand the reasoning paths in parallel by updating policy and value network parameters, thereby facilitating multi-hop knowledge graph question answering. We conduct extensive experiments on KGQA benchmark datasets, providing substantial evidence to demonstrate the effectiveness of our approach.

A Multi-step Q Reinforcement Learning Algorithm

A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning

Gradient Q(σ, Λ): A Unified Algorithm with Function Approximation for Reinforcement Learning

Gradient Q : A Unified Algorithm with Function Approximation for Reinforcement Learning

Multi-step Reinforcement Learning: A Unifying Algorithm

A Fuzzy Multi-Step Q Learning Algorithm Based On Q(Lambda)- Learning And Its Application

Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

Towards a Simple Approach to Multi-step Model-based Reinforcement Learning

Multi-Agent Alternate Q-Learning.

Incorporating multi-perspective information into reinforcement learning to address multi-hop knowledge graph question answering

A Multi-Agent Reinforcement Learning Algorithm Based on Stackelberg Game

Mitigating Relative Over-Generalization in Multi-Agent Reinforcement Learning

Multiple rewards fuzzy reinforcement learning algorithm in RoboCup environment

Decentralised Q-Learning for Multi-Agent Markov Decision Processes with a Satisfiability Criterion

A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control

Context-aware Active Multi-Step Reinforcement Learning

A multiagent reinforcement learning approach based on different states

Combating the Compounding-Error Problem with a Multi-step Model

Two-Step Q-Learning

Randomized Ensembled Double Q-Learning: Learning Fast Without a Model