Abstract:Hippocampal pyramidal cells and interneurons play a key role in spatial navigation. In goal-directed behavior associated with rewards, the spatial firing pattern of pyramidal cells is modulated by the animal's moving direction toward a reward, with a dependence on auditory, olfactory, and somatosensory stimuli for head orientation. Additionally, interneurons in the CA1 region of the hippocampus monosynaptically connected to CA1 pyramidal cells are modulated by a complex set of interacting brain regions related to reward and recall. The computational method of reinforcement learning (RL) has been widely used to investigate spatial navigation, which in turn has been increasingly used to study rodent learning associated with the reward. The rewards in RL are used for discovering a desired behavior through the integration of two streams of neural activity: trial-and-error interactions with the external environment to achieve a goal, and the intrinsic motivation primarily driven by brain reward system to accelerate learning. Recognizing the potential benefit of the neural representation of this reward design for novel RL architectures, we propose a RL algorithm based on [Formula: see text]-learning with a perspective on biomimetics (neuro-inspired RL) to decode rodent movement trajectories. The reward function, inspired by the neuronal information processing uncovered in the hippocampus, combines the preferred direction of pyramidal cell firing as the extrinsic reward signal with the coupling between pyramidal cell-interneuron pairs as the intrinsic reward signal. Our experimental results demonstrate that the neuro-inspired RL, with a combined use of extrinsic and intrinsic rewards, outperforms other spatial decoding algorithms, including RL methods that use a single reward function. The new RL algorithm could help accelerate learning convergence rates and improve the prediction accuracy for moving trajectories.

Modular deep reinforcement learning from reward and punishment for robot navigation

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Reward-Punishment Reinforcement Learning with Maximum Entropy

Spiking Reinforcement Learning with Memory Ability for Mapless Navigation

Bio-robots Automatic Navigation with Graded Electric Reward Stimulation Based on Reinforcement Learning

Deep Model-Based Reinforcement Learning for Predictive Control of Robotic Systems with Dense and Sparse Rewards

Modular inverse reinforcement learning for visuomotor behavior

Enhancing Robotic Navigation: An Evaluation of Single and Multi-Objective Reinforcement Learning Strategies

Hierarchical reinforcement learning for handling sparse rewards in multi-goal navigation

Autonomous Learning and Navigation of Mobile Robots Based on Deep Reinforcement Learning

Generalization in Deep Reinforcement Learning for Robotic Navigation by Reward Shaping

Research on Autonomous Robots Navigation based on Reinforcement Learning

On Reward Shaping for Mobile Robot Navigation: A Reinforcement Learning and SLAM Based Approach

Accelerated Robot Learning via Human Brain Signals

Beyond Tabula-Rasa: a Modular Reinforcement Learning Approach for Physically Embedded 3D Sokoban

Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior

Learning Reward Function with Matching Network for Mapless Navigation

Learning of Long-Horizon Sparse-Reward Robotic Manipulator Tasks With Base Controllers

Navigation of Mobile Robots Based on Deep Reinforcement Learning: Reward Function Optimization and Knowledge Transfer

A Unified Approach to Multi-task Legged Navigation: Temporal Logic Meets Reinforcement Learning

Learning Navigation Policies for Mobile Robots in Deep Reinforcement Learning with Random Network Distillation