Abstract:Hippocampal pyramidal cells and interneurons play a key role in spatial navigation. In goal-directed behavior associated with rewards, the spatial firing pattern of pyramidal cells is modulated by the animal's moving direction toward a reward, with a dependence on auditory, olfactory, and somatosensory stimuli for head orientation. Additionally, interneurons in the CA1 region of the hippocampus monosynaptically connected to CA1 pyramidal cells are modulated by a complex set of interacting brain regions related to reward and recall. The computational method of reinforcement learning (RL) has been widely used to investigate spatial navigation, which in turn has been increasingly used to study rodent learning associated with the reward. The rewards in RL are used for discovering a desired behavior through the integration of two streams of neural activity: trial-and-error interactions with the external environment to achieve a goal, and the intrinsic motivation primarily driven by brain reward system to accelerate learning. Recognizing the potential benefit of the neural representation of this reward design for novel RL architectures, we propose a RL algorithm based on [Formula: see text]-learning with a perspective on biomimetics (neuro-inspired RL) to decode rodent movement trajectories. The reward function, inspired by the neuronal information processing uncovered in the hippocampus, combines the preferred direction of pyramidal cell firing as the extrinsic reward signal with the coupling between pyramidal cell-interneuron pairs as the intrinsic reward signal. Our experimental results demonstrate that the neuro-inspired RL, with a combined use of extrinsic and intrinsic rewards, outperforms other spatial decoding algorithms, including RL methods that use a single reward function. The new RL algorithm could help accelerate learning convergence rates and improve the prediction accuracy for moving trajectories.

Reward Signal Design for Autonomous Racing

High-speed Autonomous Racing using Trajectory-aided Deep Reinforcement Learning

Racing Towards Reinforcement Learning based control of an Autonomous Formula SAE Car

Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data

Deep Reinforcement Learning for Local Path Following of an Autonomous Formula SAE Vehicle

Assisted Robust Reward Design

Reaching the Limit in Autonomous Racing: Optimal Control versus Reinforcement Learning

Model-based Reinforcement Learning from Signal Temporal Logic Specifications

Autonomous Racing using a Hybrid Imitation-Reinforcement Learning Architecture

Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning

Residual Policy Learning Facilitates Efficient Model-Free Autonomous Racing

HGRL: Human-Driving-Data Guided Reinforcement Learning for Autonomous Driving

Accelerated Robot Learning via Human Brain Signals

Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing

Constrained Residual Race: an Efficient Hybrid Controller for Autonomous Racing

Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior

Exploring the design of reward functions in deep reinforcement learning-based vehicle velocity control algorithms

A LiDAR-based approach to autonomous racing with model-free reinforcement learning

Self-Driving Car Racing: Application of Deep Reinforcement Learning

REBEL: A Regularization-Based Solution for Reward Overoptimization in Robotic Reinforcement Learning from Human Feedback

Accelerated Inverse Reinforcement Learning with Randomly Pre-sampled Policies for Autonomous Driving Reward Design.