Abstract:Hippocampal pyramidal cells and interneurons play a key role in spatial navigation. In goal-directed behavior associated with rewards, the spatial firing pattern of pyramidal cells is modulated by the animal's moving direction toward a reward, with a dependence on auditory, olfactory, and somatosensory stimuli for head orientation. Additionally, interneurons in the CA1 region of the hippocampus monosynaptically connected to CA1 pyramidal cells are modulated by a complex set of interacting brain regions related to reward and recall. The computational method of reinforcement learning (RL) has been widely used to investigate spatial navigation, which in turn has been increasingly used to study rodent learning associated with the reward. The rewards in RL are used for discovering a desired behavior through the integration of two streams of neural activity: trial-and-error interactions with the external environment to achieve a goal, and the intrinsic motivation primarily driven by brain reward system to accelerate learning. Recognizing the potential benefit of the neural representation of this reward design for novel RL architectures, we propose a RL algorithm based on [Formula: see text]-learning with a perspective on biomimetics (neuro-inspired RL) to decode rodent movement trajectories. The reward function, inspired by the neuronal information processing uncovered in the hippocampus, combines the preferred direction of pyramidal cell firing as the extrinsic reward signal with the coupling between pyramidal cell-interneuron pairs as the intrinsic reward signal. Our experimental results demonstrate that the neuro-inspired RL, with a combined use of extrinsic and intrinsic rewards, outperforms other spatial decoding algorithms, including RL methods that use a single reward function. The new RL algorithm could help accelerate learning convergence rates and improve the prediction accuracy for moving trajectories.

A Design of Reward Function in Multi-Target Trajectory Recovery with Deep Reinforcement Learning

A Novel Trajectory Planning Method Based on Trust Region Policy Optimization

Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking

Trajectory Planning for Airborne Radar in Extended Target Tracking Based on Deep Reinforcement Learning

DRL-Based Trajectory Tracking for Motion-Related Modules in Autonomous Driving

Sub-trajectory clustering with deep reinforcement learning

Long-Term Tracking of Evasive Urban Target Based on Intention Inference and Deep Reinforcement Learning

Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior

3DTRIP: A General Framework for 3D Trajectory Recovery Integrated with Prediction.

Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach

Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach

Collaborative Deep Reinforcement Learning for Multi-object Tracking

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning

Deep Reinforcement Learning-Based Rehabilitation Robot Trajectory Planning with Optimized Reward Functions

DeTra: A Unified Model for Object Detection and Trajectory Forecasting

Learning Guidance Rewards with Trajectory-space Smoothing

Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints

Visual Tracking Via Hierarchical Deep Reinforcement Learning

Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking

Online Trajectory Planning Method for Midcourse Guidance Phase Based on Deep Reinforcement Learning