Abstract:Hippocampal pyramidal cells and interneurons play a key role in spatial navigation. In goal-directed behavior associated with rewards, the spatial firing pattern of pyramidal cells is modulated by the animal's moving direction toward a reward, with a dependence on auditory, olfactory, and somatosensory stimuli for head orientation. Additionally, interneurons in the CA1 region of the hippocampus monosynaptically connected to CA1 pyramidal cells are modulated by a complex set of interacting brain regions related to reward and recall. The computational method of reinforcement learning (RL) has been widely used to investigate spatial navigation, which in turn has been increasingly used to study rodent learning associated with the reward. The rewards in RL are used for discovering a desired behavior through the integration of two streams of neural activity: trial-and-error interactions with the external environment to achieve a goal, and the intrinsic motivation primarily driven by brain reward system to accelerate learning. Recognizing the potential benefit of the neural representation of this reward design for novel RL architectures, we propose a RL algorithm based on [Formula: see text]-learning with a perspective on biomimetics (neuro-inspired RL) to decode rodent movement trajectories. The reward function, inspired by the neuronal information processing uncovered in the hippocampus, combines the preferred direction of pyramidal cell firing as the extrinsic reward signal with the coupling between pyramidal cell-interneuron pairs as the intrinsic reward signal. Our experimental results demonstrate that the neuro-inspired RL, with a combined use of extrinsic and intrinsic rewards, outperforms other spatial decoding algorithms, including RL methods that use a single reward function. The new RL algorithm could help accelerate learning convergence rates and improve the prediction accuracy for moving trajectories.

Enhancement of Hippocampal Spatial Decoding Using a Dynamic Q-Learning Method With a Relative Reward Using Theta Phase Precession

Neuro-Inspired Reinforcement Learning to Improve Trajectory Prediction in Reward-Guided Behavior

An Improved Dyna-Q Algorithm Inspired by the Forward Prediction Mechanism in the Rat Brain for Mobile Robot Path Planning

A Brain-Inspired Model of Hippocampal Spatial Cognition Based on a Memory-Replay Mechanism

Rapid learning of predictive maps with STDP and theta phase precession

THETA PHASE PRECESSION ENHANCING MEMORY OF PLACE SEQUENCE IN SINGLE TRIAL LEARNING

Hippocampal representations emerge when training recurrent neural networks on a memory dependent maze navigation task

A Prospective, Case-Controlled Study Evaluating the Use of Enamel Matrix Derivative on Human Buccal Recession Defects: A Human Histologic Examination.

A computational model of learning flexible navigation in a maze by layout-conforming replay of place cells

Vision Enhanced Neuro-Cognitive Structure for Robotic Spatial Cognition

A Navigation Cognitive System Driven by Hierarchical Spiking Neural Network.

HG2P: Hippocampus-inspired High-reward Graph and Model-Free Q-Gradient Penalty for Path Planning and Motion Control

An Entorhinal-Hippocampal Model for Simultaneous Cognitive Map Building.

A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells

Theta Phase Precession Enhance Single Trial Learning in an STDP Network

Learning predictive cognitive maps with spiking neurons during behavior and replays

Multi-Scale Extension in an Entorhinal-Hippocampal Model for Cognitive Map Building.

Proarrhythmia related to a kinetic and dynamic interaction of mexiletine and theophylline.

Prioritized Sweeping Neural DynaQ with Multiple Predecessors, and Hippocampal Replays

Correcting the hebbian mistake: Toward a fully error-driven hippocampus

Reinforcement Learning Navigation for Robots Based on Hippocampus Episode Cognition