Maximum Correntropy Based Attention-Gated Reinforcement Learning Designed for Brain Machine Interface.

Hongbao Li, Fang Wang, Qiaosheng Zhang, Shaomin Zhang, Yiwen Wang, Xiaoxiang Zheng, Jose C. Principe
DOI: https://doi.org/10.1109/embc.2016.7591374
2016-01-01
Abstract: Reinforcement learning is an effective approach for brain machine interfaces (BMIs), interpreting the mapping between plastic neural activity and kinematics. Exploring a large state-action space is difficult when a complicated BMI needs to assign credit over both time and space. For BMIs, attention-gated reinforcement learning (AGREL) has been developed to classify multiple actions in spatial credit assignment tasks with better efficiency. However, outliers in the neural signals still make it difficult to interpret the neural-action mapping. We propose an enhanced AGREL algorithm that uses correntropy as its criterion, which is less sensitive to noise. The algorithm is tested on neural data recorded while a monkey performs a trained obstacle avoidance task. The new method converges faster during training and improves the average success rate from 44.63% to 68.79% compared with the original AGREL. The results indicate that combining the correntropy criterion with AGREL reduces the effect of outliers and yields better performance when interpreting the mapping between neural signals and kinematics.
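To illustrate why a correntropy criterion can be less sensitive to outliers than a squared-error criterion, here is a minimal sketch. It is not the update rule from the paper, which the abstract does not give. It assumes the commonly used Gaussian-kernel estimate of correntropy, and the function names, the kernel width `sigma`, and the sample error values are all hypothetical choices made for illustration.

```python
import numpy as np

def gaussian_kernel(e, sigma=1.0):
    # Gaussian kernel on the error; close to 1 for small errors, near 0 for large ones.
    return np.exp(-np.square(e) / (2.0 * sigma ** 2))

def correntropy(errors, sigma=1.0):
    # Sample estimate of correntropy: mean kernel value over the errors.
    # A maximum correntropy criterion would try to make this as large as possible.
    errors = np.asarray(errors, dtype=float)
    return float(np.mean(gaussian_kernel(errors, sigma)))

def mse(errors):
    # Mean squared error, shown for comparison.
    errors = np.asarray(errors, dtype=float)
    return float(np.mean(np.square(errors)))

def mcc_weights(errors, sigma=1.0):
    # The gradient of the kernel w.r.t. the error is proportional to
    # error * kernel(error), so each sample's error is effectively scaled
    # by its kernel value. Large (outlier) errors get a weight near zero.
    errors = np.asarray(errors, dtype=float)
    return gaussian_kernel(errors, sigma)

# Hypothetical prediction errors: four small ones plus one large outlier.
clean = np.array([0.1, -0.2, 0.05, 0.15])
noisy = np.append(clean, 10.0)

print("MSE         clean / with outlier:", mse(clean), mse(noisy))
print("Correntropy clean / with outlier:", correntropy(clean), correntropy(noisy))
print("Per-sample weights with outlier: ", mcc_weights(noisy))
```

In this sketch the single outlier dominates the mean squared error, while the correntropy estimate stays bounded between 0 and 1 and the outlier receives a near-zero weight. The kernel width `sigma` controls what counts as an outlier and would need tuning for real neural data.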