A Portable Accelerator of Proximal Policy Optimization for Robots

Weiyi Zhang, Yancao Jiang, Fasih Ud Din Farrukh, Chun Zhang, Xiang Xie
DOI: https://doi.org/10.1109/ICTA53157.2021.9661840
2021-01-01
Abstract: Reinforcement learning has great potential for solving robotic control tasks in different environments. Proximal policy optimization (PPO) is one of the most efficient reinforcement learning algorithms, and it implements three neural networks during training and inference. However, the practical applications of reinforcement learning algorithms in robots are limited by the computational c...