Model Learning for Two-Wheeled Robot Self-Balance Control
Enbo Li,Haibo Feng,Haitao Zhou,Xu Li,Yanwu Zhai,Songyuan Zhang,Yili Fu
DOI: https://doi.org/10.1109/ROBIO49542.2019.8961382
2019-01-01
Abstract:Two-wheeled robots have many advantages over other mobile robots, but they are difficult to self-balance compared with other wheeled robots. Reinforcement learning (RL) is a general framework for sequential decision-making problems. So far, there are many applications of reinforcement learning to solve robot control problems, but most of them are used in simulators because of the large amount of data required. In addition, due to the reality gap, the policy learned in a simulation environment cannot be transferred directly to a real robot. Real robot data often expensive due to the potential damage to the robot. Model-based methods require far fewer robot data than model-free methods, but these methods have the problem of model bias. In this paper, we use a model-based reinforcement learning method to achieve self-balance of a two-wheeled robot. We present a model learning method that can reduce the problem of model bias. Our method combines the simulator and a few real robot data to learn a probabilistic dynamics model of the robot through an iterative way, which requires no expertise and can learn from scratch. Then the control policy is optimized based on the learned model.