A Q-learning approach to the continuous control problem of robot inverted pendulum balancing

Mohammad Safeea,Pedro Neto
DOI: https://doi.org/10.1016/j.iswa.2023.200313
2023-12-05
Abstract:This study evaluates the application of a discrete action space reinforcement learning method (Q-learning) to the continuous control problem of robot inverted pendulum balancing. To speed up the learning process and to overcome technical difficulties related to the direct learning on the real robotic system, the learning phase is performed in simulation environment. A mathematical model of the system dynamics is implemented, deduced by curve fitting on data acquired from the real system. The proposed approach demonstrated feasible, featuring its application on a real world robot that learned to balance an inverted pendulum. This study also reinforces and demonstrates the importance of an accurate representation of the physical world in simulation to achieve a more efficient implementation of reinforcement learning algorithms in real world, even when using a discrete action space algorithm to control a continuous action.
Robotics,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of applying discrete action space reinforcement learning methods (Q-learning) to the continuous control problem of balancing a robotic inverted pendulum. Specifically, the researchers trained the Q-learning algorithm in a simulation environment and applied the learned policy to an actual robotic system to achieve stable control of the inverted pendulum. This approach aims to overcome the technical challenges and time efficiency issues of learning directly on real systems, while also demonstrating that it is possible to effectively control continuous actions using discrete action space algorithms. Additionally, the paper emphasizes the importance of high-precision physical world simulation for achieving efficient reinforcement learning algorithms.