Agile Robotics: Optimal Control, Reinforcement Learning, and Differentiable Simulation

Yunlong Song,Davide Scaramuzza
2024-05-25
Abstract:Control systems are at the core of every real-world robot. They are deployed in an ever-increasing number of applications, ranging from autonomous racing and search-and-rescue missions to industrial inspections and space exploration. To achieve peak performance, certain tasks require pushing the robot to its maximum agility. How can we design control algorithms that enhance the agility of autonomous robots and maintain robustness against unforeseen disturbances? This paper addresses this question by leveraging fundamental principles in optimal control, reinforcement learning, and differentiable simulation.
Robotics
What problem does this paper attempt to address?
The paper primarily aims to address several key issues in the field of robotic control, particularly in enhancing robot agility and handling unknown disturbances. Specifically, the authors design control algorithms that can enhance the agility of autonomous robots and maintain robustness against unexpected disturbances by combining three methods: Optimal Control, Reinforcement Learning, and Differentiable Simulation. The paper explores the following three research questions: 1. **The fundamental differences between Reinforcement Learning and Optimal Control**: By comparing the differences in optimization strategies and objectives between the two methods, the study investigates the advantages of Reinforcement Learning over Optimal Control. 2. **The combination of Reinforcement Learning and Optimal Control**: A framework is proposed that combines the strengths of Reinforcement Learning with Optimal Control to tackle high-difficulty robotic control tasks. 3. **Using Differentiable Simulation for policy training**: It demonstrates how to train control policies through Differentiable Simulation to achieve faster convergence and a more stable training process. Through these studies, the authors aim to develop an advanced control framework that integrates the precision and safety of Optimal Control with the adaptability and robustness of Reinforcement Learning, thereby achieving a higher level of robotic control performance.