Abstract:Reinforcement learning (RL) is one of the popular methods for intelligent control and decision making in the field of robotics recently. The goal of RL is to learn an optimal policy of the agent by interacting with the environment via trail and error. There are two main algorithms for RL problems, including model-free and model-based methods. Model-free RL is driven by historical trajectories and empirical data of the agent to optimize the policy, which needs to take actions in the environment to collect the trajectory data and may cause the damage of the robot during training in the real environment. The main different between model-based and model-free RL is that a model of the transition probability in the interaction environment is employed. Thus the agent can search the optimal policy through internal simulation. However, the model of the transition probability is usually estimated from historical data in a single environment with statistical errors. Therefore, an issue is faced by the agent is that the optimal policy is sensitive to perturbations in the model of the environment which can lead to serious degradation in performance. Robust RL aims to learn a robust optimal policy that accounts for model uncertainty of the transition probability to systematically mitigate the sensitivity of the optimal policy in perturbed environments. In this overview, we begin with an introduction to the algorithms in RL, then focus on the model uncertainty of the transition probability in robust RL. In parallel, we highlight the current research and challenges of robust RL for robot control. To conclude, we describe some research areas in robust RL and look ahead to the future work about robot control in complex environments.

Training a Robust Reinforcement Learning Controller for the Uncertain System Based on Policy Gradient Method

Model-Based Robot Learning Control with Uncertainty Directed Exploration

Robust Adaptive Control for a Small Unmanned Helicopter Using Reinforcement Learning

Deterministic Policy Gradient with Integral Compensator for Robust Quadrotor Control

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

Adaptive Extremum Seeking Controller Via Nonlinear Variable Gain for Uncertainty Model Multirotor

H_∞ Model-free Reinforcement Learning with Robust Stability Guarantee

Model-free Control Design Using Policy Gradient Reinforcement Learning in LPV Framework

Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Robust Control Strategy for Quadrotor Drone Using Reference Model-Based Deep Deterministic Policy Gradient

Robust Adaptive Control for Robotic System with External Disturbance and Guaranteed Parameter Estimation

Control of UAV Quadrotor Using Reinforcement Learning and Robust Controller

Quadrotor Control Using Reinforcement Learning under Wind Disturbance

Robust Control for Affine Nonlinear Systems under the Reinforcement Learning Framework

Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

Policy Gradient Method For Robust Reinforcement Learning

A Model Free Controller Based on Reinforcement Learning for Active Steering System with Uncertainties

An Overview of Robust Reinforcement Learning.

Model-based robust control design and experimental validation of SCARA robot system with uncertainty

Robotic Control in Adversarial and Sparse Reward Environments: A Robust Goal-Conditioned Reinforcement Learning Approach

Robust Reinforcement Learning Control for Quadrotor with Input Delay and Uncertainties