Robust Deep Reinforcement Learning for Quadcopter Control

Aditya M. Deshpande,Ali A. Minai,Manish Kumar

DOI: https://doi.org/10.48550/arXiv.2111.03915

2021-11-07

Abstract:Deep reinforcement learning (RL) has made it possible to solve complex robotics problems using neural networks as function approximators. However, the policies trained on stationary environments suffer in terms of generalization when transferred from one environment to another. In this work, we use Robust Markov Decision Processes (RMDP) to train the drone control policy, which combines ideas from Robust Control and RL. It opts for pessimistic optimization to handle potential gaps between policy transfer from one environment to another. The trained control policy is tested on the task of quadcopter positional control. RL agents were trained in a MuJoCo simulator. During testing, different environment parameters (unseen during the training) were used to validate the robustness of the trained policy for transfer from one environment to another. The robust policy outperformed the standard agents in these environments, suggesting that the added robustness increases generality and can adapt to non-stationary environments. Codes: <a class="link-external link-https" href="https://github.com/adipandas/gym_multirotor" rel="external noopener nofollow">this https URL</a>

Robotics,Artificial Intelligence,Machine Learning,Systems and Control,Optimization and Control

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the insufficient generalization ability of the quad - rotor UAV control strategy under different environmental conditions. Specifically, the control strategies trained using traditional reinforcement learning methods will experience a significant performance decline when transferred from one environment to another due to changes in environmental parameters (such as mass, moment of inertia, air resistance or friction, etc.). To overcome this challenge, this paper proposes a method based on Robust Markov Decision Process (RMDP) to train the control strategy of the quad - rotor UAV in order to improve its adaptability and robustness under different environmental conditions. By combining robust control theory and deep reinforcement learning, the paper aims to develop a control strategy that can maintain high performance in non - static environments. In the experiment, the researchers trained RL agents in the MuJoCo simulator and verified the robustness of the trained strategies by changing unseen environmental parameters. The results show that, compared with standard RL agents, the robust strategies perform better in these environments, indicating that the increased robustness improves the generality of the strategies, enabling them to adapt to non - static environments.

Robust Deep Reinforcement Learning for Quadcopter Control

Robust Control Strategy for Quadrotor Drone Using Reference Model-Based Deep Deterministic Policy Gradient

Deterministic Policy Gradient with Integral Compensator for Robust Quadrotor Control

Deep Reinforcement Learning-based Quadcopter Controller: A Practical Approach and Experiments

Control of UAV Quadrotor Using Reinforcement Learning and Robust Controller

Quadrotor motion control using deep reinforcement learning

How to Train Your Quadrotor: A Framework for Consistently Smooth and Responsive Flight Control via Reinforcement Learning

Aggressive and robust low-level control and trajectory tracking for quadrotors with deep reinforcement learning

Learning Stabilization Control of Quadrotor in Near-Ground Setting Using Reinforcement Learning

Neural Internal Model Control: Learning a Robust Control Policy via Predictive Error Feedback

Retro-RL: Reinforcing Nominal Controller With Deep Reinforcement Learning for Tilting-Rotor Drones

Reinforcement Learning for UAV Attitude Control

Quadrotor Control Using Reinforcement Learning under Wind Disturbance

Evaluation of Reinforcement and Deep Learning Algorithms in Controlling Unmanned Aerial Vehicles

Model-assisted Reinforcement Learning of a Quadrotor

End-to-end Reinforcement Learning for Time-Optimal Quadcopter Flight

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

Collision Avoidance and Navigation for a Quadrotor Swarm Using End-to-end Deep Reinforcement Learning

Learning Robust Policies via Interpretable Hamilton-Jacobi Reachability-Guided Disturbances

Developmental Reinforcement Learning of Control Policy of a Quadcopter UAV with Thrust Vectoring Rotors

Fault-tolerant Control for Unmanned Aerial Vehicle Using Deep Reinforcement Learning