Abstract: Reinforcement learning (RL) has been successfully applied to a variety of robotics applications, where it outperforms classical methods. However, the safety of RL and its transfer to the real world remain open challenges. A prominent field for tackling these challenges and ensuring the safety of agents during training and execution is safe reinforcement learning (safe RL). Safe RL can be achieved through constrained RL and safe exploration approaches. The former learns the safety constraints over the course of training to achieve safe behavior by the end of training, at the cost of a high number of collisions in the earlier stages of training. The latter offers robust safety by enforcing the safety constraints as hard constraints, which prevents collisions but hinders the exploration of the RL agent, resulting in lower rewards and poor performance. To overcome these drawbacks, we propose a novel safety shield that combines the robustness of optimization-based controllers with the long-horizon prediction capabilities of RL agents, allowing the RL agent to adaptively tune the parameters of the controller. Our approach improves the exploration of RL agents in navigation tasks while minimizing the number of collisions. Experiments in simulation show that our approach outperforms state-of-the-art baselines in the ratio of reached goals to collisions in different challenging environments. The goals-to-collisions ratio metric emphasizes the importance of minimizing the number of collisions while learning to accomplish the task. Our approach reaches more goals than classic safety shields and incurs fewer collisions than constrained RL approaches. Finally, we demonstrate the performance of the proposed method in a real-world experiment.
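
The goals-to-collisions ratio named in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact formula, so this assumes the metric is simply the count of reached goals divided by the count of collisions. The `EpisodeCounts` type, the function name, and the handling of zero collisions are hypothetical choices made for illustration, not the authors' definitions.

```python
from dataclasses import dataclass


@dataclass
class EpisodeCounts:
    """Hypothetical tally of episode outcomes over one evaluation run."""
    goals_reached: int
    collisions: int


def goals_to_collisions_ratio(counts: EpisodeCounts) -> float:
    """Return reached goals divided by collisions (higher is better).

    Assumption (not from the source): with zero collisions the ratio is
    reported as infinity if at least one goal was reached, else 0.0.
    """
    if counts.goals_reached < 0 or counts.collisions < 0:
        raise ValueError("counts must be non-negative")
    if counts.collisions == 0:
        return float("inf") if counts.goals_reached > 0 else 0.0
    return counts.goals_reached / counts.collisions
```

Under this assumed definition, an agent with made-up counts of 90 reached goals and 10 collisions scores 9.0, while one that reaches 95 goals at the cost of 50 collisions scores only 1.9. This is the sense in which the metric rewards avoiding collisions rather than task completion alone.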

Reachability-Based Trajectory Safeguard (RTS): A Safe and Fast Reinforcement Learning Safety Layer for Continuous Control

Train Trajectory Optimization with High-Risk State Space Boundaries: A Safe Reinforcement Learning Approach

Safe Reinforcement Learning Using Black-Box Reachability Analysis

Guaranteed Safe Reachability-based Trajectory Design for a High-Fidelity Model of an Autonomous Passenger Vehicle

Reachability-based Trajectory Design with Neural Implicit Safety Constraints

Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles

Safe Model-Based Reinforcement Learning with an Uncertainty-Aware Reachability Certificate

A Dynamic Safety Shield for Safe and Efficient Reinforcement Learning of Navigation Tasks

Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization

Model-free Neural Lyapunov Control for Safe Robot Navigation

Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach

Iterative Reachability Estimation for Safe Reinforcement Learning

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints

Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles

Ensuring Safety of Learning-Based Motion Planners Using Control Barrier Functions

SAFER: Safe Collision Avoidance using Focused and Efficient Trajectory Search with Reinforcement Learning

An Efficient and Responsive Robot Motion Controller for Safe Human-Robot Collaboration

Searching for Optimal Runtime Assurance via Reachability and Reinforcement Learning

Runtime Safety Assurance Using Reinforcement Learning

REFINE: Reachability-Based Trajectory Design Using Robust Feedback Linearization and Zonotopes