Abstract: This paper presents a novel composite obstacle avoidance control method that adaptively generates safe motion trajectories for autonomous systems. First, system safety is described using forward invariance, and a barrier function is encoded into the cost function so that the obstacle avoidance problem can be formulated as an infinite-horizon optimal control problem. Next, a safe reinforcement learning framework is proposed by combining model-based policy iteration and state-following-based approximation. Using real-time data and extrapolated experience data, this learning design is implemented through an actor-critic structure, in which critic networks are tuned by gradient-descent adaptation and actor networks produce adaptive control policies via gradient projection. Then, system stability and weight convergence are analyzed theoretically using the Lyapunov method. Finally, the proposed learning-based controller is demonstrated on a two-dimensional single-integrator system and a nonlinear unicycle kinematic system. Simulation results show that the system or agent can smoothly reach the target point while keeping a safe distance from each obstacle. Three other avoidance control methods are also simulated to provide side-by-side comparisons and to verify some of the claimed advantages of the present method.

Note to Practitioners: This paper is motivated by the obstacle avoidance problem in real-time navigation of an agent to a target point, which applies to practical autonomous systems such as vehicles and robots. Pre-generative methods and reactive methods have been widely employed to generate safe motion trajectories in environments with obstacles. However, these methods cannot strike a good balance between safety and optimality. In this paper, the obstacle avoidance problem is formulated in the sense of optimal control, and a safe reinforcement learning method is designed to generate safe motion trajectories. This method combines the advantages of model-based policy iteration and state-following-based approximation, in which the former ensures regional optimality while the latter ensures local safety. Based on the proposed adaptive tuning laws, engineers can design learning-based avoidance controllers for environments with static obstacles. In future research, we will address the dynamic avoidance problem against moving obstacles.
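
To make the idea of encoding a barrier function into the cost function concrete, the following is a minimal sketch for the two-dimensional single-integrator case (x_dot = u) with circular static obstacles. It is an illustration under stated assumptions, not the paper's formulation: the reciprocal barrier form, the quadratic state and control terms, the weights `q`, `r`, `k`, and all function names are hypothetical choices, and the infinite-horizon cost is approximated by a finite Riemann sum.

```python
import numpy as np

def barrier(x, center, radius):
    """Assumed reciprocal barrier: positive outside the obstacle and
    unbounded as x approaches the obstacle boundary."""
    h = float(np.sum((x - center) ** 2)) - radius ** 2  # h > 0 means safe
    if h <= 0.0:
        return np.inf  # inside or on the obstacle: infinite penalty
    return 1.0 / h

def running_cost(x, u, target, obstacles, q=1.0, r=1.0, k=1.0):
    """Quadratic distance-to-target and control-effort cost plus one
    barrier penalty per obstacle (weights q, r, k are assumptions)."""
    e = x - target
    cost = q * float(e @ e) + r * float(u @ u)
    for center, radius in obstacles:
        cost += k * barrier(x, center, radius)
    return cost

def rollout_cost(x0, policy, target, obstacles, dt=0.01, steps=1000):
    """Approximate the accumulated cost of a policy on the single
    integrator x_dot = u using forward-Euler integration."""
    x = np.asarray(x0, dtype=float)
    total = 0.0
    for _ in range(steps):
        u = policy(x)
        total += running_cost(x, u, target, obstacles) * dt
        x = x + dt * u
    return total, x

if __name__ == "__main__":
    target = np.zeros(2)
    obstacles = [(np.array([1.0, 1.0]), 0.3)]
    go_to_target = lambda x: -(x - target)  # naive proportional policy
    cost, x_final = rollout_cost([2.0, 0.0], go_to_target, target, obstacles)
    print("accumulated cost:", cost, "final state:", x_final)
```

Because the penalty is unbounded at the obstacle boundary, any trajectory with finite accumulated cost stays outside every obstacle, which is how a cost of this kind ties optimality to safety. The naive policy in the usage example is only a placeholder; the learned actor-critic policy described in the abstract is not reproduced here.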

Safe Reinforcement Learning and Adaptive Optimal Control With Applications to Obstacle Avoidance Problem

Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Fuzzy Adaptive Control-based Real-time Obstacle Avoidance under Uncertain Perturbations

Hybrid Feedback Control Design for Non-Convex Obstacle Avoidance

Safe and efficient collision avoidance control for autonomous vehicles

An Efficient and Responsive Robot Motion Controller for Safe Human-Robot Collaboration

An Efficient Approach for Obstacle Avoidance and Navigation in Robots

Model-Based Safe Reinforcement Learning with Time-Varying State and Control Constraints: An Application to Intelligent Vehicles

Model-Based Safe Reinforcement Learning With Time-Varying Constraints: Applications to Intelligent Vehicles

Constraint‐Oriented Obstacle Avoidance Control for Autonomous Vehicles Without Local Trajectory Replanning

Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles

Robot obstacle avoidance system using deep reinforcement learning

An obstacle avoidance method for robotic arm based on reinforcement learning

Uniform Finite Time Safe Path Tracking Control for Obstacle Avoidance of Autonomous Vehicle Via Barrier Function Approach

A hybrid controller for safe and efficient collision avoidance control

Safe Nonlinear Control Using Robust Neural Lyapunov-Barrier Functions

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments

Ensuring Safety of Learning-Based Motion Planners Using Control Barrier Functions

A bio-inspired kinematic controller for obstacle avoidance during reaching tasks with real robots

Learning-based Model Predictive Control for Safe Exploration and Reinforcement Learning

Reactive Collision Avoidance for Safe Agile Navigation