Safe Non-Stochastic Control of Control-Affine Systems: An Online Convex Optimization Approach

Hongyu Zhou, Yichen Song, Vasileios Tzoumas
2023-09-29
Abstract: We study how to safely control nonlinear control-affine systems that are corrupted with bounded non-stochastic noise, i.e., noise that is unknown a priori and that is not necessarily governed by a stochastic model. We focus on safety constraints that take the form of time-varying convex constraints such as collision-avoidance and control-effort constraints. We provide an algorithm with bounded dynamic regret, i.e., bounded suboptimality against an optimal clairvoyant controller that knows the realization of the noise a priori. We are motivated by the future of autonomy where robots will autonomously perform complex tasks despite real-world unpredictable disturbances such as wind gusts. To develop the algorithm, we capture our problem as a sequential game between a controller and an adversary, where the controller plays first, choosing the control input, whereas the adversary plays second, choosing the noise's realization. The controller aims to minimize its cumulative tracking error despite being unable to know the noise's realization a priori. We validate our algorithm in simulated scenarios of (i) an inverted pendulum aiming to stay upright, and (ii) a quadrotor aiming to fly to a goal location through an unknown cluttered environment.
Systems and Control, Robotics
What problem does this paper attempt to address?
This paper addresses how to safely control nonlinear control-affine systems in the presence of bounded non-stochastic noise, that is, noise that is unknown a priori and need not follow a stochastic model. Specifically, it focuses on safety constraints that take the form of time-varying convex constraints, such as collision-avoidance and control-effort constraints. The authors propose an algorithm with bounded dynamic regret: its suboptimality is bounded relative to an optimal clairvoyant controller that knows the noise realization in advance. The motivation is the future of autonomy, where robots must perform complex tasks autonomously despite unpredictable real-world disturbances such as wind gusts. The algorithm is developed by formulating the problem as a sequential game between a controller and an adversary: the controller plays first, choosing the control input, and the adversary plays second, choosing the noise realization. The controller aims to minimize its cumulative tracking error without knowing the noise realization a priori. To validate the algorithm, the authors test it in two simulated scenarios: (i) an inverted pendulum aiming to stay upright, and (ii) a quadrotor flying to a goal location through an unknown cluttered environment. In these experiments, the authors report that the proposed algorithm guarantees safety at all times and matches or outperforms linear feedback controllers, DDPG controllers, geometric controllers, and robust nonlinear model-predictive controllers (R-NMPC) in tracking performance.
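The sequential game described above can be illustrated with a toy sketch. This is not the authors' algorithm. It is a generic projected online gradient descent loop on a hypothetical scalar control-affine system x_{t+1} = f(x_t) + g(x_t) u_t + w_t, where the drift `drift`, the input gain `gain`, the bounds `U_MAX` and `W_MAX`, the sinusoidal `adversary`, and the step size `eta` are all assumptions chosen for illustration. The projection step stands in for a convex control-effort constraint, so every control the controller plays stays in the safe set.

```python
import math

U_MAX = 1.0  # hypothetical control-effort bound: |u| <= U_MAX
W_MAX = 0.1  # hypothetical bound on the noise magnitude


def drift(x):
    """Hypothetical drift term f(x) of the control-affine system."""
    return 0.9 * x


def gain(x):
    """Hypothetical input gain g(x); constant here for simplicity."""
    return 1.0


def project(u):
    """Project a control onto the convex safe set [-U_MAX, U_MAX]."""
    return max(-U_MAX, min(U_MAX, u))


def adversary(t):
    """Bounded noise with no stochastic model (an assumed sinusoid)."""
    return W_MAX * math.sin(0.3 * t)


def play(steps=200, eta=0.25, x0=1.0):
    """Run the controller-versus-adversary game and log controls and costs."""
    x, u = x0, 0.0
    controls, costs = [], []
    for t in range(steps):
        # The controller plays first, committing to u without seeing w.
        controls.append(u)
        # The adversary plays second, choosing the noise realization.
        w = adversary(t)
        x_next = drift(x) + gain(x) * u + w
        # Tracking cost: squared distance of the next state from the origin.
        costs.append(x_next ** 2)
        # The cost is revealed only now; take a gradient step in u and
        # project back onto the safe set before the next round.
        grad = 2.0 * x_next * gain(x)
        u = project(u - eta * grad)
        x = x_next
    return controls, costs


if __name__ == "__main__":
    controls, costs = play()
    print("largest |u| played:", max(abs(u) for u in controls))
    print("cumulative tracking cost:", sum(costs))
```

Because the update is projected before it is played, the control-effort constraint holds at every step regardless of what the adversary does. Bounding the gap to a clairvoyant controller (the dynamic regret) is the harder part that the paper addresses, and this sketch does not attempt it.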