Minimax control of ambiguous linear stochastic systems using the Wasserstein metric

Kihyun Kim,Insoon Yang
DOI: https://doi.org/10.48550/arXiv.2003.13258
2020-03-30
Abstract:In this paper, we propose a minimax linear-quadratic control method to address the issue of inaccurate distribution information in practical stochastic systems. To construct a control policy that is robust against errors in an empirical distribution of uncertainty, our method is to adopt an adversary, which selects the worst-case distribution. To systematically adjust the conservativeness of our method, the opponent receives a penalty proportional to the amount, measured with the Wasserstein metric, of deviation from the empirical distribution. In the finite-horizon case, using a Riccati equation, we derive a closed-form expression of the unique optimal policy and the opponent's policy that generates the worst-case distribution. This result is then extended to the infinite-horizon setting by identifying conditions under which the Riccati recursion converges to the unique positive semi-definite solution to an associated algebraic Riccati equation (ARE). The resulting optimal policy is shown to stabilize the expected value of the system state under the worst-case distribution. We also discuss that our method can be interpreted as a distributional generalization of the $H_\infty$-method.
Systems and Control,Optimization and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the decline in control performance due to inaccurate distribution information in actual stochastic systems. Specifically, the paper focuses on how to design a robust control strategy to meet the challenges brought by this uncertainty in the presence of uncertainty or distribution information errors. To achieve this goal, the author proposes a min - max linear quadratic control method based on the Wasserstein distance. By introducing a hypothesized adversary to select the worst - case distribution and penalizing the degree of deviation from the empirical distribution, the conservatism of the control strategy is adjusted. This method can not only provide a closed - form optimal strategy expression, but also ensure the stability of the closed - loop system in both finite and infinite time horizons. In addition, a theoretical connection is established between this method and the classical H∞ control method, providing a new perspective for connecting stochastic control and robust control.