Stochastic Kinematic Optimal Control on SO(3)

Xi Wang, Xiaoyi Wang, Victor Solo
2024-12-11
Abstract: In this paper, we develop a novel method for deriving a global optimal control strategy for stochastic attitude kinematics on the special orthogonal group SO(3). We first introduce a stochastic Lie-Hamilton-Jacobi-Bellman (SL-HJB) equation on SO(3), which theoretically provides an optimality condition for the global optimal control strategy of the stochastic attitude kinematics. Then we propose a novel numerical method, the Successive Wigner-Galerkin Approximation (SWGA) method, to solve the SL-HJB equation on SO(3). The SWGA method leverages the Wigner-D functions to represent the Galerkin solution of the SL-HJB equation in a policy iteration framework, providing a computationally efficient approach to derive a global optimal control strategy for systems on SO(3). We demonstrate the effectiveness of the SWGA method through numerical simulation on stochastic attitude stabilization.
Optimization and Control
What problem does this paper attempt to address?
This paper develops a globally optimal control strategy for stochastic attitude kinematics on the special orthogonal group SO(3), addressing limitations of existing approaches to stochastic optimal control on SO(3). For example, traditional Euclidean stochastic optimal control methods produce only locally optimal strategies, and extending such a strategy to all of SO(3) leads to singularity or ambiguity problems. The main contributions of the paper are:
1. Deriving the stochastic Lie-Hamilton-Jacobi-Bellman (SL-HJB) equation on SO(3), which provides a theoretical optimality condition for the globally optimal control strategy.
2. Proposing a new numerical method for solving the SL-HJB equation, the Successive Wigner-Galerkin Approximation (SWGA) method. It represents the Galerkin solution with Wigner-D functions inside a policy iteration framework, giving a computationally efficient way to derive the globally optimal control strategy for systems on SO(3).
3. Demonstrating the effectiveness of the proposed method on stochastic attitude stabilization through numerical simulation.
The paper focuses on attitude kinematics described by the Stratonovich stochastic differential equation \[ dR(t) = R(t)S(u(t))\,dt+\sum_{k = 1}^{3}R(t)S(\sigma_k)\circ dW_k(t) \] where \(R(t)\in SO(3)\) is the state variable, representing the attitude of a rigid body in three-dimensional space; \(u(t)\in\mathbb{R}^3\) is the control input, corresponding to the angular velocity of the rigid body; \(S(\cdot)\) maps a vector in \(\mathbb{R}^3\) to the corresponding \(3\times 3\) skew-symmetric matrix; \(\sigma_k\in\mathbb{R}^3\) are independent vectors; \(W_k(t)\) are independent standard Brownian motions; and \(\circ\) denotes Stratonovich integration. The noise term is written here as \(R(t)S(\sigma_k)\) so that every term on the right-hand side is a \(3\times 3\) matrix tangent to SO(3) at \(R(t)\).
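To make the kinematics concrete, the following is a minimal simulation sketch, not the paper's method. It assumes the noise enters through the tangent space as \(R(t)S(\sigma_k)\circ dW_k(t)\), and it uses a geometric Euler-Maruyama step, \(R_{n+1} = R_n\exp(S(u\,\Delta t + \sum_k \sigma_k \Delta W_k))\), a standard Lie-group integrator chosen here because it keeps every iterate on SO(3). The names `hat`, `expm_so3`, and `simulate` are hypothetical, and the control is left as a user-supplied feedback function.

```python
import math
import random

def hat(w):
    """Map w in R^3 to the skew-symmetric matrix S(w) in so(3)."""
    x, y, z = w
    return [[0.0, -z, y], [z, 0.0, -x], [-y, x, 0.0]]

def matmul(A, B):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def expm_so3(w):
    """Rodrigues' formula for the matrix exponential exp(S(w))."""
    theta = math.sqrt(sum(c * c for c in w))
    K = hat(w)
    K2 = matmul(K, K)
    if theta < 1e-8:
        a, b = 1.0, 0.5  # small-angle limits of sin(t)/t and (1-cos(t))/t^2
    else:
        a = math.sin(theta) / theta
        b = (1.0 - math.cos(theta)) / theta ** 2
    return [[(1.0 if i == j else 0.0) + a * K[i][j] + b * K2[i][j]
             for j in range(3)] for i in range(3)]

def simulate(R0, control, sigmas, dt, steps, rng):
    """Geometric Euler-Maruyama sketch (assumed scheme, not from the paper).

    control: hypothetical feedback law mapping R to u in R^3.
    sigmas:  list of noise direction vectors sigma_k in R^3.
    """
    R = [row[:] for row in R0]
    sqrt_dt = math.sqrt(dt)
    for _ in range(steps):
        u = control(R)
        dW = [rng.gauss(0.0, sqrt_dt) for _ in sigmas]  # Brownian increments
        xi = [u[i] * dt + sum(s[i] * dw for s, dw in zip(sigmas, dW))
              for i in range(3)]
        R = matmul(R, expm_so3(xi))  # right-multiply: stays on SO(3)
    return R

# Usage: uncontrolled motion driven by isotropic noise of strength 0.1.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
sigmas = [[0.1, 0.0, 0.0], [0.0, 0.1, 0.0], [0.0, 0.0, 0.1]]
R_end = simulate(identity, lambda R: [0.0, 0.0, 0.0], sigmas,
                 dt=0.01, steps=200, rng=random.Random(0))
```

Because each update multiplies by an exact rotation, the state stays orthogonal up to floating-point round-off, which a plain Euclidean Euler step applied to the matrix entries would not guarantee.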
To find the optimal control strategy \(u^*(R(t))\), one minimizes the expected cost functional \[ J(u)=\mathbb{E}\left[\int_{0}^{t_0}\left(l(R(t))+\|u(t)\|^2_W\right)dt\right] \] where \(t_0\in\mathbb{R}\) is a fixed time horizon, \(\|u\|^2_W = u^TWu\) is the control cost weighted by the matrix \(W\), and \(l(R)\) is the running cost, which is non-negative but not necessarily quadratic. The paper derives the SL-HJB equation as the optimality condition for this problem, solves it numerically with the SWGA method, and verifies the method through numerical simulation of stochastic attitude stabilization.
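As an illustration of how the cost functional could be evaluated numerically, the sketch below approximates the time integral with a left Riemann sum and the expectation with a sample average over simulated paths. The running cost \(l(R) = \operatorname{tr}(I - R)\) is a hypothetical choice used only for this example (it is non-negative on SO(3)), and the function names are likewise assumptions, not taken from the paper.

```python
def quad_form(u, W):
    """Control cost u^T W u for u in R^3 and a 3x3 weight matrix W."""
    return sum(u[i] * W[i][j] * u[j] for i in range(3) for j in range(3))

def trace_cost(R):
    """Hypothetical running cost l(R) = tr(I - R) = 3 - tr(R) >= 0 on SO(3)."""
    return 3.0 - (R[0][0] + R[1][1] + R[2][2])

def path_cost(Rs, us, W, dt, running_cost=trace_cost):
    """Left Riemann sum of l(R(t)) + ||u(t)||_W^2 along one sampled path.

    Rs, us: attitudes and controls at times 0, dt, 2*dt, ...
    """
    return sum((running_cost(R) + quad_form(u, W)) * dt
               for R, u in zip(Rs, us))

def expected_cost(paths, W, dt, running_cost=trace_cost):
    """Monte Carlo estimate of J(u): average path cost over sample paths."""
    costs = [path_cost(Rs, us, W, dt, running_cost) for Rs, us in paths]
    return sum(costs) / len(costs)

# Usage: two toy paths of 10 steps each with identity weight matrix.
W = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
half_turn = [[-1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 1.0]]
path_a = ([identity] * 10, [[1.0, 0.0, 0.0]] * 10)   # at target, unit control
path_b = ([half_turn] * 10, [[0.0, 0.0, 0.0]] * 10)  # rotated by pi, no control
J_hat = expected_cost([path_a, path_b], W, dt=0.1)
```

In practice the paths would come from simulating the controlled kinematics under a candidate feedback law, and the estimate would be compared across laws; the toy paths here only exercise the arithmetic.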