Solving Finite-Horizon HJB for Optimal Control of Continuous-Time Systems

Ziyu Lin, Jingliang Duan, Shengbo Eben Li, Jie Li, Haitong Ma, Qi Sun, Jianyu Chen, Bo Cheng
DOI: https://doi.org/10.1109/icccr49711.2021.9349412
2021-01-01
Abstract: The Hamilton-Jacobi-Bellman (HJB) equation is the sufficient and necessary condition for optimality in the continuous-time optimal control problem (OCP). Unlike the infinite-horizon HJB equation, the finite-horizon HJB equation contains a time-dependent value function whose partial derivative with respect to time is an intractable unknown term. We find that this partial derivative exactly equals the terminal-time utility function, by analyzing the initial-time equivalence between the fixed-time-horizon OCP and the fixed-terminal-time OCP. We also provide a second proof, based on the definition of the partial derivative. This finding allows traditional approximate dynamic programming (ADP) algorithms to be reused to approximate the optimal policy with a parameterized function such as a neural network, thereby solving the continuous-time finite-horizon OCP. The correctness of the finding is verified on a linear quadratic problem.
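
The relation stated in the abstract can be checked numerically on a scalar linear quadratic problem. The sketch below is an illustration under assumptions, not the paper's own experiment: the dynamics dx/dt = a*x + b*u, the utility l(x, u) = q*x^2 + r*u^2, the absence of a terminal cost, and all function and parameter names are hypothetical choices made here. It also assumes the cost-to-go convention V(x, t) = min over u of the integral of l from t to T, under which the relation appears as -dV/dt = l(x*(T), u*(T)); the paper's sign convention may differ (for example, if the derivative is taken with respect to the horizon length).

```python
import numpy as np


def riccati_rhs(p, a, b, q, r):
    # dp/dt for the scalar Riccati equation  -dp/dt = q + 2*a*p - b^2*p^2/r,
    # which follows from the HJB equation with the ansatz V(x, t) = p(t) * x^2.
    return -(q + 2.0 * a * p - (b ** 2) * p ** 2 / r)


def solve_riccati_backward(a, b, q, r, T, n_steps):
    # Integrate p(t) backward in time from p(T) = 0 (no terminal cost, an
    # assumption of this sketch) using classical RK4 with step size -h.
    t = np.linspace(0.0, T, n_steps + 1)
    p = np.zeros(n_steps + 1)
    h = t[1] - t[0]
    for k in range(n_steps, 0, -1):
        k1 = riccati_rhs(p[k], a, b, q, r)
        k2 = riccati_rhs(p[k] - 0.5 * h * k1, a, b, q, r)
        k3 = riccati_rhs(p[k] - 0.5 * h * k2, a, b, q, r)
        k4 = riccati_rhs(p[k] - h * k3, a, b, q, r)
        p[k - 1] = p[k] - h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return t, p


def check_time_derivative(a, b, q, r, T, x0, n_steps=2000):
    # Returns (-dV/dt at (x0, t=0), terminal-time utility on the optimal path).
    t, p = solve_riccati_backward(a, b, q, r, T, n_steps)
    h = t[1] - t[0]

    # Left-hand side: V(x, t) = p(t) x^2, so dV/dt = (dp/dt) x^2.
    neg_dVdt = -riccati_rhs(p[0], a, b, q, r) * x0 ** 2

    # Optimal feedback u* = -(b/r) p(t) x gives dx/dt = (a - b^2 p / r) x,
    # so x*(T) = x0 * exp(integral of the closed-loop gain over [0, T]).
    gain = a - (b ** 2) * p / r
    integral = h * (np.sum(gain) - 0.5 * (gain[0] + gain[-1]))  # trapezoid rule
    x_T = x0 * np.exp(integral)
    u_T = -(b / r) * p[-1] * x_T  # equals 0 because p(T) = 0

    terminal_utility = q * x_T ** 2 + r * u_T ** 2
    return neg_dVdt, terminal_utility


if __name__ == "__main__":
    lhs, rhs = check_time_derivative(a=-0.5, b=1.0, q=2.0, r=0.5, T=1.5, x0=1.3)
    print(f"-dV/dt at t=0: {lhs:.6f}   terminal utility: {rhs:.6f}")
```

For the special case a = 0, b = q = r = 1, the Riccati solution is p(t) = tanh(T - t), and both sides reduce to x0^2 / cosh^2(T - t), which gives a closed-form cross-check of the numerical result.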