Further extensions on the successive approximation method for hierarchical optimal control problems and its application to learning

Getachew K. Befekadu

2024-11-24

Abstract: In this paper, further extensions of the results of the paper "A successive approximation method in functional spaces for hierarchical optimal control problems and its application to learning, <a class="link-https" data-arxiv-id="2410.20617" href="https://arxiv.org/abs/2410.20617">arXiv:2410.20617</a> [math.OC], 2024" concerning a class of learning problems of point estimation for modeling high-dimensional nonlinear functions are given. In particular, we present two viable extensions within the nested algorithm of the successive approximation method for the hierarchical optimal control problem that provide better convergence properties and computational efficiency, ultimately leading to an optimal parameter estimate. The first extension is mainly concerned with the convergence property of the steps in which the two agents, i.e., the "leader" and the "follower," update their admissible control strategies: we introduce augmented Hamiltonians for both agents and reformulate the admissible control updating steps as sub-problems within the nested algorithm of the hierarchical optimal control problem, which essentially provides better convergence properties. The second extension is concerned with the computational efficiency of the same updating steps: we introduce an intermediate state variable for each agent and embed the intermediate states within the optimal control problems of the "leader" and the "follower," respectively, which allows the admissible control updating steps to be fully and efficiently time-parallelized within the nested algorithm of the hierarchical optimal control problem.

Optimization and Control

What problem does this paper attempt to address?

The problem this paper addresses is point-estimation learning within a hierarchical optimal control problem, in particular for modeling high-dimensional nonlinear functions. Specifically, the paper aims to achieve better convergence and computational efficiency by extending the successive approximation method. The two extensions are:

1. **Improving Convergence**:
   - Augmented Hamiltonians are introduced to improve convergence in the steps where the "leader" and the "follower" update their admissible control strategies.
   - The admissible control update steps are reformulated as sub-problems within the nested algorithm, which improves convergence.
2. **Improving Computational Efficiency**:
   - Intermediate state variables are introduced to improve computational efficiency in the steps where the "leader" and the "follower" update their admissible control strategies.
   - These intermediate state variables are embedded into the optimal control problems of the "leader" and the "follower", so that the admissible control update steps can be fully time-parallelized within the nested algorithm.

### Formula Summary

1. **Augmented Hamiltonians**:
   - For the "follower", the augmented Hamiltonian is defined as:
     \[ \tilde{H}_2(\theta_{u_2}, p_2, \bar{u}_2, u_2) = H_2(\theta_{u_2}, p_2, u_2)+\frac{\gamma_2}{2}\left\|\frac{\partial H_2(\theta_{u_2}, p_2, u_2)}{\partial p_2}-\frac{\partial H_2(\theta_{\bar{u}_2}, p_2, \bar{u}_2)}{\partial p_2}\right\|^2+\frac{\gamma_2}{2}\left\|\frac{\partial H_2(\theta_{u_2}, p_2, u_2)}{\partial \theta_{u_2}}-\frac{\partial H_2(\theta_{\bar{u}_2}, p_2, \bar{u}_2)}{\partial \theta_{\bar{u}_2}}\right\|^2 \]
   - For the "leader", the augmented Hamiltonian is defined as:
     \[ \tilde{H}_1(\theta_{u_1}, p_1, \bar{u}_1, u_1) = H_1(\theta_{u_1}, p_1, u_1)+\frac{\gamma_1}{2}\left\|\frac{\partial H_1(\theta_{u_1}, p_1, u_1)}{\partial p_1}-\frac{\partial H_1(\theta_{\bar{u}_1}, p_1, \bar{u}_1)}{\partial p_1}\right\|^2+\frac{\gamma_1}{2}\left\|\frac{\partial H_1(\theta_{u_1}, p_1, u_1)}{\partial \theta_{u_1}}-\frac{\partial H_1(\theta_{\bar{u}_1}, p_1, \bar{u}_1)}{\partial \theta_{\bar{u}_1}}\right\|^2 \]
2. **Intermediate State Variables**:
   - For the "follower", the intermediate state variable is defined as: \[ m^2_{u_2}(t_k)=\fra
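
To make the augmented Hamiltonian for one agent concrete, the following is a minimal numerical sketch at a single time instant. It is illustrative only: the toy dynamics `dynamics`, the form `H = p . f(theta, u) - 0.5 * ||u||^2`, and all function and variable names are assumptions introduced here, not taken from the paper. Only the structure of the two penalty terms follows the formula in the summary. The state under the reference control, `theta_ubar`, is passed explicitly because the penalty terms evaluate the Hamiltonian gradients there.

```python
import numpy as np


def dynamics(theta, u):
    # Hypothetical toy dynamics f(theta, u) = -theta^3 + u (elementwise).
    return -theta**3 + u


def hamiltonian(theta, p, u):
    # Assumed Hamiltonian H = p . f(theta, u) - 0.5 * ||u||^2 (not from the paper).
    return float(p @ dynamics(theta, u) - 0.5 * (u @ u))


def grad_p(theta, p, u):
    # dH/dp = f(theta, u) for the assumed Hamiltonian.
    return dynamics(theta, u)


def grad_theta(theta, p, u):
    # dH/dtheta = -3 * theta^2 * p (elementwise) for the assumed Hamiltonian.
    return -3.0 * theta**2 * p


def augmented_hamiltonian(theta_u, theta_ubar, p, u_bar, u, gamma):
    # H plus gamma/2 times the squared gradient mismatches between the
    # candidate pair (theta_u, u) and the reference pair (theta_ubar, u_bar).
    dp = grad_p(theta_u, p, u) - grad_p(theta_ubar, p, u_bar)
    dth = grad_theta(theta_u, p, u) - grad_theta(theta_ubar, p, u_bar)
    penalty = 0.5 * gamma * float(dp @ dp) + 0.5 * gamma * float(dth @ dth)
    return hamiltonian(theta_u, p, u) + penalty


# Example values (arbitrary): same state, different candidate and reference controls.
theta = np.array([1.0, 2.0])
p = np.array([1.0, 1.0])
u = np.array([0.0, 0.0])
u_bar = np.array([1.0, 1.0])

h_plain = hamiltonian(theta, p, u)
h_aug = augmented_hamiltonian(theta, theta, p, u_bar, u, gamma=0.5)
```

When the candidate control and state coincide with the reference pair, both penalty terms vanish and the augmented value reduces to the plain Hamiltonian. Otherwise the penalty grows with `gamma` times the squared gradient mismatch.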

A successive approximation method in functional spaces for hierarchical optimal control problems and its application to learning

Application of an adaptive model hierarchy to parametrized optimal control problems

Human-in-the-loop Distributed Cooperative Tracking Control with Applications to Autonomous Ground Vehicles: A Data-Driven Mixed Iteration Approach

On the hierarchical optimal control of a chain of distributed systems

Model-free Adaptive Dynamic Programming for Optimal Control of Discrete-time Affine Nonlinear System

Approximating optimal feedback controllers of finite horizon control problems using hierarchical tensor formats

A Hierarchical Distributed Data-Driven Adaptive Learning Control for Nonaffine Nonlinear MASs

Sparse successive approximation for nonlinear H2 and H∞ optimal control problems under residual errors

Data-based Control, Optimization, Modeling and Applications

Towards optimal hierarchical training of neural networks

RLOC: Neurobiologically Inspired Hierarchical Reinforcement Learning Algorithm for Continuous Control of Nonlinear Dynamical Systems

Data-Driven Near-Optimal Control of Nonlinear Systems Over Finite Horizon

Optimal Control and Filtering for Hierarchical Decision Problems with $H_{\infty }$ Constraint based on Stackelberg Strategy

Reinforcement Control with Hierarchical Backpropagated Adaptive Critics

Improved Hierarchical ADMM for Nonconvex Cooperative Distributed Model Predictive Control

Iterative Learning Controllers for Discrete-Time Large-Scale Systems to Track Trajectories with Distinct Magnitudes.

Adaptive Optimal Control of Nonlinear Systems with Multiple Time-scale Eligibility Traces

A Neural Network Approach for Stochastic Optimal Control

Adaptive dynamic programming-based hierarchical decision-making of non-affine systems

A Parallel Framework of Adaptive Dynamic Programming Algorithm with Off-Policy Learning.