Getachew K. Befekadu
Abstract: In this paper, we give further extensions of the results of the paper "A successive approximation method in functional spaces for hierarchical optimal control problems and its application to learning, <a class="link-https" data-arxiv-id="2410.20617" href="https://arxiv.org/abs/2410.20617">arXiv:2410.20617</a> [math.OC], 2024" concerning a class of learning problems of point estimation for modeling high-dimensional nonlinear functions. In particular, we present two viable extensions within the nested algorithm of the successive approximation method for the hierarchical optimal control problem that provide better convergence properties and computational efficiency, ultimately leading to an optimal parameter estimate. The first extension is mainly concerned with the convergence property of the steps in which the two agents, i.e., the "leader" and the "follower," update their admissible control strategies: we introduce augmented Hamiltonians for both agents and reformulate the admissible control updating steps as sub-problems within the nested algorithm of the hierarchical optimal control problem, which essentially provides better convergence. The second extension is concerned with the computational efficiency of the same steps: we introduce an intermediate state variable for each agent and embed the intermediate states within the optimal control problems of the "leader" and the "follower," respectively, which allows the admissible control updating steps to be efficiently time-parallelized within the nested algorithm of the hierarchical optimal control problem.
What problem does this paper attempt to address?
The paper addresses point-estimation learning formulated as a hierarchical optimal control problem, in particular for modeling high-dimensional nonlinear functions. Specifically, it aims to improve both the convergence and the computational efficiency of the successive approximation method. The two improvements are:
1. **Improving Convergence**:
- Augmented Hamiltonians are introduced for both agents to improve convergence when the "leader" and the "follower" update their admissible control strategies.
- The admissible control update steps are reformulated as sub-problems within the nested algorithm.
2. **Improving Computational Efficiency**:
- An intermediate state variable is introduced for each agent to improve computational efficiency when the "leader" and the "follower" update their admissible control strategies.
- These intermediate states are embedded into the optimal control problems of the "leader" and the "follower", respectively, so that the admissible control update steps can be time-parallelized within the nested algorithm.
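To make the convergence idea in item 1 concrete, here is a minimal Python sketch of a single control update that minimizes an augmented Hamiltonian for one agent. It is not the paper's algorithm. The toy Hamiltonian `H = p*u*theta + 0.5*u**2`, the grid search, the choice to minimize (rather than maximize), and the values of `gamma` are all assumptions made for illustration. The sketch also freezes the state `theta` and co-state `p` at their current values, so both derivative terms are evaluated at the same state, which simplifies the trajectory-dependent penalty in the Formula Summary.

```python
import numpy as np


def hamiltonian(theta, p, u):
    # Hypothetical toy Hamiltonian: dynamics f = u * theta, running cost 0.5 * u^2.
    return p * u * theta + 0.5 * u**2


def dH_dp(theta, p, u):
    # Partial derivative of the toy Hamiltonian with respect to the co-state p.
    return u * theta


def dH_dtheta(theta, p, u):
    # Partial derivative of the toy Hamiltonian with respect to the state theta.
    return p * u


def augmented_hamiltonian(theta, p, u_bar, u, gamma):
    # H plus (gamma / 2) times the squared change in both partial derivatives
    # when the control moves from the previous iterate u_bar to the candidate u.
    pen_p = (dH_dp(theta, p, u) - dH_dp(theta, p, u_bar)) ** 2
    pen_theta = (dH_dtheta(theta, p, u) - dH_dtheta(theta, p, u_bar)) ** 2
    return hamiltonian(theta, p, u) + 0.5 * gamma * (pen_p + pen_theta)


def update_control(theta, p, u_bar, gamma, lo=-5.0, hi=5.0, n=100001):
    # Assumed update rule: pick the control on a uniform grid that minimizes
    # the augmented Hamiltonian at the frozen (theta, p).
    grid = np.linspace(lo, hi, n)
    values = augmented_hamiltonian(theta, p, u_bar, grid, gamma)
    return float(grid[np.argmin(values)])


if __name__ == "__main__":
    theta, p, u_bar = 1.0, 2.0, 0.5
    for gamma in (0.0, 1.0, 10.0):
        print(gamma, update_control(theta, p, u_bar, gamma))
```

With `gamma = 0` the update is the plain Hamiltonian minimizer and may jump far from the previous control `u_bar`. Larger `gamma` keeps the new control closer to `u_bar`, which is the damping effect that the augmented terms are meant to provide.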
### Formula Summary
1. **Augmented Hamiltonians**:
- For the "follower", the augmented Hamiltonian is defined as:
\[
\tilde{H}_2(\theta_{u_2}, p_2, \bar{u}_2, u_2) = H_2(\theta_{u_2}, p_2, u_2)+\frac{\gamma_2}{2}\left\|\frac{\partial H_2(\theta_{u_2}, p_2, u_2)}{\partial p_2}-\frac{\partial H_2(\theta_{\bar{u}_2}, p_2, \bar{u}_2)}{\partial p_2}\right\|^2+\frac{\gamma_2}{2}\left\|\frac{\partial H_2(\theta_{u_2}, p_2, u_2)}{\partial \theta_{u_2}}-\frac{\partial H_2(\theta_{\bar{u}_2}, p_2, \bar{u}_2)}{\partial \theta_{\bar{u}_2}}\right\|^2
\]
- For the "leader", the augmented Hamiltonian is defined as:
\[
\tilde{H}_1(\theta_{u_1}, p_1, \bar{u}_1, u_1) = H_1(\theta_{u_1}, p_1, u_1)+\frac{\gamma_1}{2}\left\|\frac{\partial H_1(\theta_{u_1}, p_1, u_1)}{\partial p_1}-\frac{\partial H_1(\theta_{\bar{u}_1}, p_1, \bar{u}_1)}{\partial p_1}\right\|^2+\frac{\gamma_1}{2}\left\|\frac{\partial H_1(\theta_{u_1}, p_1, u_1)}{\partial \theta_{u_1}}-\frac{\partial H_1(\theta_{\bar{u}_1}, p_1, \bar{u}_1)}{\partial \theta_{\bar{u}_1}}\right\|^2
\]
2. **Intermediate State Variables**:
- For the "follower", the intermediate state variable is defined as:
\[
m^2_{u_2}(t_k)=\fra