Accelerated primal-dual methods with enlarged step sizes and operator learning for nonsmooth optimal control problems

Yongcun Song,Xiaoming Yuan,Hangrui Yue
2023-07-26
Abstract:We consider a general class of nonsmooth optimal control problems with partial differential equation (PDE) constraints, which are very challenging due to its nonsmooth objective functionals and the resulting high-dimensional and ill-conditioned systems after discretization. We focus on the application of a primal-dual method, with which different types of variables can be treated individually and thus its main computation at each iteration only requires solving two PDEs. Our target is to accelerate the primal-dual method with either larger step sizes or operator learning techniques. For the accelerated primal-dual method with larger step sizes, its convergence can be still proved rigorously while it numerically accelerates the original primal-dual method in a simple and universal way. For the operator learning acceleration, we construct deep neural network surrogate models for the involved PDEs. Once a neural operator is learned, solving a PDE requires only a forward pass of the neural network, and the computational cost is thus substantially reduced. The accelerated primal-dual method with operator learning is mesh-free, numerically efficient, and scalable to different types of PDEs. The acceleration effectiveness of these two techniques is promisingly validated by some preliminary numerical results.
Optimization and Control,Machine Learning,Numerical Analysis
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper primarily proposes an improved primal-dual method for nonsmooth optimal control problems constrained by partial differential equations (PDEs). These types of problems are numerically challenging due to their nonsmooth objective functions, which, when discretized, result in high-dimensional and ill-conditioned systems. The focus of the paper is on accelerating the primal-dual method through the following two approaches: 1. **Increasing Step Size**: Enhancing the numerical performance of the primal-dual method by appropriately increasing the step size parameter. 2. **Operator Learning Acceleration**: Utilizing deep neural networks to construct surrogate models for PDEs, thereby significantly reducing the computational cost required to solve the PDEs. Specifically: - **Method of Increasing Step Size**: By adjusting the step size parameter in the primal-dual method, the efficiency of the algorithm can be significantly improved while ensuring convergence. - **Method of Operator Learning**: By training neural networks to approximate the solution operator of the PDE, each PDE solution requires only a single forward propagation, greatly reducing computational costs. This method not only avoids the need for mesh processing but is also applicable to different types of PDE problems. ### Main Contributions - Proposed an acceleration strategy based on the primal-dual method, improving algorithm efficiency by increasing the step size or using operator learning techniques. - Theoretically proved the convergence of the primal-dual method under increased step size conditions and demonstrated how deep neural networks can be used to accelerate the solution process. - Verified the effectiveness of these two acceleration techniques through preliminary numerical experiments.