An Operator Learning Approach to Nonsmooth Optimal Control of Nonlinear PDEs

Yongcun Song,Xiaoming Yuan,Hangrui Yue,Tianyou Zeng
2024-09-22
Abstract:Optimal control problems with nonsmooth objectives and nonlinear partial differential equation (PDE) constraints are challenging, mainly because of the underlying nonsmooth and nonconvex structures and the demanding computational cost for solving multiple high-dimensional and ill-conditioned systems after mesh-based discretization. To mitigate these challenges numerically, we propose an operator learning approach in combination with an effective primal-dual optimization idea which can decouple the treatment of the control and state variables so that each of the resulting iterations only requires solving two PDEs. Our main purpose is to construct neural surrogate models for the involved PDEs by operator learning, allowing the solution of a PDE to be obtained with only a forward pass of the neural network. The resulting algorithmic framework offers a hybrid approach that combines the flexibility and generalization of operator learning with the model-based nature and structure-friendly efficiency of primal-dual-based algorithms. The primal-dual-based operator learning approach offers numerical methods that are mesh-free, easy to implement, and adaptable to various optimal control problems with nonlinear PDEs. It is notable that the neural surrogate models can be reused across iterations and parameter settings, hence computational cost can be substantially alleviated. We validate the effectiveness and efficiency of the primal-dual-based operator learning approach across a range of typical optimal control problems with nonlinear PDEs, including optimal control of stationary Burgers equations, sparse bilinear control of parabolic equations, and optimal control of semilinear parabolic equations.
Optimization and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the non - smooth optimal control problem under the constraint of nonlinear partial differential equations (PDEs). Due to their inherent non - smoothness and non - convexity structures, and the need to solve multiple high - dimensional and ill - conditioned systems after grid - based discretization, such problems are very challenging. Specifically, the problem mentioned in the paper can be formalized as the following optimization problem: \[ \begin{aligned} & \min_{u \in U, y \in Y} J(y, u)=\frac{1}{2}\|y - y_d\|^2_Y+\frac{\alpha}{2}\|u\|^2_U+\theta(u) \\ & \text{s.t. } y = S(u), \end{aligned} \] where: - \(U\) and \(Y\) are Hilbert spaces, corresponding to the domains of the control variable \(u\) and the state variable \(y\), respectively; - \(y_d\in Y\) represents the given target state; - \(\alpha> 0\) is a regularization parameter; - \(S:U\rightarrow Y\) is the solution operator corresponding to the state equation, representing the mapping from the control variable \(u\) to the state variable \(y\); - \(\theta:U\rightarrow\mathbb{R}\cup\{+\infty\}\) is a regularization functional, which is used to impose additional constraints on the control variable \(u\), such as boundedness, sparsity and discontinuity. The main purpose of the paper is to propose a method that combines operator learning and effective primal - dual optimization ideas to alleviate the above challenges. Through this method, the processing of control variables and state variables can be decoupled, so that only two PDEs need to be solved in each iteration. In addition, the paper also constructs a neural network surrogate model to approximate the involved PDEs, so that the solutions of PDEs can be directly obtained through the forward propagation of the neural network, greatly reducing the computational cost. This method is not only applicable to the optimal control problems of various nonlinear PDEs, but also the neural network surrogate model can be reused in different iterations and parameter settings, further reducing the computational burden. The paper verifies the effectiveness and efficiency of this method in a series of typical nonlinear PDE optimal control problems, including the optimal control of the steady - state Burgers equation, the sparse bilinear control of the parabolic equation and the optimal control of the semi - linear parabolic equation.