Neural Q-learning for solving elliptic PDEs
Deqing Jiang, Samuel N. Cohen, Justin A. Sirignano
DOI: https://doi.org/10.48550/arXiv.2203.17128
Abstract: Solving high-dimensional partial differential equations (PDEs) is a major challenge in scientific computing. We develop a new numerical method for solving elliptic-type PDEs by adapting the Q-learning algorithm in reinforcement learning. Our “Q-PDE” algorithm is mesh-free and therefore has the potential to overcome the curse of dimensionality. Using a neural tangent kernel (NTK) approach, we prove that the neural network approximator for the PDE solution, trained with the Q-PDE algorithm, converges to the trajectory of an infinite-dimensional ordinary differential equation (ODE) as the number of hidden units → ∞. For monotone PDEs (i.e. those given by monotone operators, which may be nonlinear), despite the lack of a spectral gap in the NTK, we then prove that the limit neural network, which satisfies the infinite-dimensional ODE, converges in L^2 to the PDE solution as the training time → ∞. More generally, we can prove that any fixed point of the wide-network limit for the Q-PDE algorithm is a solution of the PDE (not necessarily under the monotone condition). The numerical performance of the Q-PDE algorithm is studied for several elliptic PDEs.

[...] this approach to study the performance of the biased gradient flow as a training algorithm, and show that the wide-network limit satisfies an infinite-dimensional ordinary differential equation (ODE). We apply our Q-PDE approach to monotone PDEs, that is, PDEs whose differential operator satisfies a strong monotonicity condition. Monotone PDEs arise in various applications, particularly in stochastic modeling: the generators of ergodic stochastic processes are monotone (when evaluated with their stationary distributions), which suggests a variety of possible applications of our approach. Further, the subdifferentials of convex functionals (i.e. of maps from functions to the real line) are monotone; this suggests that monotone PDEs may be a particularly well-suited class of equations for gradient methods, as they correspond to (a generalization of) minimizations of convex functionals. We will see that, given this monotonicity assumption, we can prove that the limit Q-PDE algorithm will converge (strongly in L^2) to the solution of the monotone PDE. More generally, we can prove that any fixed point of the wide-network limit for the Q-PDE algorithm is a solution of the PDE (not necessarily under the monotone condition).
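
For orientation, a strong monotonicity condition of the kind referred to above is commonly written as follows. This is a sketch under assumptions, not a statement quoted from the paper: the symbols \mathcal{L} (the differential operator), \gamma (the monotonicity constant) and the choice of the L^2 inner product are introduced here for illustration, and the sign convention shown is the dissipative one that fits generators of ergodic processes. The convex-analysis convention for subdifferentials uses the opposite sign.

```latex
% Assumed form of strong monotonicity (illustrative notation, not taken from the source):
% there exists a constant \gamma > 0 such that, for all admissible functions u and v,
\[
  \langle \mathcal{L}u - \mathcal{L}v,\; u - v \rangle_{L^2}
  \;\le\; -\gamma \, \| u - v \|_{L^2}^{2} .
\]
% Opposite-sign convention, as used for the subdifferential \partial F of a convex functional F:
\[
  \langle p - q,\; u - v \rangle \;\ge\; 0
  \qquad \text{for all } p \in \partial F(u),\; q \in \partial F(v) .
\]
```

Under either convention, the condition controls the inner product of the difference of operator values against the difference of arguments, which is the kind of structure that allows a gradient-type flow to contract toward a unique solution.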
Mathematics, Computer Science