Neural ODEs as the deep limit of ResNets with constant weights

Benny Avelin,Kaj Nyström,Kaj Nystrom
DOI: https://doi.org/10.1142/S0219530520400023
IF: 1.9559
2020-05-21
Analysis and Applications
Abstract:In this paper, we prove that, in the deep limit, the stochastic gradient descent on a ResNet type deep neural network, where each layer shares the same weight matrix, converges to the stochastic gradient descent for a Neural ODE and that the corresponding value/loss functions converge. Our result gives, in the context of minimization by stochastic gradient descent, a theoretical foundation for considering Neural ODEs as the deep limit of ResNets. Our proof is based on certain decay estimates for associated Fokker–Planck equations.
mathematics, applied
What problem does this paper attempt to address?