Quantified advantage of discontinuous weight selection in approximations with deep neural networks

Dmitry Yarotsky
DOI: https://doi.org/10.48550/arXiv.1705.01365
2017-05-03
Neural and Evolutionary Computing
Abstract:We consider approximations of 1D Lipschitz functions by deep ReLU networks of a fixed width. We prove that without the assumption of continuous weight selection the uniform approximation error is lower than with this assumption at least by a factor logarithmic in the size of the network.
What problem does this paper attempt to address?