Les Houches lectures on deep learning at large and infinite width

Yasaman Bahri, Boris Hanin, Antonin Brossollet, Vittorio Erba, Christian Keup, Rosalba Pacelli, James B Simon
DOI: https://doi.org/10.1088/1742-5468/ad2dd3
2024-10-31
Journal of Statistical Mechanics: Theory and Experiment
Abstract: These lectures, presented at the 2022 Les Houches Summer School on Statistical Physics and Machine Learning, focus on the infinite-width limit and large-width regime of deep neural networks. Topics covered include the various statistical and dynamical properties of these networks. In particular, the lecturers discuss properties of random deep neural networks; connections between trained deep neural networks, linear models, kernels, and Gaussian processes that arise in the infinite-width limit; and perturbative and non-perturbative treatments of large but finite-width networks, at initialization and after training.
physics, mathematical, mechanics