Mean-field limits of trained weights in deep learning: A dynamical systems perspective

TitleMean-field limits of trained weights in deep learning: A dynamical systems perspective
Publication TypeJournal Article
Year of Publication2022
AuthorsSmirnov, A, Hamzi, B, Owhadi, H
JournalDolomites Research Notes on Approximation
Volume15
Issue3
Pagination125-145
Date Published10/2022
PublisherPadova University Press
Place PublishedPadova, IT
ISSN Number2035-6803
Abstract

Training a residual neural network with L2 regularization on weights and biases is equivalent to minim- izing a discrete least action principle and to controlling a discrete Hamiltonian system representing the propagation of input data across layers. The kernel/feature map analysis of this Hamiltonian system suggests a mean-field limit for trained weights and biases as the number of data points goes to infinity. The purpose of this paper is to investigate this mean-field limit and illustrate its existence through numerical experiments and analysis (for simple kernels).

URLhttps://drna.padovauniversitypress.it/2022/3/12
DOI10.14658/pupj-drna-2022-3-12