Laplacian Smoothing Gradient Descent

Osher Stanley,Wang Bao,Yin Penghang,Luo Xiyang,Barekat Farzin,Pham Minh,Lin Alex
DOI: https://doi.org/10.1007/s40687-022-00351-1
2022-01-01
Research in the Mathematical Sciences
Abstract:We propose a class of very simple modifications of gradient descent and stochastic gradient descent leveraging Laplacian smoothing. We show that when applied to a large variety of machine learning problems, ranging from logistic regression to deep neural nets, the proposed surrogates can dramatically reduce the variance, allow to take a larger step size, and improve the generalization accuracy. The methods only involve multiplying the usual (stochastic) gradient by the inverse of a positive definitive matrix (which can be computed efficiently by FFT) with a low condition number coming from a one-dimensional discrete Laplacian or its high-order generalizations. Given any vector, e.g., gradient vector, Laplacian smoothing preserves the mean and increases the smallest component and decreases the largest component. Moreover, we show that optimization algorithms with these surrogates converge uniformly in the discrete Sobolev $$H_\sigma ^p$$ sense and reduce the optimality gap for convex optimization problems. The code is available at: https://github.com/BaoWangMath/LaplacianSmoothing-GradientDescent .
What problem does this paper attempt to address?