Train Deep Neural Networks in 40-D Subspaces

Tao Li,Lei Tan,Qinghua Tao,Yipeng Liu,Xiaolin Huang
2021-01-01
Abstract:Although there are massive parameters in deep neural networks, the training can actually proceed in a rather lowdimensional space. By investigating such low-dimensional properties of the training trajectory, we propose a Dynamic Linear Dimensionality Reduction (DLDR), which dramatically reduces the parameter space to a variable subspace of significantly lower dimension. Since there are only a few variables to optimize, second-order methods become applicable. Following this idea, we develop a quasi-Newtonbased algorithm to train these variables obtained by DLDR, rather than the original parameters of neural networks. The experimental results strongly support the dimensionality reduction performance: for many standard neural networks, optimizing over only 40 variables, one can achieve comparable performance against the regular training over thousands or even millions of parameters.
What problem does this paper attempt to address?