LALR: Theoretical and Experimental validation of Lipschitz Adaptive Learning Rate in Regression and Neural Networks

Snehanshu Saha,Tejas Prashanth,Suraj Aralihalli,Sumedh Basarkod,T.S.B Sudarshan,Soma S Dhavala
DOI: https://doi.org/10.48550/arXiv.2006.13307
2020-05-19
Abstract:We propose a theoretical framework for an adaptive learning rate policy for the Mean Absolute Error loss function and Quantile loss function and evaluate its effectiveness for regression tasks. The framework is based on the theory of Lipschitz continuity, specifically utilizing the relationship between learning rate and Lipschitz constant of the loss function. Based on experimentation, we have found that the adaptive learning rate policy enables up to 20x faster convergence compared to a constant learning rate policy.
Machine Learning
What problem does this paper attempt to address?