SPAN: A Stochastic Projected Approximate Newton Method
Xunpeng Huang,Xianfeng Liang,Zhengyang Liu,Lei Li,Yue Yu,Yitan Li
DOI: https://doi.org/10.1609/aaai.v34i02.5511
2020-01-01
Proceedings of the AAAI Conference on Artificial Intelligence
Abstract:Second-order optimization methods have desirable convergence properties.However, the exact Newton method requires expensive computation for the Hessianand its inverse. In this paper, we propose SPAN, a novel approximate and fastNewton method. SPAN computes the inverse of the Hessian matrix via low-rankapproximation and stochastic Hessian-vector products. Our experiments onmultiple benchmark datasets demonstrate that SPAN outperforms existingfirst-order and second-order optimization methods in terms of the convergencewall-clock time. Furthermore, we provide a theoretical analysis of theper-iteration complexity, the approximation error, and the convergence rate.Both the theoretical analysis and experimental results show that our proposedmethod achieves a better trade-off between the convergence rate and theper-iteration efficiency.