Abstract: We consider stochastic optimization problems in which the objective function is the expectation of a (possibly non-convex) cost function parametrized by a random variable. Although convergence speed is critical for many emerging applications, most existing stochastic optimization methods converge slowly. Moreover, the rise of parallel computing has increased the demand for stochastic optimization schemes that can run in parallel on distributed systems. We propose a fast parallel stochastic optimization framework that can solve a large class of possibly non-convex stochastic optimization problems, such as those arising in applications with multi-agent systems. In the proposed method, each agent updates its control variable in parallel by independently solving a convex quadratic subproblem. We establish convergence of the proposed method to the optimal solution for convex problems and to a stationary point for general non-convex problems. The proposed algorithm applies to a large class of optimization problems from various fields, such as machine learning and wireless networks. As a representative application of the framework, we focus on large-scale support vector machines and demonstrate how the algorithm solves this problem efficiently, especially in modern applications with huge datasets. Using popular real-world datasets, we present experimental results that compare the proposed framework with state-of-the-art methods from the literature. The numerical results show that the proposed method can significantly outperform these methods in convergence speed while having the same or lower complexity and storage requirements.
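The per-agent update described in the abstract can be illustrated with a minimal sketch. Everything concrete below is an assumption made for illustration and is not taken from the paper: the toy least-squares cost, the surrogate g_i (u - x_i) + (tau / 2)(u - x_i)^2 built from a running average of stochastic gradients, the step-size schedules, and all function and parameter names. Each agent owns one coordinate and minimizes its own convex quadratic surrogate in closed form, so the updates are independent and could run in parallel.

```python
import random

def parallel_quadratic_step(x, grad_avg, tau, gamma):
    """One round of independent per-agent updates (illustrative only).

    Agent i minimizes the convex quadratic surrogate
        g_i * (u - x_i) + (tau / 2) * (u - x_i) ** 2,
    whose closed-form minimizer is x_i - g_i / tau, then moves a
    fraction gamma of the way toward it. No agent needs another
    agent's result, so this loop could be distributed.
    """
    new_x = []
    for x_i, g_i in zip(x, grad_avg):
        best_response = x_i - g_i / tau
        new_x.append(x_i + gamma * (best_response - x_i))
    return new_x

def stochastic_gradient(x, sample):
    """Gradient of the toy cost 0.5 * (a . x - b) ** 2 for one sample (a, b)."""
    a, b = sample
    residual = sum(a_i * x_i for a_i, x_i in zip(a, x)) - b
    return [residual * a_i for a_i in a]

def run(num_iters=5000, seed=0):
    """Minimize the expected toy cost; returns (estimate, ground truth)."""
    rng = random.Random(seed)
    x_true = [1.0, -2.0, 0.5]          # hypothetical ground-truth parameters
    x = [0.0] * len(x_true)
    grad_avg = [0.0] * len(x_true)
    for t in range(1, num_iters + 1):
        # Draw one random sample (a, b) with b = a . x_true + noise.
        a = [rng.gauss(0.0, 1.0) for _ in x_true]
        b = sum(a_i * w_i for a_i, w_i in zip(a, x_true)) + rng.gauss(0.0, 0.1)
        g = stochastic_gradient(x, (a, b))
        rho = 1.0 / t ** 0.6           # averaging weight (assumed schedule)
        gamma = 1.0 / t ** 0.7         # step size (assumed schedule)
        # Running average of stochastic gradients damps the sampling noise.
        grad_avg = [(1 - rho) * d + rho * g_i for d, g_i in zip(grad_avg, g)]
        x = parallel_quadratic_step(x, grad_avg, tau=4.0, gamma=gamma)
    return x, x_true
```

Calling `run()` returns the final iterate together with the hypothetical ground truth. The diminishing schedules for `rho` and `gamma` are one common choice for this kind of averaged scheme and would need tuning for a real problem such as a large-scale support vector machine.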

Accelerating Stochastic Newton Method Via Chebyshev Polynomial Approximation

Parallel Stochastic Optimization Framework for Large-Scale Non-Convex Stochastic Problems

Stochastic Newton Proximal Extragradient Method

Stochastic Sub-Sampled Newton Method with Variance Reduction

Nesterov's Acceleration For Approximate Newton

SPAN: A Stochastic Projected Approximate Newton Method

Approximate Newton Methods and Their Local Convergence

A Stochastic Quasi-Newton Method for Non-convex Optimization with Non-uniform Smoothness

Stochastic Optimization for Non-convex Problem with Inexact Hessian Matrix, Gradient, and Function

A Single-Loop Stochastic Proximal Quasi-Newton Method for Large-Scale Nonsmooth Convex Optimization

Stochastic Optimization for Nonconvex Problem with Inexact Hessian Matrix, Gradient, and Function

Stochastic Variance-Reduced Newton: Accelerating Finite-Sum Minimization with Large Batches

Advancing the lower bounds: An accelerated, stochastic, second-order method with optimal adaptation to inexactness

A Fast Anderson-Chebyshev Acceleration for Nonlinear Optimization

An Anderson-Chebyshev Mixing Method for Nonlinear Optimization

Newton Meets Marchenko-Pastur: Massively Parallel Second-Order Optimization with Hessian Sketching and Debiasing

A Unifying Framework for Convergence Analysis of Approximate Newton Methods

A Bregman Proximal Stochastic Gradient Method with Extrapolation for Nonconvex Nonsmooth Problems

Solving Stochastic Optimization by Newton-type methods with Dimension-Adaptive Sparse Grid Quadrature

Stochastic Steffensen method

Fast Unconstrained Optimization via Hessian Averaging and Adaptive Gradient Sampling Methods