Trust-Region Stochastic Optimization with Variance Reduction Technique

Xinshou Zheng
2024-12-01
Abstract:We propose a novel algorithm, TR-SVR, for solving unconstrained stochastic optimization problems. This method builds on the trust-region framework, which effectively balances local and global exploration in optimization tasks. TR-SVR incorporates variance reduction techniques to improve both computational efficiency and stability when addressing stochastic objective functions. The algorithm applies a sequential quadratic programming (SQP) approach within the trust-region framework, solving each subproblem approximately using variance-reduced gradient estimators. This integration ensures a robust convergence mechanism while maintaining efficiency, making TR-SVR particularly suitable for large-scale stochastic optimization challenges.
Optimization and Control
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the **large - scale unconstrained stochastic optimization problem**, especially the common challenges in machine learning and data - driven tasks. Specifically, the paper proposes a new algorithm, TR - SVR (Trust - Region Stochastic Variance Reduced), aiming to improve computational efficiency and stability by combining the trust - region method and variance - reduction techniques. ### Specific description of the problem: 1. **High - variance gradient estimation**: In large - scale stochastic optimization problems, the traditional Stochastic Gradient Descent (SGD) method has a slow and unstable convergence rate due to its high - variance gradient estimation. 2. **Limitations of traditional methods**: Existing Sequential Quadratic Programming (SQP) methods are usually applicable to deterministic or constrained optimization problems and perform poorly when dealing with unconstrained stochastic optimization problems. 3. **Challenges in large - scale optimization**: In large - scale machine - learning and data - driven tasks, optimization algorithms need to be efficient and scalable to handle vast amounts of data and complex models. ### Main features of the TR - SVR algorithm: - **Trust - region framework**: By defining a reliable region, ensure that the quadratic approximation of the objective function can effectively guide the optimization process, balancing local and global exploration. - **Variance - reduction techniques**: Draw on methods such as Stochastic Variance Reduced Gradient (SVRG) to significantly reduce the noise in stochastic gradient estimation and improve the stability and accuracy of the optimization process. - **Dynamically adjust step size**: Dynamically adjust the trust - region radius according to the quality of the current solution and the quality of the gradient estimation to ensure that the algorithm can converge robustly even in a noisy environment. - **Efficient quadratic sub - problem solving**: Use the approximate value of the Hessian matrix to solve the quadratic sub - problem, avoiding expensive second - order calculations and maintaining computational efficiency. ### Application scenarios: TR - SVR is particularly suitable for large - scale machine - learning and data - driven tasks, such as deep learning, signal processing, and control systems, where optimization algorithms are required to have high efficiency and scalability. ### Summary: By proposing the TR - SVR algorithm, the paper solves the problems of high - variance gradient estimation and inefficiency in existing methods when dealing with large - scale unconstrained stochastic optimization problems, providing a more efficient and stable solution for practical applications.