Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity
Qian Yu,Yining Wang,Baihe Huang,Qi Lei,Jason D. Lee
2024-06-28
Abstract:Optimization of convex functions under stochastic zeroth-order feedback has been a major and challenging question in online learning. In this work, we consider the problem of optimizing second-order smooth and strongly convex functions where the algorithm is only accessible to noisy evaluations of the objective function it queries. We provide the first tight characterization for the rate of the minimax simple regret by developing matching upper and lower bounds. We propose an algorithm that features a combination of a bootstrapping stage and a mirror-descent stage. Our main technical innovation consists of a sharp characterization for the spherical-sampling gradient estimator under higher-order smoothness conditions, which allows the algorithm to optimally balance the bias-variance tradeoff, and a new iterative method for the bootstrapping stage, which maintains the performance for unbounded Hessian.
Machine Learning,Information Theory,Optimization and Control
What problem does this paper attempt to address?
This paper attempts to solve the problem of optimizing second - order smooth and strongly convex functions under stochastic zero - order feedback. Specifically, when the objective function has strong convexity and high - order smoothness, the author studies the optimal sample complexity. In such a setting, the algorithm can only access the noisy evaluation values of the objective function and cannot directly obtain gradient information. Therefore, how to effectively optimize the objective function under such conditions and achieve the optimal sample complexity is the core issue of this paper.
### Main Contributions
1. **Optimal Sample Complexity**:
- The author provides the first tight minimax simple regret rate characterization in stochastic zero - order optimization for objective functions that are strongly convex and Hessian Lipschitz continuous, proven by matching upper and lower bounds.
- An algorithm that combines a bootstrapping phase and a mirror descent phase is proposed, which can balance the bias - variance trade - off and maintain performance in the case of an unbounded Hessian.
2. **Technological Innovations**:
- **Gradient Estimator**: An exact characterization of the spherical sampling gradient estimator under high - order smoothness conditions is proposed, which enables the algorithm to handle the bias - variance trade - off more effectively.
- **New Method for the Bootstrapping Phase**: A new iterative method is introduced, which can maintain performance in the bootstrapping phase and work effectively even in the case of an unbounded Hessian.
3. **Theoretical Analysis**:
- The lower bound of the minimax regret is derived by the KL - divergence method, thereby fully characterizing the optimal sample complexity.
### Specific Problem Description
- **Objective Function**: Assume that the objective function \( f: \mathbb{R}^d \to \mathbb{R} \) is twice differentiable and satisfies the following conditions:
- **Lipschitz Hessian**: There exists a constant \( \rho \in (0, +\infty) \) such that for all \( x, x' \in \mathbb{R}^d \), \( \| \nabla^2 f(x) - \nabla^2 f(x') \|_F \leq \rho \| x' - x \|_2 \).
- **Strong Convexity**: There exists a constant \( M \in (0, +\infty) \) such that for any \( x \in \mathbb{R}^d \), the smallest eigenvalue of the Hessian matrix \( \nabla^2 f(x) \) is greater than \( M \).
- **Bounded Distance from Initialization to Optimal Point**: There exists a constant \( R \in (0, +\infty) \) such that the infimum of \( f(x) \) within the hypersphere \( \| x \|_2 \leq R \) is equal to the infimum of \( f(x) \) over the entire \( \mathbb{R}^d \).
- **Optimization Objective**: Design a learning algorithm \( A \) such that after \( T \) samplings, it can approach the global minimum point \( x^* \) with high probability, that is, minimize the expected difference \( E[f(x_T) - f(x^*)] \).
### Conclusion
This paper, by proposing an algorithm that combines a bootstrapping phase and a mirror descent phase, for the first time solves the problem of optimal sample complexity in optimizing second - order smooth and strongly convex functions under stochastic zero - order feedback. This result not only promotes the theoretical research in the field of zero - order optimization but also provides effective algorithmic tools for practical applications. Future research directions can be further extended to the average regret metric in online settings and the fundamental trade - off between simple regret and average regret.