Abstract:Optimization of convex functions under stochastic zeroth-order feedback has been a major and challenging question in online learning. In this work, we consider the problem of optimizing second-order smooth and strongly convex functions where the algorithm is only accessible to noisy evaluations of the objective function it queries. We provide the first tight characterization for the rate of the minimax simple regret by developing matching upper and lower bounds. We propose an algorithm that features a combination of a bootstrapping stage and a mirror-descent stage. Our main technical innovation consists of a sharp characterization for the spherical-sampling gradient estimator under higher-order smoothness conditions, which allows the algorithm to optimally balance the bias-variance tradeoff, and a new iterative method for the bootstrapping stage, which maintains the performance for unbounded Hessian.

What problem does this paper attempt to address?

This paper attempts to solve the problem of optimizing second - order smooth and strongly convex functions under stochastic zero - order feedback. Specifically, when the objective function has strong convexity and high - order smoothness, the author studies the optimal sample complexity. In such a setting, the algorithm can only access the noisy evaluation values of the objective function and cannot directly obtain gradient information. Therefore, how to effectively optimize the objective function under such conditions and achieve the optimal sample complexity is the core issue of this paper. ### Main Contributions 1. **Optimal Sample Complexity**: - The author provides the first tight minimax simple regret rate characterization in stochastic zero - order optimization for objective functions that are strongly convex and Hessian Lipschitz continuous, proven by matching upper and lower bounds. - An algorithm that combines a bootstrapping phase and a mirror descent phase is proposed, which can balance the bias - variance trade - off and maintain performance in the case of an unbounded Hessian. 2. **Technological Innovations**: - **Gradient Estimator**: An exact characterization of the spherical sampling gradient estimator under high - order smoothness conditions is proposed, which enables the algorithm to handle the bias - variance trade - off more effectively. - **New Method for the Bootstrapping Phase**: A new iterative method is introduced, which can maintain performance in the bootstrapping phase and work effectively even in the case of an unbounded Hessian. 3. **Theoretical Analysis**: - The lower bound of the minimax regret is derived by the KL - divergence method, thereby fully characterizing the optimal sample complexity. ### Specific Problem Description - **Objective Function**: Assume that the objective function \( f: \mathbb{R}^d \to \mathbb{R} \) is twice differentiable and satisfies the following conditions: - **Lipschitz Hessian**: There exists a constant \( \rho \in (0, +\infty) \) such that for all \( x, x' \in \mathbb{R}^d \), \( \| \nabla^2 f(x) - \nabla^2 f(x') \|_F \leq \rho \| x' - x \|_2 \). - **Strong Convexity**: There exists a constant \( M \in (0, +\infty) \) such that for any \( x \in \mathbb{R}^d \), the smallest eigenvalue of the Hessian matrix \( \nabla^2 f(x) \) is greater than \( M \). - **Bounded Distance from Initialization to Optimal Point**: There exists a constant \( R \in (0, +\infty) \) such that the infimum of \( f(x) \) within the hypersphere \( \| x \|_2 \leq R \) is equal to the infimum of \( f(x) \) over the entire \( \mathbb{R}^d \). - **Optimization Objective**: Design a learning algorithm \( A \) such that after \( T \) samplings, it can approach the global minimum point \( x^* \) with high probability, that is, minimize the expected difference \( E[f(x_T) - f(x^*)] \). ### Conclusion This paper, by proposing an algorithm that combines a bootstrapping phase and a mirror descent phase, for the first time solves the problem of optimal sample complexity in optimizing second - order smooth and strongly convex functions under stochastic zero - order feedback. This result not only promotes the theoretical research in the field of zero - order optimization but also provides effective algorithmic tools for practical applications. Future research directions can be further extended to the average regret metric in online settings and the fundamental trade - off between simple regret and average regret.

Stochastic Zeroth-Order Optimization under Strongly Convexity and Lipschitz Hessian: Minimax Sample Complexity

Improved Complexity for Smooth Nonconvex Optimization: A Two-Level Online Learning Approach with Quasi-Newton Methods

Stochastic Zeroth-order Optimization Via Variance Reduction Method.

Zeroth-order Gradient and Quasi-Newton Methods for Nonsmooth Nonconvex Stochastic Optimization

Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization

Stochastic Optimization under Hidden Convexity

Gradient-Free Methods for Non-Smooth Convex Stochastic Optimization with Heavy-Tailed Noise on Convex Compact

Sample Complexity for Quadratic Bandits: Hessian Dependent Bounds and Optimal Algorithms

A Stochastic Quasi-Newton Method for Non-convex Optimization with Non-uniform Smoothness

Online learning under one sided -smooth function

Accelerated zero-order SGD under high-order smoothness and overparameterized regime

Stochastic Second-order Methods for Non-convex Optimization with Inexact Hessian and Gradient

Safe Online Convex Optimization with Multi-Point Feedback

An Algorithm with Optimal Dimension-Dependence for Zero-Order Nonsmooth Nonconvex Stochastic Optimization

Zeroth-Order Non-Convex Optimization for Cooperative Multi-Agent Systems with Diminishing Step Size and Smoothing Radius

Beyond Convexity: Stochastic Quasi-Convex Optimization

Accelerated Stochastic Optimization Methods under Quasar-convexity

Near Optimal Stochastic Algorithms for Finite-Sum Unbalanced Convex-Concave Minimax Optimization

Simple and Optimal Stochastic Gradient Methods for Nonsmooth Nonconvex Optimization

Universal Online Convex Optimization with Minimax Optimal Second-Order Dynamic Regret

Zeroth-order Random Subspace Algorithm for Non-smooth Convex Optimization