The complexity of first-order optimization methods from a metric perspective

Adrian S. Lewis,Tonghua Tian
2023-05-05
Abstract:A central tool for understanding first-order optimization algorithms is the Kurdyka-Lojasiewicz inequality. Standard approaches to such methods rely crucially on this inequality to leverage sufficient decrease conditions involving gradients or subgradients. However, the KL property fundamentally concerns not subgradients but rather "slope", a purely metric notion. By highlighting this view, and avoiding any use of subgradients, we present a simple and concise complexity analysis for first-order optimization algorithms on metric spaces. This subgradient-free perspective also frames a short and focused proof of the KL property for nonsmooth semi-algebraic functions.
Optimization and Control,Numerical Analysis
What problem does this paper attempt to address?
The problem that this paper attempts to solve is about the complexity analysis of first - order optimization algorithms in metric spaces. Specifically, the authors hope to provide a simple and concise framework for the complexity analysis of first - order optimization algorithms by emphasizing the metric properties of the Kurdyka - Łojasiewicz (KL) inequality, rather than the traditional sub - gradient method. This method is not only applicable to non - Euclidean spaces, such as optimization problems on manifolds, but also can simplify and clarify the existing KL inequality proofs while maintaining its wide range of applications. ### Main contributions: 1. **KL inequality from a metric perspective**: The paper re - examines the KL inequality from the perspective of metric spaces, emphasizing the pure metric concept of "slope" rather than sub - gradients. This makes the analysis more concise and transparent. 2. **Complexity analysis without sub - gradients**: By avoiding the use of sub - gradients, the authors provide a new, simplified complexity analysis method that is applicable to a wider range of metric spaces. 3. **KL properties of semi - algebraic functions**: The paper provides a new, slope - based proof of the KL properties of semi - algebraic functions, simplifying the previous complexity analysis. 4. **Convergence results**: The authors study the iterative sequences that satisfy the basic slope - descent conditions and derive local and global convergence results. 5. **Application examples**: The paper takes the majorization - minimization method on Riemann manifolds and more general geodesic spaces as examples to show the application of the proposed framework. ### Key technical points: - **KL inequality**: A central tool for understanding the convergence of first - order optimization algorithms. - **Slope**: Defined as the rate of change of the optimal value after minimizing the perturbation near a point of a function, it is a pure metric concept. - **Metric space**: The method in the paper is applicable not only to Euclidean spaces but also to more general metric spaces. - **Semi - algebraic function**: A class of functions with a specific structure, suitable for many practical optimization problems. ### Paper structure: 1. **Introduction**: Introduce the historical background and importance of the KL inequality, as well as the research motivation of this paper. 2. **KL properties**: Define the KL properties in detail and explain them from a metric perspective. 3. **Convergence of objective values**: Discuss the convergence of objective values of sequences that satisfy the basic descent conditions. 4. **Slope - based proof of KL properties**: Provide a simplified, slope - based proof of the KL properties. 5. **Convergence to minimum points**: Study the convergence of sequences that satisfy the slope - descent conditions. 6. **Convergence to critical points**: Study the conditions for sequences to converge to critical points without the assumption of global convexity. 7. **Majorization - minimization method**: Take the majorization - minimization method on Riemann manifolds as an example to show the application of the proposed framework. In conclusion, by emphasizing the metric properties of the KL inequality, this paper provides a new, concise method for the complexity analysis of first - order optimization algorithms, which is applicable to a wider range of metric spaces. This not only simplifies the existing theory but also provides a new perspective for optimization problems in non - Euclidean spaces.