A general framework of Riemannian adaptive optimization methods with a convergence analysis

Hiroyuki Sakai,Hideaki Iiduka

2024-09-02

Abstract:This paper proposes a general framework of Riemannian adaptive optimization methods. The framework encapsulates several stochastic optimization algorithms on Riemannian manifolds and incorporates the mini-batch strategy that is often used in deep learning. Within this framework, we also propose AMSGrad on embedded submanifolds of Euclidean space. Moreover, we give convergence analyses valid for both a constant and a diminishing step size. Our analyses also reveal the relationship between the convergence rate and mini-batch size. In numerical experiments, we applied the proposed algorithm to principal component analysis and the low-rank matrix completion problem, which can be considered to be Riemannian optimization problems. Python implementations of the methods used in the numerical experiments are available at <a class="link-external link-https" href="https://github.com/iiduka-researches/202408-adaptive" rel="external noopener nofollow">this https URL</a>.

Optimization and Control

What problem does this paper attempt to address?

The paper aims to address the following issues: 1. **Propose a General Framework**: The paper proposes a general framework (Algorithm 1) for adaptive optimization methods on Riemannian manifolds. This framework can encapsulate various stochastic optimization algorithms and incorporates the mini-batch strategy commonly used in deep learning. 2. **Propose AMSGrad on Embedded Submanifolds**: Within this framework, the authors further propose a variant of AMSGrad on submanifolds embedded in Euclidean space (Algorithm 2). 3. **Convergence Analysis**: The authors conduct a convergence analysis of the proposed algorithms, proving that the method is applicable to both constant step sizes and diminishing step sizes, and revealing the relationship between the convergence rate and the mini-batch size. Through numerical experiments, the authors apply the proposed methods to Principal Component Analysis (PCA) and low-rank matrix completion problems, demonstrating their performance on different datasets. The experimental results indicate that the proposed adaptive optimization methods outperform existing Riemannian adaptive optimization methods in certain cases.

A general framework of Riemannian adaptive optimization methods with a convergence analysis

Convergence Analysis of Gradient Algorithms on Riemannian Manifolds Without Curvature Constraints and Application to Riemannian Mass

A Riemannian Alternating Descent Ascent Algorithmic Framework for Nonconvex-Linear Minimax Problems on Riemannian Manifolds

Riemannian stochastic fixed point optimization algorithm

Convergence of Inexact Steepest Descent Algorithm for Multiobjective Optimizations on Riemannian Manifolds Without Curvature Constraints

Learning to Optimize on Riemannian Manifolds

Learning-Rate-Free Stochastic Optimization over Riemannian Manifolds

A Framework for Bilevel Optimization on Riemannian Manifolds

Linear Convergence of Subgradient Algorithm for Convex Feasibility on Riemannian Manifolds

A Riemannian rank-adaptive method for low-rank matrix completion

On the Convergence of Adaptive Gradient Methods for Nonconvex Optimization

Convergence and worst-case complexity of adaptive Riemannian trust-region methods for optimization on manifolds

A Riemannian ADMM

Riemannian Stochastic Proximal Gradient Methods for Nonsmooth Optimization over the Stiefel Manifold

Convergence of Riemannian Stochastic Gradient Descent on Hadamard Manifold

Unified Convergence Analysis for Adaptive Optimization with Moving Average Estimator

A Comprehensive Framework for Analyzing the Convergence of Adam: Bridging the Gap with SGD

Riemannian Stochastic Hybrid Gradient Algorithm for Nonconvex Optimization

Adaptive Relaxed ADMM: Convergence Theory and Practical Implementation

A Riemannian optimization method on the indefinite Stiefel manifold

A Unified Approach to Adaptive Regularization in Online and Stochastic Optimization