On the adaptive deterministic block coordinate descent methods with momentum for solving large linear least-squares problems

Long-Ze Tan, Ming-Yu Deng, Jia-Li Qiu, Xue-Ping Guo

2024-10-26

Abstract: In this work, we first present an adaptive deterministic block coordinate descent method with momentum (mADBCD) to solve the linear least-squares problem, which is based on Polyak's heavy ball method and a new column selection criterion for a set of block-controlled indices defined by the Euclidean norm of the residual vector of the normal equation. The mADBCD method eliminates the need for pre-partitioning the column indices of the coefficient matrix, and it also obviates the need to compute the Moore-Penrose pseudoinverse of a column sub-matrix at each iteration. Moreover, we demonstrate the adaptability and flexibility in the automatic selection and updating of the block control index set. When the coefficient matrix has full rank, the theoretical analysis of the mADBCD method indicates that it converges linearly to the unique solution of the linear least-squares problem. Furthermore, by effectively integrating the count sketch technique with the mADBCD method, we also propose a novel count sketch adaptive block coordinate descent method with momentum (CS-mADBCD) for solving highly overdetermined linear least-squares problems and analyze its convergence. Finally, numerical experiments illustrate the advantages of the two proposed methods in terms of both CPU time and iteration counts compared to recent block coordinate descent methods.
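
For orientation, the standard textbook formulations behind the terms used in the abstract can be written as below. The notation (coefficient matrix \(A\), right-hand side \(b\), iterate \(x_k\), step size \(\alpha\), momentum parameter \(\beta\), objective \(f\)) is assumed here for illustration and is not taken from the paper.

\[
\begin{aligned}
&\min_{x \in \mathbb{R}^n} f(x) = \tfrac{1}{2}\,\|b - Ax\|_2^2, \qquad A^{\top}(b - Ax) = 0 \quad \text{(normal equation)}, \\
&x_{k+1} = x_k - \alpha \nabla f(x_k) + \beta\,(x_k - x_{k-1}) \quad \text{(Polyak's heavy ball method)}.
\end{aligned}
\]

Here \(\nabla f(x_k) = -A^{\top}(b - Ax_k)\), so the residual of the normal equation is exactly the negative gradient of the least-squares objective.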

Numerical Analysis

What problem does this paper attempt to address?

This paper seeks efficient solution methods for large-scale linear least-squares problems. Specifically, the authors propose two new algorithms:

1. **Adaptive deterministic block coordinate descent with momentum (mADBCD)**:
   - This method accelerates convergence by incorporating Polyak's heavy ball method, and it uses a new column-selection criterion to automatically select and update the block-control index set. It does not require pre-partitioning the column indices of the coefficient matrix, nor does it need to compute the Moore-Penrose pseudoinverse of a column sub-matrix at each iteration, which improves computational efficiency.
   - Theoretical analysis shows that, when the coefficient matrix has full rank, the mADBCD method converges linearly to the unique solution of the linear least-squares problem.
2. **Count sketch adaptive deterministic block coordinate descent with momentum (CS-mADBCD)**:
   - This method combines mADBCD with the count sketch technique to solve highly overdetermined linear least-squares problems. Count sketch compresses a large matrix into a smaller approximate one, reducing computational cost while maintaining accuracy.
   - Theoretical analysis shows that the CS-mADBCD method converges linearly to the unique solution of the linear least-squares problem with probability \(1-\delta\).

Numerical experiments show that both methods outperform existing block coordinate descent methods (such as GBGS, MRBGS, and FBCD) in terms of iteration counts and CPU time, especially for high-dimensional data and large-scale linear systems.
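
The two ideas summarized above, a greedy block update with a heavy-ball momentum term and a count sketch applied before solving, can be illustrated with a minimal NumPy sketch. This is not the authors' algorithm: the function names `momentum_block_cd` and `count_sketch`, the selection rule (keep columns whose normalized normal-equation residual entry is at least a fraction `theta` of the largest one), and the parameters `theta` and `beta` are all assumptions made for illustration.

```python
import numpy as np


def momentum_block_cd(A, b, theta=0.5, beta=0.1, tol=1e-10, max_iter=5000):
    """Illustrative greedy block coordinate descent with heavy-ball momentum.

    The selection rule and the defaults for theta and beta are assumptions
    of this sketch, not the criterion or parameters from the paper.
    """
    n = A.shape[1]
    col_norms_sq = np.sum(A * A, axis=0)
    threshold = tol * np.linalg.norm(A.T @ b)  # relative stopping threshold
    x_prev = np.zeros(n)
    x = np.zeros(n)
    iters = 0
    for _ in range(max_iter):
        s = A.T @ (b - A @ x)  # residual of the normal equation
        if np.linalg.norm(s) <= threshold:
            break
        # Hypothetical rule: pick columns with large normalized residual entries.
        score = s**2 / col_norms_sq
        block = score >= theta * score.max()
        eta = np.where(block, s, 0.0)  # search direction restricted to the block
        A_eta = A @ eta
        step = (eta @ s) / (A_eta @ A_eta)  # exact line search along eta
        x_new = x + step * eta + beta * (x - x_prev)  # heavy-ball momentum term
        x_prev, x = x, x_new
        iters += 1
    return x, iters


def count_sketch(A, b, d, rng):
    """Apply a d-row count sketch S to A and b without forming S explicitly.

    Each original row is multiplied by a random sign and added to one
    randomly chosen row of the sketched matrix.
    """
    m = A.shape[0]
    rows = rng.integers(0, d, size=m)
    signs = rng.choice([-1.0, 1.0], size=m)
    SA = np.zeros((d, A.shape[1]))
    Sb = np.zeros(d)
    np.add.at(SA, rows, signs[:, None] * A)
    np.add.at(Sb, rows, signs * b)
    return SA, Sb
```

A typical use would be to call `count_sketch(A, b, d, rng)` on a tall matrix and then pass the smaller pair `(SA, Sb)` to `momentum_block_cd`. The direction `eta` is applied through a single matrix-vector product, so no pseudoinverse of a column sub-matrix is formed, which mirrors the property highlighted in the summary.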
