On the adaptive deterministic block coordinate descent methods with momentum for solving large linear least-squares problems

Long-Ze Tan, Ming-Yu Deng, Jia-Li Qiu, Xue-Ping Guo
2024-10-26
Abstract: In this work, we first present an adaptive deterministic block coordinate descent method with momentum (mADBCD) to solve the linear least-squares problem, which is based on Polyak's heavy ball method and a new column selection criterion for a set of block-controlled indices defined by the Euclidean norm of the residual vector of the normal equation. The mADBCD method eliminates the need for pre-partitioning the column indices of the coefficient matrix, and it also obviates the need to compute the Moore-Penrose pseudoinverse of a column sub-matrix at each iteration. Moreover, we demonstrate the adaptability and flexibility in the automatic selection and updating of the block control index set. When the coefficient matrix has full rank, the theoretical analysis of the mADBCD method indicates that it converges linearly to the unique solution of the linear least-squares problem. Furthermore, by effectively integrating the count sketch technique with the mADBCD method, we also propose a novel count sketch adaptive block coordinate descent method with momentum (CS-mADBCD) for solving highly overdetermined linear least-squares problems and analyze its convergence. Finally, numerical experiments illustrate the advantages of the two proposed methods in terms of both CPU time and iteration counts compared to recent block coordinate descent methods.
Numerical Analysis
What problem does this paper attempt to address?
This paper seeks efficient solution methods for large-scale linear least-squares problems. Specifically, the authors propose two new algorithms:

1. **Adaptive deterministic block coordinate descent with momentum (mADBCD)**:
   - This method accelerates convergence by incorporating Polyak's heavy ball method, and it uses a new column-selection criterion to automatically select and update the block-control index set. It does not require pre-partitioning the column indices of the coefficient matrix, nor does it need to compute the Moore-Penrose pseudoinverse of a column sub-matrix at each iteration, which improves computational efficiency.
   - Theoretical analysis shows that, when the coefficient matrix has full rank, the mADBCD method converges linearly to the unique solution of the linear least-squares problem.
2. **Count sketch adaptive block coordinate descent with momentum (CS-mADBCD)**:
   - This method combines mADBCD with the count sketch technique to solve highly overdetermined linear least-squares problems. Count sketch compresses a large matrix into a smaller approximate form, which reduces the computational cost while maintaining accuracy and significantly improves the efficiency of the algorithm.
   - Theoretical analysis shows that the CS-mADBCD method converges linearly to the unique solution of the linear least-squares problem with probability \(1-\delta\).

Numerical experiments show that both methods outperform existing block coordinate descent methods (such as the GBGS, MRBGS, and FBCD methods) in terms of iteration counts and CPU time, especially when dealing with high-dimensional data and large-scale linear systems.
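The two ideas summarized above can be illustrated with a minimal NumPy sketch. It is not the authors' algorithm: the summary does not give the exact selection criterion or step size, so the greedy threshold rule, the exact line search along the selected block, and the names `count_sketch`, `block_cd_momentum`, `theta`, and `beta` are all assumptions made for illustration. The sketch does keep the ingredients named in the text: a block chosen adaptively from the normal-equation residual \(A^T(b - Ax)\), a pseudoinverse-free update, a heavy-ball momentum term, and an optional count sketch applied before the iteration.

```python
import numpy as np


def count_sketch(A, b, d, rng):
    """Compress an m-row problem (A, b) to d rows with a count sketch.

    Each original row is hashed to one of d buckets and multiplied by a
    random sign, so the sketching matrix is never formed explicitly.
    """
    m, n = A.shape
    buckets = rng.integers(0, d, size=m)       # hash: row index -> bucket
    signs = rng.choice([-1.0, 1.0], size=m)    # random sign per row
    SA = np.zeros((d, n))
    Sb = np.zeros(d)
    np.add.at(SA, buckets, signs[:, None] * A)  # accumulate signed rows
    np.add.at(Sb, buckets, signs * b)
    return SA, Sb


def block_cd_momentum(A, b, theta=0.5, beta=0.1, tol=1e-10, max_iter=5000):
    """Greedy block coordinate descent with heavy-ball momentum.

    Assumes A has full column rank and no zero columns. The block rule
    and step size are illustrative choices, not the paper's criterion.
    """
    n = A.shape[1]
    col_norms_sq = np.sum(A * A, axis=0)
    x = np.zeros(n)
    x_prev = x.copy()
    stop = tol * np.linalg.norm(A.T @ b)
    iters = 0
    for iters in range(max_iter):
        s = A.T @ (b - A @ x)                  # normal-equation residual
        if np.linalg.norm(s) <= stop:
            break
        # Adaptive block: columns whose scaled residual entry is large
        # relative to the current maximum (no pre-partitioning needed).
        score = s**2 / col_norms_sq
        block = np.flatnonzero(score >= theta * score.max())
        # Pseudoinverse-free step: move along s restricted to the block,
        # with the step length that minimizes the residual on that line.
        A_eta = A[:, block] @ s[block]
        alpha = (s[block] @ s[block]) / (A_eta @ A_eta)
        x_new = x + beta * (x - x_prev)        # heavy-ball momentum term
        x_new[block] += alpha * s[block]
        x_prev, x = x, x_new
    return x, iters


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((5000, 10))
    x_true = rng.standard_normal(10)
    b = A @ x_true + 0.01 * rng.standard_normal(5000)

    x_full, it_full = block_cd_momentum(A, b)
    SA, Sb = count_sketch(A, b, d=400, rng=rng)
    x_sk, it_sk = block_cd_momentum(SA, Sb)

    print("full problem error:    ", np.linalg.norm(x_full - x_true))
    print("sketched problem error:", np.linalg.norm(x_sk - x_true))
```

In this sketch, setting `beta=0` reduces the iteration to plain greedy block coordinate descent, and `theta` trades block size against selectivity: `theta=1` typically leaves only the single highest-scoring column, while small values pull in nearly every column. The sketched solve works on a 400-row system instead of a 5000-row one, at the price of an approximate solution.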