Better than best low-rank approximation with the singular value decomposition

David F. Gleich
2024-02-28
Abstract: The Eckart-Young theorem states that the best low-rank approximation of a matrix can be constructed from the leading singular values and vectors of the matrix. Here, we illustrate that the practical implications of this result crucially depend on the organization of the matrix data. In particular, we will show examples where a rank 2 approximation of the matrix data in a different representation more accurately represents the entire matrix than a rank 5 approximation of the original matrix data -- even though both approximations have the same number of underlying parameters. Beyond images, we show examples of how flexible orientation enables better approximation of time series data, which suggests additional applicability of the findings. Finally, we conclude with a theoretical result that the effect of data organization can result in an unbounded improvement to the matrix approximation factor as the matrix dimension grows.
Numerical Analysis
What problem does this paper attempt to address?
The paper asks how reorganizing matrix data can improve the accuracy of a low-rank approximation. The author points out that the quality of a low-rank approximation computed with the singular value decomposition (SVD) depends heavily on how the data are arranged in the matrix: a different arrangement of the same data can give a more accurate approximation with the same number of parameters.

### Specific description of the problem

1. **Eckart-Young theorem**: The best rank-\(k\) approximation of a matrix can be constructed from its \(k\) leading singular values and the corresponding singular vectors. However, the practical value of this result depends on how the data are organized in the matrix.
2. **Reorganization of matrix data**: The paper shows that reorganizing the matrix data can significantly improve the quality of the low-rank approximation. For image data, for example, dividing the image into small blocks (tiles) and computing a low-rank approximation from these tiles can give better results than a low-rank approximation of the entire image taken directly.
3. **Application to time-series data**: Reorganizing time-series data (for example, splitting the series into time periods) can capture the trend in the data better and so improve the accuracy of the low-rank approximation.
4. **Theoretical results**: The paper also gives a theoretical result: the improvement obtained by reorganizing the data can grow without bound as the matrix dimension increases.

### Formula representation

The key formulas involved in the paper are:

- **Low-rank approximation problem under the Frobenius norm**:
  \[ \text{minimize} \sum_{i,j} |X_{ij} - Y_{ij}|^2 \]
  where \(Y\) is a matrix with rank not exceeding \(k\).
- **SVD**:
  \[ X = U \Sigma V^T \]
  where the columns of \(U\) and \(V\) are the left and right singular vectors respectively, and \(\Sigma\) is the diagonal matrix of singular values.
- **Best low-rank approximation**:
  \[ Y = \sum_{i = 1}^{k} \sigma_i u_i v_i^T \]
  where \(\sigma_i\) is the \(i\)-th largest singular value, and \(u_i\) and \(v_i\) are the corresponding left and right singular vectors.

These formulas frame the paper's claim that reorganizing matrix data can significantly improve the accuracy of low-rank approximation, especially for image and time-series data.
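The effect of reorganization can be seen in a small NumPy sketch. It is only an illustration and is not taken from the paper: the helper names `truncated_svd`, `to_tiles`, and `from_tiles`, the tile size, and the identity-matrix test case are all assumptions chosen to keep the example short. Each tile is flattened into one column of a new matrix, and the same rank is used in both representations, so both approximations store the same number of values.

```python
import numpy as np


def truncated_svd(X, k):
    """Best rank-k approximation of X in the Frobenius norm (Eckart-Young)."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k, :]


def to_tiles(X, t):
    """Rearrange X so that each t-by-t tile becomes one column (hypothetical helper)."""
    m, n = X.shape
    return (
        X.reshape(m // t, t, n // t, t)   # axes: (tile row, i, tile col, j)
        .transpose(1, 3, 0, 2)            # axes: (i, j, tile row, tile col)
        .reshape(t * t, (m // t) * (n // t))
    )


def from_tiles(T, shape, t):
    """Inverse of to_tiles: put the flattened tiles back in place."""
    m, n = shape
    return (
        T.reshape(t, t, m // t, n // t)   # axes: (i, j, tile row, tile col)
        .transpose(2, 0, 3, 1)            # axes: (tile row, i, tile col, j)
        .reshape(m, n)
    )


# Toy data (an assumption, not the paper's data): the 64-by-64 identity matrix.
X = np.eye(64)
t, k = 8, 1

# Rank-1 approximation of the matrix as given.
Y_direct = truncated_svd(X, k)

# Rank-1 approximation of the tile representation, mapped back to 64-by-64.
T = to_tiles(X, t)                        # also 64-by-64, so same parameter count
Y_tiled = from_tiles(truncated_svd(T, k), X.shape, t)

err_direct = np.linalg.norm(X - Y_direct)  # Frobenius norm by default
err_tiled = np.linalg.norm(X - Y_tiled)

print(f"direct rank-{k} error: {err_direct:.4f}")
print(f"tiled  rank-{k} error: {err_tiled:.2e}")
```

In this toy case every singular value of the identity equals 1, so the direct rank-1 error is \(\sqrt{63} \approx 7.94\). In the tile representation the nonzero columns are all copies of the flattened \(8 \times 8\) identity, so a rank-1 approximation is exact up to rounding. Growing the matrix while keeping the same construction widens the gap, which is in the spirit of the unbounded-improvement result summarized above.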