Construction of Hierarchically Semi-Separable matrix Representation using Adaptive Johnson-Lindenstrauss Sketching

Yotam Yaniv,Osman Asif Malik,Pieter Ghysels,Xiaoye S. Li
2025-01-03
Abstract:We present an extension of an adaptive, partially matrix-free, Hierarchically Semi-Separable (HSS) matrix construction algorithm by Gorman et al. [SIAM J. Sci. Comput. 41(5), 2019] which uses Gaussian sketching operators to a broader class of Johnson--Lindenstrauss (JL) sketching operators. We develop theoretical work which justifies this extension. In particular, we extend the earlier concentration bounds to all JL sketching operators and examine this bound for specific classes of such operators including the original Gaussian sketching operators, subsampled randomized Hadamard transform (SRHT) and the sparse Johnson--Lindenstrauss transform (SJLT). We discuss the implementation details of applying SJLT and SRHT efficiently. Then we demonstrate experimentally that using SJLT or SRHT instead of Gaussian sketching operators leads to up to 2.5x speedups of the serial HSS construction implementation in the STRUMPACK C++ library. Additionally, we discuss the implementation of a parallel distributed HSS construction that leverages Gaussian or SJLT sketching operators. We observe a performance improvement of up to 35x when using SJLT sketching operators over Gaussian sketching operators. The generalized algorithm allows users to select their own JL sketching operators with theoretical lower bounds on the size of the operators which may lead to faster run time with similar HSS construction accuracy.
Numerical Analysis
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the problem of how to improve computational efficiency by using a broader Johnson - Lindenstrauss (JL) sketching operator when constructing Hierarchically Semi - Separable (HSS) matrices. Specifically, the goals of the paper include: 1. **Expand the adaptive HSS compression algorithm**: Generalize the existing adaptive HSS compression algorithm that depends on the Gaussian sketching operator to any JL sketching operator, thereby providing more choices and flexibility. 2. **Increase the speed of the sketching step**: By introducing the Sparse Johnson - Lindenstrauss Transform (SJLT) and the Randomized Hadamard Transform (SRHT), reduce the time complexity of the sketching step from the original O(n^2d) to O(nαd), where α ≪ d. 3. **Ensure compression accuracy**: While increasing the speed, ensure that the accuracy of HSS matrix compression using new sketching operators (such as SJLT and SRHT) is comparable to that of the traditional Gaussian sketching operator. 4. **Parallel implementation and optimization**: Develop a parallel and distributed HSS construction algorithm, utilize Gaussian and SJLT sketching operators, and observe a performance improvement of up to 35 times. ### Core contributions of the paper - **Theoretical analysis**: Prove the effectiveness of the Frobenius norm stopping criterion for all JL sketching operators and provide a theoretical lower bound for these operators. - **Experimental verification**: Through experiments, show that using SJLT and SRHT can significantly accelerate the construction of HSS matrices compared to the Gaussian sketching operator while maintaining similar compression accuracy. - **Implementation details**: Implement a general - purpose HSS compression algorithm in the STRUMPACK C++ library. Users can choose different sketching operators, and the specific implementation details are discussed. ### Formula summary - **Frobenius norm stopping criterion**: \[ \frac{\|bS\|_F}{\|\tilde{S}\|_F} < \epsilon_{\text{rel}}, \quad \|bS\|_F < \epsilon_{\text{abs}} \] where \(\tilde{S} = A_{\nu_i, \bar{\nu_j}} R\) is the newly generated sketching matrix, and \(bS=(I - Q_\tau Q_\tau^*) \tilde{S}\) is the projection of the new sketch on the orthogonal complement space of the current sketch. - **Conditions of the JL lemma**: \[ d \geq 4 \left( \frac{\epsilon^2}{2} - \frac{\epsilon^3}{3} \right)^{-1} \log m \] For any points \(u, v\in P\) in the point set \(P\), satisfy: \[ (1 - \epsilon)\|u - v\|^2 \leq \|f(u)-f(v)\|^2 \leq (1 + \epsilon)\|u - v\|^2 \] - **Conditions of the Gaussian sketching operator**: \[ d \geq 20 \epsilon^{-2} \log(2/\delta) \] Ensure: \[ (1 - \epsilon)\|A\|_F^2 \leq \|AR\|_F^2 \leq (1 + \epsilon)\|A\|_F^2 \] Through these improvements and optimizations, the paper not only improves the efficiency of HSS matrix construction but also provides a theoretical basis and practical guidance for future research.