High Performance Evaluation of the Interpolations and Anterpolations in the GPU-Accelerated Massively Parallel MLFMA

Wei-Jia He, Zeng Yang, Xiao-Wei Huang, Wu Wang, Ming-Lin Yang, Xin-Qing Sheng
DOI: https://doi.org/10.1109/tap.2023.3269106
IF: 5.7
2023-01-01
IEEE Transactions on Antennas and Propagation
Abstract: This communication investigates high-performance computation schemes for local Lagrange interpolation and anterpolation operations in the parallel graphics processing unit (GPU)-accelerated distributed-memory multilevel fast multipole algorithm (MLFMA). Two ELLPACK format-based schemes, namely, block ELLPACK (ELL-B) and hybrid compressed sparse column (CSC)-block ELLPACK (CSC-ELL-B), are proposed for the evaluation of interpolation and anterpolation operations, respectively, which ensure high computational throughput for GPU calculation. Optimizations based on the GPU hierarchical memory architecture, the stream mechanism, and the CPU/GPU asynchronous computation pattern are employed to further improve the overall performance. The proposed schemes are shown to be an order of magnitude faster than the conventional schemes for aggregation/disaggregation operations. For an aircraft model involving over 10 billion unknowns, the iteration time is reduced by over half, which is remarkable progress in the development of GPU-accelerated parallelization of MLFMA.
telecommunications, engineering, electrical & electronic
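The abstract names ELLPACK-based storage but does not describe the ELL-B or CSC-ELL-B layouts themselves. As a rough illustration of why the format suits local interpolation, the sketch below stores a one-dimensional, two-point (linear) Lagrange interpolation matrix in plain ELLPACK form: every row has the same number of nonzeros, so column indices and weights fit in fixed-width arrays. The function names, the 1-D stencil, and the sample grids are assumptions made for illustration and are not taken from the paper. The transpose product stands in for anterpolation, which is the transpose of interpolation.

```python
# Illustrative sketch only: plain ELLPACK, 1-D linear stencil (assumed, not the paper's scheme).

def build_ell_linear_interp(coarse_x, fine_x):
    """Return (cols, vals) with exactly 2 entries per row, one row per fine sample."""
    cols, vals = [], []
    for x in fine_x:
        # Locate the coarse interval [x_j, x_{j+1}] that contains x.
        j = 0
        while j < len(coarse_x) - 2 and x > coarse_x[j + 1]:
            j += 1
        x0, x1 = coarse_x[j], coarse_x[j + 1]
        w1 = (x - x0) / (x1 - x0)          # Lagrange weight of the right node
        cols.append([j, j + 1])
        vals.append([1.0 - w1, w1])
    return cols, vals

def ell_matvec(cols, vals, v):
    """Interpolation-like product y = A v: each row is an independent gather."""
    return [sum(a * v[c] for a, c in zip(row_v, row_c))
            for row_c, row_v in zip(cols, vals)]

def ell_matvec_transpose(cols, vals, y, n_cols):
    """Anterpolation-like product v = A^T y: a scatter over the same storage."""
    out = [0.0] * n_cols
    for row_c, row_v, yi in zip(cols, vals, y):
        for c, a in zip(row_c, row_v):
            out[c] += a * yi               # several rows may hit the same column
    return out

cols, vals = build_ell_linear_interp([0.0, 1.0, 2.0], [0.0, 0.5, 1.0, 1.5, 2.0])
fine = ell_matvec(cols, vals, [0.0, 10.0, 20.0])
coarse = ell_matvec_transpose(cols, vals, [1.0] * 5, 3)
```

In the row-wise product each output depends on one fixed-width row, which maps naturally to one GPU thread per row. The transpose product scatters into shared columns, which is one plausible reason to pair a column-oriented format with ELLPACK for anterpolation, as the CSC-ELL-B name suggests.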