Research on GRAPES Semi-Implicit Semi-Lagrangian Computation Optimization Based on CPU+GPU Heterogeneity

Zhengsheng Ning, Jianqiang Huang, Haodong Bian, Zhanjie Tan
DOI: https://doi.org/10.1109/ecnct63103.2024.10704385
2024-01-01
Abstract: In the GRAPES numerical weather forecast system, efficient semi-implicit semi-Lagrangian computation is crucial to overall system performance. This paper ports and optimizes that computation through performance hotspot analysis, code parallelism analysis, heterogeneous parallel strategy design, and GPU optimization. A load-balanced MPI+CUDA scheme allocates tasks to the CPU and GPU for simultaneous computation, and performance is further improved with coalesced memory access, shared memory, and pinned ("fixed") memory. Experimental results show that, compared with MPI alone, MPI+CUDA achieves speedups of 9.6 times and 5.3 times on the two hotspot modules, respectively. The results also indicate that the MPI+CUDA solution scales well and preserves correctness.
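The abstract does not say how the load balance between CPU and GPU is computed. As an illustration only, the sketch below assumes a static split in which each device receives a share of the grid rows proportional to its measured throughput. The function names `split_rows` and `estimated_time` and the throughput figures are hypothetical and are not taken from the paper.

```python
def split_rows(n_rows, cpu_rate, gpu_rate):
    """Split n_rows between CPU and GPU in proportion to throughput.

    cpu_rate and gpu_rate are hypothetical measured throughputs
    (rows per second); this static split is an assumption, not the
    paper's documented scheme.
    """
    if n_rows < 0 or cpu_rate <= 0 or gpu_rate <= 0:
        raise ValueError("n_rows must be >= 0 and rates must be > 0")
    gpu_rows = round(n_rows * gpu_rate / (cpu_rate + gpu_rate))
    cpu_rows = n_rows - gpu_rows  # remainder goes to the CPU
    return cpu_rows, gpu_rows


def estimated_time(cpu_rows, gpu_rows, cpu_rate, gpu_rate):
    """Both devices run concurrently, so the slower one sets the time."""
    return max(cpu_rows / cpu_rate, gpu_rows / gpu_rate)


if __name__ == "__main__":
    cpu_rows, gpu_rows = split_rows(1000, cpu_rate=1.0, gpu_rate=9.0)
    print(cpu_rows, gpu_rows)
    print(estimated_time(cpu_rows, gpu_rows, 1.0, 9.0))
```

With a proportional split, both devices finish at roughly the same time, which is the condition a load-balanced CPU+GPU scheme aims for; a real implementation would likely also account for host-device transfer cost.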