Performance modeling and running strategy of parallel cdugksFOAM program

Yunlan Wang, Yufeng Liu, Rui Zhang, Tianhai Zhao, Sha Liu, Congshan Zhuo, Chengwen Zhong
DOI: https://doi.org/10.1016/j.cpc.2024.109186
IF: 4.717
2024-03-26
Computer Physics Communications
Abstract: The cdugksFOAM program parallelizes over the physical-space grid and the velocity-space grid simultaneously, which gives it the potential for large-scale parallelism. However, its running time depends strongly on the number of physical-space and velocity-space partitions. To find the optimal partitioning strategy for a specific CFD problem running on a parallel computer, we performed performance modeling of the cdugksFOAM program. First, we proposed a floating-point operation count model, an MPI communication volume model, and a memory consumption model. Based on these models, we established a roofline model to predict the computational time and a model to predict the communication time. From the computational time model and the communication time model, we derived an execution time model and verified its effectiveness with two cases. Finally, we identified the optimal running strategy, which minimizes the product of the number of computing nodes and the execution time, providing meaningful guidance for the economical operation of the program.
physics, mathematical; computer science, interdisciplinary applications
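
The selection criterion described in the abstract (minimize the number of computing nodes times the execution time) can be illustrated with a minimal sketch. This is not the paper's model: the abstract does not give its formulas, so the sketch assumes a textbook roofline bound for computation and a simple latency-plus-bandwidth estimate for communication. The names Machine, Strategy, MACHINE, and CANDIDATES, and every number below, are hypothetical placeholders. In practice the per-strategy counts would come from the paper's floating-point operation, communication volume, and memory consumption models.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Machine:
    """Hypothetical node parameters (placeholders, not values from the paper)."""
    peak_flops: float     # peak floating-point rate per node, FLOP/s
    mem_bandwidth: float  # memory bandwidth per node, byte/s
    net_bandwidth: float  # network bandwidth per node, byte/s
    net_latency: float    # latency per message, s
    cores_per_node: int


@dataclass(frozen=True)
class Strategy:
    """One partitioning choice with assumed per-node operation counts."""
    n_phys: int        # number of physical-space partitions
    n_vel: int         # number of velocity-space partitions
    flops: float       # floating-point operations per node
    mem_bytes: float   # memory traffic per node, bytes
    messages: int      # messages sent per node
    comm_bytes: float  # bytes communicated per node


def roofline_time(flops, mem_bytes, machine):
    # Roofline bound: limited by either the compute rate or the memory bandwidth.
    return max(flops / machine.peak_flops, mem_bytes / machine.mem_bandwidth)


def comm_time(messages, comm_bytes, machine):
    # Assumed latency-plus-bandwidth estimate of communication time.
    return messages * machine.net_latency + comm_bytes / machine.net_bandwidth


def execution_time(s, machine):
    # Assumes computation and communication do not overlap.
    return (roofline_time(s.flops, s.mem_bytes, machine)
            + comm_time(s.messages, s.comm_bytes, machine))


def node_count(s, machine):
    # One MPI process per core, one process per (physical, velocity) partition pair.
    return math.ceil(s.n_phys * s.n_vel / machine.cores_per_node)


def cost(s, machine):
    # The quantity to minimize: number of computing nodes times execution time.
    return node_count(s, machine) * execution_time(s, machine)


def best_strategy(strategies, machine):
    return min(strategies, key=lambda s: cost(s, machine))


# Made-up example data, for illustration only.
MACHINE = Machine(peak_flops=1e12, mem_bandwidth=1e11,
                  net_bandwidth=1e10, net_latency=1e-6, cores_per_node=32)
CANDIDATES = [
    Strategy(32, 1, flops=8e12, mem_bytes=4e11, messages=1000, comm_bytes=1e10),
    Strategy(32, 2, flops=4e12, mem_bytes=2e11, messages=2000, comm_bytes=2e10),
    Strategy(16, 4, flops=4e12, mem_bytes=2e11, messages=1000, comm_bytes=2e9),
]

if __name__ == "__main__":
    best = best_strategy(CANDIDATES, MACHINE)
    print(best.n_phys, best.n_vel, cost(best, MACHINE))
```

Because the cost multiplies node count by time, a strategy that finishes sooner on more nodes wins only if its speedup outweighs the additional nodes it occupies.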