An Optimized GPU Implementation of Weakly-Compressible SPH Using CUDA-Based Strategies

Yuejin Cai, Jianguo Wei, Qingzhi Hou, Ruixue Gao
DOI: https://doi.org/10.1007/978-3-030-95384-3_23
2022-01-01
Abstract: Smoothed Particle Hydrodynamics (SPH) is a meshless method that is widely used to simulate computational fluid dynamics problems, but it is very time-consuming. Although many GPU-based solutions have been applied to accelerate the SPH method, the potential bottlenecks in optimizing the GPU implementation have been less investigated. This study puts forward a fully optimized GPU-based implementation to accelerate SPH simulations. To this end, several GPU optimization strategies are developed to speed up the implementation: splitting CUDA kernels, using fast instructions, simplifying data arrays, using texture memory, and unrolling loops. The resulting parallel-optimized GPU framework can also be applied to implement other SPH-based algorithms. The performance results show that these optimization strategies greatly improve computational performance on graphics devices: the fully optimized GPU implementation is on average 3 times faster than the basic GPU implementation without optimizations. In addition, running the SPH code on an advanced GPU yields a speedup of 3338x over its serial version. The same simulation run on a rather outdated GPU is still 693 times as efficient as the implementation on a mainstream single CPU.
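To illustrate why SPH is costly and why it maps well to a GPU, the following is a minimal Python sketch of a brute-force SPH density summation. It is not the authors' implementation: the 2D cubic spline kernel, the O(N^2) pair loop, and the names `cubic_spline_2d`, `density_summation`, and `lattice` are assumptions chosen for illustration. Each pass of the outer loop is independent of the others, which is the kind of per-particle work a GPU implementation would typically assign to one thread.

```python
import math


def cubic_spline_2d(r, h):
    """Standard 2D cubic spline kernel with support radius 2h (illustrative choice)."""
    q = r / h
    sigma = 10.0 / (7.0 * math.pi * h * h)  # 2D normalization constant
    if q < 1.0:
        return sigma * (1.0 - 1.5 * q * q + 0.75 * q ** 3)
    if q < 2.0:
        return sigma * 0.25 * (2.0 - q) ** 3
    return 0.0


def density_summation(positions, mass, h):
    """Brute-force density sum: rho_i = sum_j m * W(|x_i - x_j|, h)."""
    densities = []
    for xi, yi in positions:  # independent per particle, hence parallelizable
        rho = 0.0
        for xj, yj in positions:  # O(N) work per particle without a neighbor list
            rho += mass * cubic_spline_2d(math.hypot(xi - xj, yi - yj), h)
        densities.append(rho)
    return densities


def lattice(n, dx):
    """Hypothetical helper: n x n particles on a square lattice with spacing dx."""
    return [(i * dx, j * dx) for i in range(n) for j in range(n)]
```

As a usage example, for `lattice(7, 0.1)` with particle mass `1000.0 * 0.1 * 0.1` and `h = 0.13`, the density at the central particle comes out close to the rest density of 1000, while particles at the corners see fewer neighbors and therefore a lower value.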