Parallelization and Optimization of Laser-Plasma-Interaction Simulation

Haipeng WU, Minhua WEN, Simon SEE, James LIN
DOI: https://doi.org/10.3778/j.issn.1673-9418.1611092
2018-01-01
Abstract: Progress in generating intense ultra-short laser pulses increasingly demands kinetic descriptions of how such pulses interact with plasmas. The particle-in-cell (PIC) algorithm is widely used in plasma physics to study the trajectories of charged particles in electromagnetic fields. Although several GPU implementations of the PIC algorithm exist, some important issues specific to laser-plasma-interaction simulation still need to be clarified in detail. This paper introduces a way to convert the original CPU laser-plasma-interaction code into a parameterized, adaptive GPU implementation with the whole algorithm ported. It then develops a series of methods to speed up the particle scatter phase: a dynamic duplication algorithm, mixed-precision computing, and a parameterized particle sorting algorithm. Furthermore, this paper uses the GPUDirect RDMA (remote direct memory access) technique on a Kepler cluster and evaluates how it benefits MPI communication performance. The results of the numerical experiment show that, for the key "Scatter" phase, these optimizations produce a 6.1x speed-up over the initial GPU version using the same number of GPUs. The speed-up for the MPI communication part is 2.8x when the message size is over 3 KB. These findings demonstrate that optimizations tailored to the features of the simulation and of modern GPU clusters are essential for achieving significantly improved performance.
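The abstract does not describe how the scatter-phase optimizations are implemented, so the following is only a minimal sketch of the general idea behind sorting particles before the scatter step, not the paper's parameterized sorting algorithm. It assumes nearest-grid-point charge deposition on a 1D grid (real PIC codes typically use higher-order shape functions), and every function name here is hypothetical. Once particles are sorted by cell index, each cell owns a contiguous segment of the particle array, so its deposit becomes an independent reduction rather than a series of conflicting updates to shared grid memory.

```python
def scatter_unsorted(cells, charges, n_cells):
    """Deposit each particle's charge onto its cell, in arrival order."""
    grid = [0.0] * n_cells
    for c, q in zip(cells, charges):
        # On a GPU, many threads may hit the same cell here, which
        # would require atomic adds and can serialize execution.
        grid[c] += q
    return grid


def sort_by_cell(cells, charges, n_cells):
    """Counting sort of particles by cell index (stable).

    Returns the sorted cell indices, the sorted charges, and an offsets
    list where cell i owns the particle range offsets[i]:offsets[i + 1].
    """
    counts = [0] * n_cells
    for c in cells:
        counts[c] += 1

    # Exclusive prefix sum of the counts gives each cell's start offset.
    offsets = [0] * (n_cells + 1)
    for i in range(n_cells):
        offsets[i + 1] = offsets[i] + counts[i]

    cursor = offsets[:-1].copy()  # next free slot per cell
    sorted_cells = [0] * len(cells)
    sorted_charges = [0.0] * len(cells)
    for c, q in zip(cells, charges):
        sorted_cells[cursor[c]] = c
        sorted_charges[cursor[c]] = q
        cursor[c] += 1
    return sorted_cells, sorted_charges, offsets


def scatter_sorted(sorted_charges, offsets):
    """One independent reduction per cell over its contiguous segment."""
    return [
        sum(sorted_charges[offsets[i]:offsets[i + 1]])
        for i in range(len(offsets) - 1)
    ]


if __name__ == "__main__":
    cells = [2, 0, 2, 1, 0]              # hypothetical cell index per particle
    charges = [1.0, 0.5, 2.0, 0.25, 0.5]  # hypothetical charge per particle
    _, sorted_charges, offsets = sort_by_cell(cells, charges, 3)
    print(scatter_unsorted(cells, charges, 3))
    print(scatter_sorted(sorted_charges, offsets))
```

Both routines deposit the same total charge per cell; the sorted variant trades an extra sorting pass for conflict-free per-cell reductions, a trade-off whose payoff depends on particle density and on how often particles change cells.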