A Heterogeneous CPU-GPU Implementation for Discrete Elements Simulation with Multiple GPUs.

Yuan Tian, Ji Qi, Junjie Lai, Qingguo Zhou, Lei Yang
DOI: https://doi.org/10.1109/icawst.2013.6765500
2013-01-01
Abstract: To handle the large number of particles in discrete element simulations, a heterogeneous CPU-GPU implementation with multiple GPUs is developed. The implementation combines two different parallel programming languages so that it can be deployed on a CPU-GPU cluster. Communication between nodes uses a Message Passing Interface (MPI) implementation for dynamic domain decomposition, particle re-mapping, and copying of data in overlapping areas. All other work is assigned to the GPUs to obtain high computational speed. The results of strong and weak scalability tests are analyzed for different numbers of GPUs. Finally, LAMMPS is used as a CPU platform for comparison with the multi-GPU application, to demonstrate the advantage of the heterogeneous implementation.
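The three communication tasks named in the abstract (dynamic domain decomposition, particle re-mapping, and copying of overlapping areas) can be illustrated with a minimal single-process sketch. It is not the authors' implementation: it assumes a one-dimensional slab decomposition, at least as many particles as ranks, and a hypothetical interaction cutoff, and it uses plain Python lists in place of real MPI messages and GPU kernels. All function names (`rebalance`, `owner_rank`, `remap`, `overlap_copies`) are made up for illustration.

```python
from typing import List


def rebalance(positions: List[float], n_ranks: int, lo: float, hi: float) -> List[float]:
    """Dynamic decomposition (assumed 1D): place slab boundaries so that
    each rank owns roughly the same number of particles.
    Assumes len(positions) >= n_ranks."""
    xs = sorted(positions)
    bounds = [lo]
    for r in range(1, n_ranks):
        k = r * len(xs) // n_ranks
        # Put the boundary midway between the two particles it separates.
        bounds.append(0.5 * (xs[k - 1] + xs[k]))
    bounds.append(hi)
    return bounds


def owner_rank(x: float, bounds: List[float]) -> int:
    """Return the rank whose slab [bounds[r], bounds[r+1]) contains x."""
    for r in range(len(bounds) - 1):
        if bounds[r] <= x < bounds[r + 1]:
            return r
    raise ValueError(f"position {x} lies outside the simulation domain")


def remap(positions: List[float], bounds: List[float]) -> List[List[int]]:
    """Particle re-mapping: list the particle indices owned by each rank."""
    owned: List[List[int]] = [[] for _ in range(len(bounds) - 1)]
    for i, x in enumerate(positions):
        owned[owner_rank(x, bounds)].append(i)
    return owned


def overlap_copies(positions: List[float], bounds: List[float], cutoff: float) -> List[List[int]]:
    """Overlap copying: for each rank, list the particles owned by a
    neighbouring rank that lie within `cutoff` of the shared boundary.
    In a real cluster code these would be sent to the neighbour via MPI."""
    n = len(bounds) - 1
    ghosts: List[List[int]] = [[] for _ in range(n)]
    for i, x in enumerate(positions):
        r = owner_rank(x, bounds)
        if r > 0 and x - bounds[r] < cutoff:
            ghosts[r - 1].append(i)  # close to the left boundary
        if r < n - 1 and bounds[r + 1] - x < cutoff:
            ghosts[r + 1].append(i)  # close to the right boundary
    return ghosts


if __name__ == "__main__":
    pos = [0.1, 0.2, 0.3, 0.45, 0.55, 0.9]
    b = rebalance(pos, n_ranks=2, lo=0.0, hi=1.0)
    print("bounds:", b)
    print("owned: ", remap(pos, b))
    print("ghosts:", overlap_copies(pos, b, cutoff=0.1))
```

In a cluster setting, each rank would hold only its own slab, exchange the ghost lists with its neighbours through MPI point-to-point calls, and hand the per-particle force computation to its GPU; the sketch only shows the bookkeeping that decides which particle goes where.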