Parallelism for cryo-EM 3D reconstruction on CPU-GPU heterogeneous system

Xingjian Li, Linchuan Li, Guangming Tan, Peiheng Zhang
DOI: https://doi.org/10.13245/j.hust.2011.s1.003
2011-01-01
Abstract: Efficiently exploiting massive parallelism in both applications and architectures is a challenge on heterogeneous systems. This paper presents a case study in accelerating a cryo-EM 3D reconstruction program, showing how to exploit and orchestrate application-level parallelism so that it takes advantage of the parallelism exposed at the architecture level. All available parallelism in cryo-EM 3D reconstruction is exploited, and a self-adaptive dynamic scheduling algorithm maps that parallelism from the application onto the architecture. Experiments on a 32-node partition of the Dawning Nebulae system confirm that hierarchical parallelism is an efficient parallel programming pattern for using the capabilities of both CPUs and GPUs on a heterogeneous system. The hybrid CPU-GPU program improves performance by a factor of 2.4 over the best CPU-only version for certain problem sizes.
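The abstract does not describe the scheduling algorithm itself, so the following is only a generic sketch of what self-adaptive dynamic scheduling between a CPU worker and a GPU worker can look like, not the authors' method. It is a deterministic simulation: the function name `adaptive_schedule`, the `speeds` table, and `base_chunk` are all hypothetical. Each idle worker takes a chunk from a shared task pool, and the chunk size grows in proportion to the throughput that worker was observed to deliver on its previous chunk.

```python
import heapq


def adaptive_schedule(num_tasks, speeds, base_chunk=4):
    """Simulate self-adaptive dynamic scheduling over a shared task pool.

    speeds     -- hypothetical dict: worker name -> tasks per unit time
    base_chunk -- hypothetical chunk size handed to an unmeasured worker
    Returns (makespan, tasks completed per worker).
    """
    done = {w: 0 for w in speeds}
    observed = {w: None for w in speeds}  # throughput seen on the last chunk
    # Min-heap of (time at which the worker becomes idle, worker name).
    idle = [(0.0, w) for w in sorted(speeds)]
    heapq.heapify(idle)
    remaining = num_tasks
    makespan = 0.0

    while remaining > 0:
        now, w = heapq.heappop(idle)  # next worker to become idle
        known = [r for r in observed.values() if r is not None]
        if observed[w] is None:
            chunk = base_chunk  # no measurement yet: probe with a small chunk
        else:
            # Scale the chunk by this worker's throughput relative to the
            # slowest worker measured so far.
            chunk = max(1, round(base_chunk * observed[w] / min(known)))
        chunk = min(chunk, remaining)
        remaining -= chunk

        # In a real run, elapsed would come from a timer around the chunk;
        # here it is derived from the assumed speed table.
        elapsed = chunk / speeds[w]
        observed[w] = chunk / elapsed
        done[w] += chunk

        finish = now + elapsed
        makespan = max(makespan, finish)
        heapq.heappush(idle, (finish, w))

    return makespan, done
```

Calling `adaptive_schedule(100, {"cpu": 1.0, "gpu": 3.0})` under these assumed speeds gives the faster worker larger chunks after its first measurement, so both workers tend to finish at about the same time. The speed values here are illustrative only and are not taken from the paper.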