Introducing Molly: Distributed Memory Parallelization with LLVM

Michael Kruse
DOI: https://doi.org/10.48550/arXiv.1409.2088
2014-09-07
Abstract:Programming for distributed memory machines has always been a tedious task, but necessary because compilers have not been sufficiently able to optimize for such machines themselves. Molly is an extension to the LLVM compiler toolchain that is able to distribute and reorganize workload and data if the program is organized in statically determined loop control-flows. These are represented as polyhedral integer-point sets that allow program transformations applied on them. Memory distribution and layout can be declared by the programmer as needed and the necessary asynchronous MPI communication is generated automatically. The primary motivation is to run Lattice QCD simulations on IBM Blue Gene/Q supercomputers, but since the implementation is not yet completed, this paper shows the capabilities on Conway's Game of Life.
Programming Languages,Distributed, Parallel, and Cluster Computing
What problem does this paper attempt to address?