An Effective Loop Scheduling Algorithm for OpenMP on Cluster Systems
吴少刚,章隆兵,蔡飞,胡伟武
2004-01-01
Abstract: OpenMP provides a standardized interface for shared memory parallel programming across a wide range of shared memory architectures. It is widely accepted because of its good programmability and its support for incremental parallelization of sequential programs. The shared memory programming model assumes uniform memory access times across the entire address space. Unfortunately, this assumption does not hold for the software DSM systems on which OpenMP is implemented for clusters. On a cluster, the owner-computing principle is important for achieving high performance: data and computations should be placed so that the data accessed by each process is local to the processor on which that process runs. Many loop scheduling mechanisms in OpenMP consider only load balancing, not data locality. A locality-based scheduling (LBS) algorithm is proposed that uses data distribution information to guide loop scheduling according to the owner-computing principle. It is implemented through OpenMP language extensions on the OpenMP/JIAJIA system. The experimental results show that LBS is an effective loop scheduling algorithm for OpenMP on clusters.
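The owner-computing idea behind locality-based scheduling can be illustrated with a minimal sketch in C. It is not the paper's LBS algorithm and does not use the OpenMP/JIAJIA extension syntax, neither of which is given in the abstract. It assumes a simple block distribution of an n-element array over p nodes, and the names `block_owner`, `lbs_range`, `remote_cyclic`, and `remote_lbs` are hypothetical helpers introduced only for this illustration. The sketch compares how many iterations touch non-local data when iterations are dealt out round-robin (as with `schedule(static,1)`) versus when each node runs only the iterations whose data it owns.

```c
#include <assert.h>
#include <stddef.h>

/* Assumption: element i of an n-element array lives on node i / ceil(n/p)
 * (plain block distribution). A real DSM may distribute pages differently. */
int block_owner(size_t i, size_t n, int p) {
    assert(p > 0);
    size_t chunk = (n + (size_t)p - 1) / (size_t)p; /* ceil(n / p) */
    return (int)(i / chunk);
}

/* Owner-computing schedule: node `me` gets the half-open iteration range
 * [*lo, *hi) that covers exactly the elements it owns (possibly empty). */
void lbs_range(size_t n, int p, int me, size_t *lo, size_t *hi) {
    assert(p > 0 && me >= 0 && me < p);
    size_t chunk = (n + (size_t)p - 1) / (size_t)p;
    size_t start = (size_t)me * chunk;
    size_t end = start + chunk;
    if (start > n) start = n;
    if (end > n) end = n;
    *lo = start;
    *hi = end;
}

/* Count iterations that touch remote data when iteration i runs on
 * node i % p, ignoring where the data actually lives. */
size_t remote_cyclic(size_t n, int p) {
    size_t remote = 0;
    for (size_t i = 0; i < n; i++) {
        int runner = (int)(i % (size_t)p);
        if (runner != block_owner(i, n, p)) remote++;
    }
    return remote;
}

/* Same count under the owner-computing schedule from lbs_range. */
size_t remote_lbs(size_t n, int p) {
    size_t remote = 0;
    for (int me = 0; me < p; me++) {
        size_t lo, hi;
        lbs_range(n, p, me, &lo, &hi);
        for (size_t i = lo; i < hi; i++)
            if (block_owner(i, n, p) != me) remote++;
    }
    return remote;
}
```

With 8 elements on 4 nodes, the round-robin schedule leaves 6 of the 8 iterations working on remote data, while the owner-computing ranges leave none. The trade-off is that an owner-driven schedule balances load only as well as the data distribution does.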