Spread-n-share

Xiongchao Tang,Haojie Wang,Xiaosong Ma,Nosayba El-Sayed,Jidong Zhai,Wenguang Chen,Ashraf Aboulnaga
DOI: https://doi.org/10.1145/3295500.3356152
2019-01-01
Abstract:Traditional batch job schedulers adopt the Compact-n-Exclusive (CE) strategy, packing processes of a parallel job into as few compute nodes as possible. While CE minimizes inter-node network communication, it often brings self-contention among tasks of a resource-intensive application. Recent studies have used virtual containers to balance CPU utilization and memory capacity across physical nodes, but the imbalance in cache and memory bandwidth usage is still under-investigated.
What problem does this paper attempt to address?