Macroflow: A Fine-Grained Networking Abstraction for Job Completion Time Oriented Scheduling in Datacenters

Chen Tian,Junhua Yan,Alex X. Liu,Yizhou Tang,Yuankun Zhong,Zi Li
DOI: https://doi.org/10.1109/icnp.2016.7784473
2016-01-01
Abstract:For a datacenter running a data-parallel analytic framework, minimizing job completion time (JCT) is crucial for application performance. The key observation is that JCT could be improved, if network scheduling can exploit the opportunity of decreasing the amount of occupied machine slot-time spend on communication. We propose Macroflow, a networking abstraction that captures the primitive resource granularity of data-parallel frameworks. We study the inter-macroflow scheduling problem for decreasing application JCT. We propose the Smallest-Macroflow-First (SMF) and Smallest-Average-Macroflow-First (SAMF) heuristics that greedily schedule macroflows based on their network footprint. Trace-driven simulations demonstrate that our algorithms can reduce the average and tail JCT of network-intensive jobs by up to 20% and 25%, respectively; at the same time, the throughput of computation-intensive jobs is increased by up to 2.2×.
What problem does this paper attempt to address?