RPC: Joint Online Reducer Placement and Coflow Bandwidth Scheduling for Clusters

Yangming Zhao, Chen Tian, Jingyuan Fan, Tong Guan, Xiaoning Zhang, Chunming Qiao
DOI: https://doi.org/10.1109/icnp.2018.00028
2020-01-01
IEEE/ACM Transactions on Networking
Abstract: Reducing Coflow Completion Time (CCT) has a significant impact on application performance in data-parallel frameworks. Most existing works assume that the endpoints of constituent flows in each coflow are predetermined. We argue that CCT can be further optimized by treating flows' destinations as an additional optimization dimension via reducer placement. In this paper, we propose and implement RPC, a joint online Reducer Placement and Coflow bandwidth scheduling framework, to minimize the average CCT in cloud clusters. We first develop a 2-approximation algorithm to minimize the CCT of a single coflow, then schedule all the coflows following the Shortest Remaining Time First (SRTF) principle. We use a real testbed implementation and extensive large-scale simulations to demonstrate that RPC can reduce the average CCT by 64.98% compared with state-of-the-art technologies.
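To illustrate the SRTF principle the abstract mentions, here is a minimal sketch, not RPC's actual scheduler. It assumes each coflow can be treated as a single job with a known remaining time on one shared resource, and it ignores reducer placement and per-port bandwidth entirely. The function name `srtf_average_cct` and the `(arrival_time, duration)` input format are hypothetical, chosen only for this example.

```python
import heapq

def srtf_average_cct(coflows):
    """Average completion time under preemptive SRTF.

    coflows: list of (arrival_time, duration) pairs (hypothetical format).
    Simplification: one coflow is served at a time on a single resource.
    """
    if not coflows:
        return 0.0
    pending = sorted(coflows)      # process arrivals in time order
    heap = []                      # entries are (remaining_time, arrival_time)
    now, total, i, n = 0.0, 0.0, 0, len(pending)
    while i < n or heap:
        if not heap:
            # Idle until the next coflow arrives.
            now = max(now, pending[i][0])
        # Admit every coflow that has arrived by now.
        while i < n and pending[i][0] <= now:
            arrival, duration = pending[i]
            heapq.heappush(heap, (duration, arrival))
            i += 1
        # Serve the coflow with the shortest remaining time.
        remaining, arrival = heapq.heappop(heap)
        next_arrival = pending[i][0] if i < n else float("inf")
        run = min(remaining, next_arrival - now)
        now += run
        if run < remaining:
            # Preempted by a new arrival: requeue with updated remaining time.
            heapq.heappush(heap, (remaining - run, arrival))
        else:
            total += now - arrival  # completion time of this coflow
    return total / n

print(srtf_average_cct([(0, 8), (1, 2), (2, 1)]))
```

In this toy input, the two short coflows that arrive later preempt the long one, which is why SRTF tends to lower the average completion time relative to serving coflows in arrival order.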