Floodgate: taming incast in datacenter networks
Kexin Liu,Chen Tian,Qingyue Wang,Hao Zheng,Peiwen Yu,Wenhao Sun,Yonghui Xu,Ke Meng,Lei Han,Jie Fu,Wanchun Dou,Guihai Chen
DOI: https://doi.org/10.1145/3485983.3494854
2021-01-01
Abstract:ABSTRACTIncast occurs frequently in datacenter networks where a large number of senders send data to a single receiver simultaneously, which makes the last hop the network bottleneck. Incast can hurt flows' performance. However, congestion control protocols are not effective at handling incast. One key insight is that it is too late to handle incast packets after they have already piled up at the last hop. Instead, we should avoid incast as early as possible. Inspired by flood control in Hydrologic Engineering, we propose Floodgate, a novel switch-based per-hop flow control to handle incast. Floodgate is compatible with existing congestion control protocols. We integrate it with practical congestion control approaches such as DCQCN, TIMELY, and HPCC. We evaluate Floodgate both in our implementations and large-scale simulations. Compared with state of the art, Floodgate reduces the buffer occupancy by a factor of 6.6x, as well as the queuing delay. Therefore, the average FCT and tail latency are greatly reduced.