Providing Bandwidth Guarantees, Work Conservation and Low Latency Simultaneously in the Cloud

Shuihai Hu,Wei Bai,Kai Chen,Chen Tian,Ying Zhang,Haitao Wu
DOI: https://doi.org/10.1109/tcc.2018.2890252
IF: 5.697
2021-04-01
IEEE Transactions on Cloud Computing
Abstract:Today’s cloud is shared among multiple tenants running different applications, and a desirable multi-tenant datacenter network infrastructure should provide bandwidth guarantees for throughput-intensive applications, low latency for latency-sensitive short messages, as well as work conservation to fully utilize the network bandwidth. Despite significant efforts in recent years, none of them can achieve these three properties simultaneously. In this paper, we identify the key deficiency of prior solutions and use this insight to motivate our design of $\mathsf{Trinity}$Trinity—a simple, practical yet effective solution that achieves bandwidth guarantees, work conservation and low latency simultaneously in the cloud. We implement $\mathsf{Trinity}$Trinity using existing commodity hardwares and demonstrate its superior performance over prior solutions using testbed experiments.
computer science, information systems, theory & methods
What problem does this paper attempt to address?