Reducing Tail Latency in Proactive Congestion Control Via Moderate Speculation

Dezun Dong, Ke Wu
DOI: https://doi.org/10.1109/hpcc-smartcity-dss50907.2020.00051
2020-01-01
Abstract: Congestion is a bottleneck in HPC networks and seriously degrades system performance. To suit HPC traffic, which is dominated by short messages (flows), and to achieve a lower average flow delay, proactive congestion control methods give speculative packets higher priority, so that short flows are scheduled first in the network. As a result, however, long flows are blocked by short flows, which seriously harms tail latency. Existing proactive congestion control methods struggle to account for both average flow delay and tail latency. In this paper, we combine the advantages of proactive and reactive congestion control and propose the Less-Tail Component (LTC), which effectively reduces tail latency without increasing the average flow delay, and can further reduce the average flow delay under uniform traffic. LTC suspends all high-priority short flows when speculative packets are congested in the network. We leave the short-flow priority mechanism unchanged in order to preserve the average flow delay, and suspend short flows only occasionally and temporarily to reach a new dynamic balance between short and long flows. LTC curbs excessive speculation by short flows, improves the treatment of long flows, and reduces tail latency. We conducted extensive experiments to evaluate LTC and compared it with PCRP, the latest proactive reservation-based protocol. The simulation results show that, with our design, tail latency under uniform traffic can be reduced by up to 21.12%, and the average flow delay can be reduced by up to 18.97%.