The impact of load comparison errors on the power-of-d load balancing

Sanidhay Bhambay, Arpan Mukhopadhyay, Thirupathaiah Vasantam
DOI: https://doi.org/10.1016/j.peva.2024.102408
IF: 2.205
2024-05-01
Performance Evaluation
Abstract: We consider a system with n unit-rate servers where jobs arrive according to a Poisson process with rate nλ (λ < 1). In the standard Power-of-d or Pod scheme with d ≥ 2, for each incoming job, a dispatcher samples d servers uniformly at random and sends the incoming job to the least loaded of the d sampled servers. However, in practice, load comparisons may not always be accurate. In this paper, we analyse the effects of noisy load comparisons on the performance of the Pod scheme. To test the robustness of the Pod scheme against load comparison errors, we assume an adversarial setting where, in the event of an error, the adversary assigns the incoming job to the worst possible server, i.e., the server with the maximum load among the d sampled servers. We consider two error models: load-dependent and load-independent errors. In the load-dependent error model, the adversary has limited power in that it is able to cause an error with probability ϵ ∈ [0, 1] only when the difference between the minimum and the maximum queue lengths of the d sampled servers is bounded by a constant threshold g ≥ 0. For this type of error, we show that, in the large system limit, the benefits of the Pod scheme are retained even if g and ϵ are arbitrarily large, as long as the system is heavily loaded, i.e., λ is close to 1. In the load-independent error model, the adversary is assumed to be more powerful in that it can cause an error with probability ϵ independent of the loads of the sampled servers. For this model, we show that the performance benefits of the Pod scheme are retained only if ϵ ≤ 1/d; for ϵ > 1/d, we show that the stability region of the system shrinks and the system performs poorly in comparison to the random scheme. Our mean-field analysis uses a new approach to characterise fixed points which neither have closed-form solutions nor admit any recursion. Furthermore, we develop a generic approach to prove tightness and stability for any state-dependent load balancing scheme.
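The two error models described in the abstract can be explored numerically with a small simulation. The following is a minimal sketch, not the authors' method: it assumes exponential service times (the abstract only states that servers are unit-rate), sampling of the d servers without replacement, and a uniformized event loop in which every event is an arrival with probability λ/(1 + λ) and otherwise a potential departure at a uniformly chosen server. The function name simulate_pod and all of its parameters are hypothetical. Passing g=None gives the load-independent model; passing a number g gives the load-dependent model, where an error is only possible when the gap between the maximum and minimum sampled queue lengths is at most g.

```python
import random


def simulate_pod(n, lam, d, eps, g=None, events=200_000, seed=0):
    """Estimate the mean queue length per server under Pod with noisy comparisons.

    Assumptions (not taken from the paper): exponential unit-rate service,
    sampling without replacement, and a run that starts from an empty system.
    """
    rng = random.Random(seed)
    queues = [0] * n
    total_jobs = 0   # jobs currently in the system
    area = 0         # sum of total_jobs over all events
    # Uniformization: arrivals occur at rate n*lam and potential departures
    # at rate n, so each event is an arrival with probability lam / (1 + lam).
    p_arrival = lam / (1.0 + lam)
    for _ in range(events):
        if rng.random() < p_arrival:
            sampled = rng.sample(range(n), d)
            lo = min(sampled, key=queues.__getitem__)  # least loaded
            hi = max(sampled, key=queues.__getitem__)  # most loaded
            # Load-independent model: an error is always possible.
            # Load-dependent model: only when the gap is at most g.
            can_err = g is None or queues[hi] - queues[lo] <= g
            target = hi if (can_err and rng.random() < eps) else lo
            queues[target] += 1
            total_jobs += 1
        else:
            s = rng.randrange(n)
            if queues[s] > 0:  # otherwise a null event at an idle server
                queues[s] -= 1
                total_jobs -= 1
        area += total_jobs
    # Events occur at a constant rate, so the per-event average
    # approximates the time average.
    return area / (events * n)


if __name__ == "__main__":
    for eps in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(eps, simulate_pod(n=20, lam=0.7, d=2, eps=eps))
```

With d = 1 the sketch reduces to the random scheme, which gives a baseline for comparison. Finite n and a finite run only approximate the large-system limit studied in the paper, so the output should be read as qualitative.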
computer science, theory & methods, hardware & architecture