Scheduling Multi-Server Jobs is Not Easy

Rahul Vaze
2024-07-01
Abstract:The problem of online scheduling of multi-server jobs is considered, where there are a total of $K$ servers, and each job requires concurrent service from multiple servers for it to be processed. Each job on its arrival reveals its processing time, the number of servers from which it needs concurrent service and an online algorithm has to make scheduling decisions using only causal information, with the goal of minimizing the response/flow time. The worst case input model is considered and the performance metric is the competitive ratio. For the case, when all job processing time (sizes) are the same, we show that the competitive ratio of any deterministic/randomized algorithm is at least $\Omega(K)$ and propose an online algorithm whose competitive ratio is at most $K+1$. With equal job sizes, we also consider the resource augmentation regime where an online algorithm has access to more servers than an optimal offline algorithm. With resource augmentation, we propose a simple algorithm and show that it has a competitive ratio of $1$ when provided with $2K$ servers with respect to an optimal offline algorithm with $K$ servers. With unequal job sizes, we propose an online algorithm whose competitive ratio is at most $2K \log (K w_{\max})$, where $w_{\max}$ is the maximum size of any job.
Data Structures and Algorithms,Information Theory
What problem does this paper attempt to address?
The paper primarily addresses the problem of online scheduling for multi-server jobs, where each job requires service from multiple servers simultaneously to be processed. Specifically, the paper considers how to efficiently schedule these multi-server jobs in the worst-case input model to minimize response time (i.e., the time from job arrival to completion). The main contributions of the paper include: 1. **Theoretical Lower Bound**: The paper proves that for the case where all jobs have the same processing time, the competitive ratio of any deterministic or randomized online scheduling algorithm is at least \( \Omega(K) \), where \( K \) is the total number of servers. This demonstrates the inherent difficulty of the problem. 2. **New Algorithm RA**: For the scenario where all job sizes are the same, a new online algorithm RA is proposed, with a competitive ratio of at most \( K+1 \). The main idea of algorithm RA is to select as many jobs as possible that can be processed together in each time slot and ensure that all servers are occupied (if possible). 3. **Different Job Sizes**: When job sizes vary, the paper proposes a natural extension of the RA algorithm, which has a competitive ratio of at most \( 2K\log(Kw_{\text{max}}) \), where \( w_{\text{max}} \) is the maximum size of any job. 4. **Resource Augmentation**: Considering that the competitive ratio of any deterministic or randomized algorithm is at least \( \Omega(K) \), the paper also explores resource augmentation schemes, where an online algorithm can access more resources (e.g., number of servers) than the optimal offline algorithm. For the scenario where all job sizes are the same, the paper proposes a simple online algorithm that, when given \( 2K \) servers, has a competitive ratio of 1 relative to the optimal offline algorithm with \( K \) servers. This means that to achieve the same performance as the optimal offline algorithm, only twice the number of servers need to be deployed. Through these contributions, the paper not only highlights the inherent challenges of the multi-server job scheduling problem but also proposes effective algorithms to address these challenges.