Abstract:In this work, we are interested in the problem of satisfying multiple concurrent requests submitted to a computing server. Informally, there are users each sending a sequence of requests to the server. The requests consist of tasks linked by precedence constraints. Tasks may occur several times in the same sequence as well as in a request sequence of another user. The computing server has to execute tasks with variable processing times. The server owns a cache of limited size where intermediate results of the processing may be stored. If an intermediate result for a task is stored into the cache, no processing cost has to be paid and the result can directly be fetched from the cache. The goal of this work is to determine a schedule of the tasks such that an optimization function is minimized (the only objective studied up to now is the make span). This problem is a variant of caching which considers only one sequence of requests. We then extend the study to the minimization of the mean completion time of the request sequences. Two models are considered. In the first model, caching is forced whereas in the second model caching is optional and one can choose whether an intermediate result is stored in the cache or not. All combinations turn out to be NP-hard for fixed cache sizes and we provide a formulation as dynamic program as well as bounds for in approximation. We propose polynomial time approximation algorithms for some variants and analyze their approximation ratios. Finally, we also devise some heuristics and present experimental results. Tasks may occur several times in the same sequence as well as in a request sequence of another user. The computing server has to execute tasks with variable processing times. The server owns a cache of limited size where intermediate results of the processing may be stored. If an intermediate result for a task is stored into the cache, no processing cost has to be paid and the result can directly be fetched from the cache. The goal of this work is to determine a schedule of the tasks such that an optimization function is minimized (the only objective studied up to now is the make span). This problem is a variant of caching which considers only one sequence of requests. We then extend the study to the minimization of the mean completion time of the request sequences. Two models are considered. In the first model, caching is forced whereas in the second model caching is optional and one can choose whether an intermediate result is stored in the cache or not. All combinations turn out to be NP-hard for fixed cache sizes and we provide a formulation as dynamic program as well as bounds for in approximation. We propose polynomial time approximation algorithms for some variants and analyze their approximation ratios. Finally, we also devise some heuristics and present experimental results.

Offline Scheduling of Multi-threaded Request Streams on a Caching Server.

A (32+Ε)-Approximation Algorithm for Scheduling on Two Parallel Machines with Job Delivery Coordination.

Some Results on Resource Constrained Scheduling

Single Machine Batch Scheduling to Minimize the Sum of Total Flow Time and Batch Delivery Cost with an Unavailability Interval

An Online Scheduling Problem with Job Set-ups

Co-Optimizing Cache Partitioning and Multi-Core Task Scheduling: Exploit Cache Sensitivity or Not?

Delay-Optimal Edge Cache Replacement with Non-Markovian Content Fetching

Distributed Packet Forwarding and Caching Based on Stochastic Network Utility Maximization

Optimal Scheduling in Asynchronous Coded Caching

DR-Cache: Distributed Resilient Caching with Latency Guarantees.

Scheduling Parallelizable Jobs Online to Minimize the Maximum Flow Time

Online Flexible Job Scheduling for Minimum Span

Server Cloud Scheduling

Cost-aware demand scheduling for delay tolerant applications

Network cache design under stationary requests: Exact analysis and Poisson approximation

An Efficient Scheduling Algorithm for Stream Computing.

Minimizing the makespan on two parallel machines with a common server in charge of loading and unloading operations

Near-Optimal Scheduling Mechanisms for Deadline-Sensitive Jobs in Large Computing Clusters

Single-Machine Scheduling Problems with Variable Processing Times and Past-Sequence-Dependent Delivery Times

Minimize the Make-span of Batched Requests for FPGA Pooling in Cloud Computing

Constraint Programming and Constructive Heuristics for Parallel Machine Scheduling with Sequence-Dependent Setups and Common Servers