Reinforcement Learning for Optimal Load Distribution Sequencing in Resource-Sharing System

Fei Wu, Yang Cao, Thomas Robertazzi
DOI: https://doi.org/10.48550/arXiv.1902.01899
2019-02-06
Abstract: Divisible Load Theory (DLT) is a powerful tool for modeling divisible load problems in data-intensive systems. This paper studies an optimal divisible load distribution sequencing problem using a machine learning framework. The problem is to decide the optimal sequence in which to distribute divisible load to processors in order to achieve the minimum finishing time. The scheduling is performed in a resource-sharing system where each physical processor is virtualized into multiple virtual processors. A reinforcement learning method called multi-armed bandit (MAB) is used for our problem. We first provide a naive solution using the MAB algorithm, and then several optimizations are performed. Various numerical tests are conducted. Our algorithm shows improving performance as training progresses, and the global optimum will be achieved when the sample size is large enough.
Distributed, Parallel, and Cluster Computing; Machine Learning
What problem does this paper attempt to address?
This paper seeks the optimal sequence for distributing a divisible load in a resource-sharing system so that the finishing time is minimized. Specifically, it asks in what order load should be sent to the processors of a heterogeneous system to shorten the finishing time. The setting is a virtualized environment: each physical processor is virtualized into multiple virtual processors, and the processing and communication speeds of the processors change over time. To address this problem, the authors propose a reinforcement learning method based on the multi-armed bandit (MAB), using the Thompson sampling algorithm to balance exploration and exploitation and thereby gradually learn the optimal load distribution sequence.
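To make the exploration/exploitation idea concrete, here is a minimal sketch of Thompson sampling over distribution sequences. It is not the authors' algorithm: treating each permutation of processors as one arm, the Gaussian posterior with unit variance, the equal-share finishing-time model, and the `COMM`/`COMP` values are all assumptions chosen for illustration.

```python
import itertools
import random

# Hypothetical parameters (not from the paper):
# COMM[i] = time to send one load share to processor i,
# COMP[i] = time for processor i to process its share.
COMM = [1.0, 2.0, 4.0]
COMP = [6.0, 3.0, 1.0]


def finish_time(order, rng, noise=0.05):
    """Toy model: equal shares are sent one after another in `order`;
    each processor starts computing once its share has arrived.
    Multiplicative noise stands in for time-varying speeds."""
    clock, finish = 0.0, 0.0
    for i in order:
        clock += COMM[i] * max(0.1, 1.0 + rng.gauss(0.0, noise))
        done = clock + COMP[i] * max(0.1, 1.0 + rng.gauss(0.0, noise))
        finish = max(finish, done)
    return finish


def thompson_sampling(rounds=2000, seed=0):
    rng = random.Random(seed)
    # Assumption: one arm per candidate distribution sequence.
    arms = list(itertools.permutations(range(len(COMM))))
    counts = [0] * len(arms)
    means = [0.0] * len(arms)  # running mean of reward = -finish_time
    for _ in range(rounds):
        # Draw one sample per arm from a Gaussian posterior whose
        # spread shrinks as the arm is pulled more often.
        samples = [
            rng.gauss(means[a], 1.0 / (counts[a] + 1) ** 0.5)
            for a in range(len(arms))
        ]
        a = max(range(len(arms)), key=samples.__getitem__)
        reward = -finish_time(arms[a], rng)
        counts[a] += 1
        means[a] += (reward - means[a]) / counts[a]
    # Report the most frequently chosen sequence.
    best = max(range(len(arms)), key=counts.__getitem__)
    return arms[best], counts


if __name__ == "__main__":
    best_order, pulls = thompson_sampling()
    print("most-pulled sequence:", best_order)
    print("pull counts per arm:", pulls)
```

Enumerating every permutation is only workable for a handful of processors, since the number of arms grows factorially; it is used here purely to keep the sketch short.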