Cost-Aware Dynamic Multi-Workflow Scheduling in Cloud Data Center Using Evolutionary Reinforcement Learning

Victoria Huang,Chen Wang,Hui Ma,Gang Chen,Kameron Christopher
DOI: https://doi.org/10.1007/978-3-031-20984-0_32
2022-01-01
Abstract:The Dynamic Multi-Workflow Scheduling (DMWS) problem aims to allocate highly complex tasks modeled as workflows to cloud resources while optimizing workflow brokers' interests. A workflow broker offers workflow execution services to end-users with agreed Service Level Agreements (SLA) while reducing its total VM rental fees in the meantime. Most existing DMWS-related research works focus on minimizing the workflow makespan by using either heuristics or hyper-heuristics techniques. However, these techniques were either designed for static workflow scheduling based on prior workflow information and/or the simplified cloud environment. In this paper, the DMWS problem is formulated to collectively minimize VM rental fees and SLA violation penalties. Moreover, we introduce a novel priority-based deep neural network scheduling policy that can flexibly adapt to a changing number of VMs and workflows. To train the new policy, a new Evolutionary Strategy based Reinforcement Learning (ES-RL) is developed and implemented. Different from gradient-based deep reinforcement learning algorithms, ES-RL has its advances in effectively training population-based and generally applicable policies in parallel as well as robustness to hyper-parameter settings. Our experiments with real-world datasets show that ES-RL can effectively train scheduling policies that can significantly reduce the costs by more than 90% compared to the state-of-the-art scheduling policies.
What problem does this paper attempt to address?