Abstract:In this article, we consider the dynamic allocation of bursty requests stochastically arriving at heterogeneous servers with uncertain setup times. Lower expected response time and less power consumption are desirable objectives of users and service providers respectively. However, sudden increase and decrease of cloud servers caused by bursty requests are rather challenging to get an appropriate trade-off between the two conflicting objectives which are closely related to the launched servers. The heterogeneity of the cloud servers further makes it more difficult to decide how to switch on and off servers and effectively and efficiently allocate bursty requests with balanced objectives. Based on a Markov decision process, a real-time bilevel decision-making model is constructed for unallocated requests which includes: whether to launch a server and which type of server to launch. A learn-and-deploy algorithm framework is proposed which contains two complementary stages. In the first stage, an effective offline bi-objective optimization algorithm is proposed to learn a set of policies, which provides helpful trade-off information for a decision-maker to choose a preferred policy a posteriori. In terms of the system status, a policy decides whether to launch a server according to a state-action table and which server to launch using a server priority sequence. In the second stage, a computationally efficient policy deployment method is proposed to search the corresponding action in the selected policy based on the current system status and apply it to the real-time system. Experimental studies over a large number of random and real instances have been conducted to validate the effectiveness of the proposed bilevel model and algorithm. Compared to the most recent existing method, the performance of the proposed approach can at most achieve an 80% improvement on power consumption and 20% improvement on response ti- e.

Burstiness-Aware Resource Reservation for Server Consolidation in Computing Clouds

Burstiness-aware Server Consolidation Via Queuing Theory Approach in a Computing Cloud

Load Balancing In Server Consolidation

Cost-Efficient Vm Configuration Algorithm In The Cloud Using Mix Scaling Strategy

An energy-efficient load balance strategy based on virtual machine consolidation in cloud environment

Towards energy and QoS aware dynamic VM consolidation in a multi-resource cloud

Energy and quality of service-aware virtual machine consolidation in a cloud data center

Smart-DRS: A Strategy of Dynamic Resource Scheduling in Cloud Data Center

Research on virtual machine consolidation strategy based on combined prediction and energy-aware in cloud computing platform

Multi-Tiered On-Demand Resource Scheduling for VM-Based Data Center

An Integrated Dynamic Resource Scheduling Framework in On-Demand Clouds.

Efficient Consolidation-Aware VCPU Scheduling on Multicore Virtualization Platform.

An Energy-Efficient Scheme for Cloud Resource Provisioning Based on CloudSim

Virtual Machine Based Energy-Efficient Data Center Architecture for Cloud Computing: A Performance Perspective.

An Integrated Dynamic Resource Scheduling Framework in On-Demand Clouds

SSUR: An Approach to Optimizing Virtual Machine Allocation Strategy Based on User Requirements for Cloud Data Center

A Virtual Machine Consolidation Algorithm Based on Dynamic Load Mean and Multi-Objective Optimization in Cloud Computing

A Bi-Objective Learn-and-Deploy Scheduling Method for Bursty and Stochastic Requests on Heterogeneous Cloud Servers

Multi-Objective Virtual Machine Consolidation

Learning-Based Virtual Machine Selection in Cloud Server Consolidation

Adaptive Resource Provisioning for the Cloud Using Online Bin Packing.