Abstract:We address the problem of under-utilization of resources in datacenters during cloud operations, specifically focusing on the challenge of online virtual machine (VM) scheduling. Rather than following the traditional approach of scheduling VMs based solely on their static flavors, we take into account their dynamic CPU utilization. We employ Gamma -robustness theory to manage the dynamic nature and introduce a novel variant of bin packing -(), which theoretically protects the Physical Machines (PMs) from hotspots formation within a specified probability alpha . We develop a scheduling algroithm named CloseRadiusFit and cold-start AI-based prediction algorithms for the online version of . To verify the quality of our approach towards the optimal solutions, we solve the Offline problem by designing a novel Mixed Integer Linear Programming (MILP) model and a combination of numerical upper and lower bounds. Our experimental results demonstrate that CloseRadiusFit achieves narrow gaps of 1.6% and 3.1% when compared to the lower and upper bounds, respectively. Note to Practitioners -A growing trend in the cloud industry involves overcommitting VMs on PMs. While this approach can ease the problem of low utilization of resources in datacenters, it also introduces a higher risk of hotspots due to resource contention and competition among VMs. In this work, we propose a novel method that leverages Gamma -robustness theory and introduce effective heuristics to achieve ultimate utilization of datacenter resources while ensuring desirable service quality. We validate our approach using real-world production data from Huawei Cloud, improving resource utilization by 125% over traditional flavor-based allocation methods, while maintaining the occurrence of hotspots below 5% ( alpha=0.05 ). Our solution only requires VMs' real utilization data that is typically already collected in cloud providers' production environments. Therefore, with minimal modifications to the existing scheduling system, cloud providers can easily implement our solution and reap its benefits. Moreover, in cases of the absence of historical utilization data for VMs (cold-start), we use machine learning to predict VM utilization statistics for our approach.

Overbooking Microservices in the Cloud

Online Cost-Aware Service Requests Scheduling in Hybrid Clouds for Cloud Bursting

A Novel Job Scheduling Model to Enhance Efficiency and Overall User Fairness of Cloud Computing Environment.

Skedulix: Hybrid Cloud Scheduling for Cost-Efficient Execution of Serverless Applications

Hotspot-Aware Scheduling of Virtual Machines with Overcommitment for Ultimate Utilization in Cloud Datacenters

Overcommitment in Cloud Services -- Bin packing with Chance Constraints

On Non-Preemptive VM Scheduling in the Cloud

Utility-based Resource Allocation and Pricing for Serverless Computing

An Integrated Dynamic Resource Scheduling Framework in On-Demand Clouds.

An Integrated Dynamic Resource Scheduling Framework in On-Demand Clouds

Cost Minimization in Multi-cloud Systems with Runtime Microservice Re-orchestration

Practical Scheduling for Real-World Serverless Computing

Scheduling Bag-of-Tasks in Clouds using Spot and Burstable Virtual Machines

Overbooking-Based Resource Allocation in Virtualized Data Center

The Architectural Implications of Microservices in the Cloud

Optimizing Cloud Performance: A Microservice Scheduling Strategy for Enhanced Fault-Tolerance, Reduced Network Traffic, and Lower Latency

Near-Optimal Scheduling Mechanisms for Deadline-Sensitive Jobs in Large Computing Clusters

SLA-Driven Load Scheduling in Multi-Tier Cloud Computing: Financial Impact Considerations

Optimizing simultaneous autoscaling for serverless cloud computing

Formally Verifying Function Scheduling Properties in Serverless Applications

Tiny Autoscalers for Tiny Workloads: Dynamic CPU Allocation for Serverless Functions