Workload Intelligence: Punching Holes Through the Cloud Abstraction
Lexiang Huang,Anjaly Parayil,Jue Zhang,Xiaoting Qin,Chetan Bansal,Jovan Stojkovic,Pantea Zardoshti,Pulkit Misra,Eli Cortez,Raphael Ghelman,Íñigo Goiri,Saravan Rajmohan,Jim Kleewein,Rodrigo Fonseca,Timothy Zhu,Ricardo Bianchini
2024-04-30
Abstract:Today, cloud workloads are essentially opaque to the cloud platform. Typically, the only information the platform receives is the virtual machine (VM) type and possibly a decoration to the type (e.g., the VM is evictable). Similarly, workloads receive little to no information from the platform; generally, workloads might receive telemetry from their VMs or exceptional signals (e.g., shortly before a VM is evicted). The narrow interface between workloads and platforms has several drawbacks: (1) a surge in VM types and decorations in public cloud platforms complicates customer selection; (2) essential workload characteristics (e.g., low availability requirements, high latency tolerance) are often unspecified, hindering platform customization for optimized resource usage and cost savings; and (3) workloads may be unaware of potential optimizations or lack sufficient time to react to platform events.
Distributed, Parallel, and Cluster Computing