Measurement-based Resource Allocation and Control in Data Centers: A Survey

Diana Andreea Popescu
2024-08-23
Abstract: Data centers have become ubiquitous for today's businesses. Organizations from banks to startups rely on cloud infrastructure to deploy user applications. In this context, it is vital to provide users with application performance guarantees. Network interference is one of the causes of unpredictable application performance, and many solutions have been proposed over the years. The main objective of this survey is to familiarize the reader with research into network measurement-based resource allocation and control in data centers, focusing on network resources in order to provide cloud performance guarantees. We start with a primer on general network measurement techniques and on data center networks and applications to give the reader context. We then summarize the characteristics of network traffic and cluster workloads in data centers, which are pivotal for measurement-based allocation and control. We study and compare network monitoring approaches in data centers, giving an overview of their evolution from Software-Defined Networking (SDN)-based to programmable data plane-based solutions. The network monitoring information can serve as input to cluster allocation and scheduling decisions. We next categorize cluster scheduling frameworks and analyze those that provide network guarantees in data centers, and we also look at emerging Machine Learning-driven resource allocation and control. We conclude with a discussion of future research directions.
Networking and Internet Architecture