Iphicles: Tuning Parameters of Data Center Networks with Differentiable Performance Model

Sijiang Huang,Mowei Wang,Yashe Liu,Zhenhua Liu,Yong Cui
DOI: https://doi.org/10.1109/iwqos61813.2024.10682926
2024-01-01
Abstract:Tuning parameters in Data Center Networks (DCN) has long been a nuisance and one of the reasons service providers are reluctant to deploy new mechanisms in their production environments. Despite the excessive time and resources devoted to finding better configurations, a "one-size-fits-all" solution remains elusive. Neither manual configuration by experts nor black-box optimization can address the challenges of network heterogeneity and dynamics. One essential factor impeding efficient and stable parameter optimization is the need to explore in real environments, which has a long convergence time alongside the risk of performance degradation. To address this problem, we build a twin performance model of the physical DCN that approximates the mapping from parameters to Quality of Service (QoS) metrics for fast and safe performance inference and present a DCN configuration framework called Iphicles. Leveraging gradients provided by differentiable performance models built with Graph Neural Networks (GNN), Iphicles can automatically recommend better parameters efficiently and stably. Experimental results based on extensive simulation demonstrate that in complex scenarios with mixed and dynamic traffic, Iphicles can deliver parameters that lead to evident improvements in flow completion time (FCT) for both mice and elephant flows simultaneously, with minimum convergence time while maintaining performance stability during the optimization process.
What problem does this paper attempt to address?