Enabling Efficient Spatio-Temporal GPU Sharing for Network Function Virtualization

Deze Zeng,Andong Zhu,Lin Gu,Peng Li,Quan Chen,Minyi Guo
DOI: https://doi.org/10.1109/tc.2023.3278541
IF: 3.183
2023-01-01
IEEE Transactions on Computers
Abstract:By leveraging standard IT virtualization technology and Commercial-Off-The-Shelf (COTS) servers, Network Function Virtualization (NFV) decouples network functions from proprietary hardware devices for flexible service provisioning. But the potential of NFV is significantly limited by its performance inefficiency. With the unparalleled advantages of multi-core parallelism and high memory bandwidth, Graphics Processing Units (GPUs) are regarded as a promising way to accelerate Virtualized Network Functions (VNF). However, the special architecture of GPU brings new challenges to task scheduling and resource allocation. To this end, we propose a GPU oriented spatio-temporal sharing framework for NFV called Gost, aiming for GPU based VNF performance promotion. The execution order and GPU resource allocation (i.e., the number of threads) are considered in task scheduling to minimize the end-to-end latency for VNF flows. First, we formulate the task scheduling problem into a nonlinear programming form, and then transform it into an equivalent Integer Linear Programming (ILP) form.The problem is proved as NP-hard. We customize the classical list scheduling algorithm and propose a List Scheduling based Spatio-Temporal GPU sharing strategy (LSSTG), whose achievable worst-case performance is also formally analyzed. We practically implement Gost prototype, based on which extensive experiments verify the high performance efficiency of LSSTG compared to state-of-the-art in terms of latency and throughput.
engineering, electrical & electronic,computer science, hardware & architecture
What problem does this paper attempt to address?