Abstract:Abstract Network function virtualization (NFV) has been proposed to enable flexible management and deployment of the network service in cloud. In NFV architecture, a network service needs to invoke several service functions (SFs) in a particular order following the service chain function. The placement of SFs has significant impact on the performance of network services. However, stochastic nature of the network service arrivals and departures as well as meeting the end-to-end Quality of Service(QoS) makes the SFs placement problem even more challenging. In this paper, we firstly provide a system architecture for the SFs placement of cloud service with end-to-end QoS deadline. We then formulate the end-to-end service placement as a Markov decision process (MDP) which aims to minimize the placement cost and the end-to-end delay. In our MDP, the end-to-end delay of active services in the network is considered to be the state of the system, and the placement (nonplacement or placement) of SF is considered as the action. Also, we discuss the rationality of our analytical model by analyzing the Markov stochastic property of the end-to-end service placement. To obtain the optimal placement policy, we then propose an algorithm (Algorithm 1) for dynamic SFs placement based on our model and use successive approximations, i.e. $\epsilon $-iteration algorithm (Algorithm 2) to obtain action distribution. Finally, we evaluate the proposed MDP by comparing our optimal method with DDQP, DRL-QOR, MinPath and MinDelay for QoS optimization, including acceptance probability, average delay, resource utilization, load-balancing and reliability.

Delay-Aware VNF Scheduling: A Reinforcement Learning Approach with Variable Action Set

Energy-delay-aware VNF scheduling: a reinforcement learning approach with hierarchical reward enhancement

Fair Virtual Network Function Scheduling with Deep Reinforcement Learning

Delay-Aware Resource Management for Multi-Service Coexisting LTE-D2D Networks with Wireless Network Virtualization

A reinforcement learning method for scheduling service function chains with multi-resource constraints

Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning

Two-Phase Virtual Network Function Selection and Chaining Algorithm Based on Deep Learning in SDN/NFV-Enabled Networks

Online Virtual Network Function Scheduling Towards Deterministic Latency

Joint VNF Placement and Scheduling for Latency-sensitive Services

Cutting Long-Tail Latency of Routing Response in Software Defined Networks.

Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems

Delay-Oriented Scheduling in 5G Downlink Wireless Networks Based on Reinforcement Learning With Partial Observations

Fair Virtual Network Function Mapping and Scheduling Using Proximal Policy Optimization.

Service Function Placement Optimization For Cloud Service With End-to-End Delay Constraints

Effective Multi-User Delay-Constrained Scheduling with Deep Recurrent Reinforcement Learning

Dynamic Prioritization and Adaptive Scheduling using Deep Deterministic Policy Gradient for Deploying Microservice-based VNFs

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning

Management and Orchestration of Virtual Network Functions via Deep Reinforcement Learning

Online Fault-tolerant VNF Chain Placement: A Deep Reinforcement Learning Approach

Network Resource Allocation Strategy Based on Deep Reinforcement Learning

Scheduling Multi-Flow Network Updates in Software-Defined NFV Systems