Abstract:Network slicing allows mobile network operators to virtualize infrastructures and provide customized slices for supporting various use cases with heterogeneous requirements. Online deep reinforcement learning (DRL) has shown promising potential in solving network problems and eliminating the simulation-to-reality discrepancy. Optimizing cross-domain resources with online DRL is, however, challenging, as the random exploration of DRL violates the service level agreement (SLA) of slices and resource constraints of infrastructures. In this paper, we propose OnSlicing, an online end-to-end network slicing system, to achieve minimal resource usage while satisfying slices' SLA. OnSlicing allows individualized learning for each slice and maintains its SLA by using a novel constraint-aware policy update method and proactive baseline switching mechanism. OnSlicing complies with resource constraints of infrastructures by using a unique design of action modification in slices and parameter coordination in infrastructures. OnSlicing further mitigates the poor performance of online learning during the early learning stage by offline imitating a rule-based solution. Besides, we design four new domain managers to enable dynamic resource configuration in radio access, transport, core, and edge networks, respectively, at a timescale of subseconds. We implement OnSlicing on an end-to-end slicing testbed designed based on OpenAirInterface with both 4G LTE and 5G NR, OpenDayLight SDN platform, and OpenAir-CN core network. The experimental results show that OnSlicing achieves 61.3% usage reduction as compared to the rule-based solution and maintains nearly zero violation (0.06%) throughout the online learning phase. As online learning is converged, OnSlicing reduces 12.5% usage without any violations as compared to the state-of-the-art online DRL solution.

Hierarchical Meta-Reinforcement Learning for Resource-Efficient Slicing in O-RAN

Towards efficient RAN slicing: A deep hierarchical reinforcement learning approach

Toward Scalable and Efficient Hierarchical Deep Reinforcement Learning for 5G RAN Slicing

Evolutionary Deep Reinforcement Learning for Dynamic Slice Management in O-RAN

Meta Reinforcement Learning Approach for Adaptive Resource Optimization in O-RAN

Federated Deep Reinforcement Learning for Resource Allocation in O-RAN Slicing

Using Deep Reinforcement Learning for 5G RAN Slicing Resource Allocation in New Power Load Management System

Real-Time Resource Slicing for 5G RAN Via Deep Reinforcement Learning

Attention-based Open RAN Slice Management using Deep Reinforcement Learning

Hierarchical Reinforcement Learning Based Resource Allocation for RAN Slicing

Deep Reinforcement Learning for Resource Management in Network Slicing

Advancing RAN Slicing with Offline Reinforcement Learning

Safe and Accelerated Deep Reinforcement Learning-based O-RAN Slicing: A Hybrid Transfer Learning Approach

Communication and Computation O-RAN Resource Slicing for URLLC Services Using Deep Reinforcement Learning

Deep Reinforcement Learning for Resource Management on Network Slicing: A Survey

Mobility aware and energy-efficient federated deep reinforcement learning assisted resource allocation for 5G-RAN slicing

OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning

Constrained Reinforcement Learning for Resource Allocation in Network Slicing

RAN Slice Strategy Based on Deep Reinforcement Learning for Smart Grid

Demo: Deep Reinforcement Learning for Resource Management in Cellular Network Slicing.

Dynamic SDN-based Radio Access Network Slicing with Deep Reinforcement Learning for URLLC and eMBB Services