Abstract:Virtualized Radio Access Networks (vRANs) are fully configurable and can be implemented at a low cost over commodity platforms to enable network management flexibility. In this paper, a novel vRAN reconfiguration problem is formulated to jointly reconfigure the functional splits of the base stations (BSs), locations of the virtualized central units (vCUs) and distributed units (vDUs), their resources, and the routing for each BS data flow. The objective is to minimize the long-term total network operation cost while adapting to the varying traffic demands and resource availability. Testbed measurements are performed to study the relationship between the traffic demands and computing resources, which reveals high variance and depends on the platform and its load. Consequently, finding the perfect model of the underlying system is non-trivial. Therefore, to solve the proposed problem, a deep reinforcement learning (RL)-based framework is proposed and developed using model-free RL approaches. Moreover, the problem consists of multiple BSs sharing the same resources, which results in a multi-dimensional discrete action space and leads to a combinatorial number of possible actions. To overcome this curse of dimensionality, action branching architecture, which is an action decomposition method with a shared decision module followed by neural network is combined with Dueling Double Deep Q-network (D3QN) algorithm. Simulations are carried out using an O-RAN compliant model and real traces of the testbed. Our numerical results show that the proposed framework successfully learns the optimal policy that adaptively selects the vRAN configurations, where its learning convergence can be further expedited through transfer learning even in different vRAN systems. It offers significant cost savings by up to 59\% of a static benchmark, 35\% of DDPG with discretization, and 76\% of non-branching D3QN.

Deep reinforcement learning for RAN optimization and control

Toward Scalable and Efficient Hierarchical Deep Reinforcement Learning for 5G RAN Slicing

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

Design and Evaluation of Deep Reinforcement Learning for Energy Saving in Open RAN

Using Deep Reinforcement Learning for 5G RAN Slicing Resource Allocation in New Power Load Management System

A Multi-Agent Deep Reinforcement Learning Approach for RAN Resource Allocation in O-RAN

CaRL: Cascade Reinforcement Learning with State Space Splitting for O-RAN based Traffic Steering

OpenRANet: Neuralized Spectrum Access by Joint Subcarrier and Power Allocation with Optimization-based Deep Learning

Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning

Intelligent O-RAN Traffic Steering for URLLC Through Deep Reinforcement Learning

Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physical Layer

Sim-to-Real Optimization of Complex Real World Mobile Network with Imperfect Information via Deep Reinforcement Learning from Self-play

Deep Reinforcement Learning for 5G Networks: Joint Beamforming, Power Control, and Interference Coordination

Deep Reinforcement Learning for Orchestrating Cost-Aware Reconfigurations of vRANs

A Framework for Automated Cellular Network Tuning with Reinforcement Learning

Anomaly Detection for Scalable Task Grouping in Reinforcement Learning-based RAN Optimization

Continual Model-based Reinforcement Learning for Data Efficient Wireless Network Optimisation

Joint Traffic Control and Multi-Channel Reassignment for Core Backbone Network in SDN-IoT: A Multi-Agent Deep Reinforcement Learning Approach

Dynamic SDN-based Radio Access Network Slicing with Deep Reinforcement Learning for URLLC and eMBB Services

Evolutionary Deep Reinforcement Learning for Dynamic Slice Management in O-RAN

Intent-driven Closed-Loop Control and Management Framework for 6G Open RAN