Abstract:The forthcoming 6G networks will embrace a new realm of AI-driven services that requires innovative network slicing strategies, namely slicing for AI, which involves the creation of customized network slices to meet Quality of service (QoS) requirements of diverse AI services. This poses challenges due to time-varying dynamics of users' behavior and mobile networks. Thus, this paper proposes an online learning framework to optimize the allocation of computational and communication resources to AI services, while considering their unique key performance indicators (KPIs), such as accuracy, latency, and cost. We define a problem of optimizing the total accuracy while balancing conflicting KPIs, prove its NP-hardness, and propose an online learning framework for solving it in dynamic environments. We present a basic online solution and two variations employing a pre-learning elimination method for reducing the decision space to expedite the learning. Furthermore, we propose a biased decision space subset selection by incorporating prior knowledge to enhance the learning speed without compromising performance and present two alternatives of handling the selected subset. Our results depict the efficiency of the proposed solutions in converging to the optimal decisions, while reducing decision space and improving time complexity.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to create customized network slices for AI services through network slicing technology in the future 6G network to meet the quality - of - service (QoS) requirements of different AI services and optimize the allocation of computing and communication resources. Specifically, the paper aims to address the challenges brought by the dynamic changes of user behaviors and mobile networks, and proposes an online - learning framework to optimize resource allocation while balancing conflicting key performance indicators (KPIs) such as accuracy, latency, and cost. ### Problem Background With the development of 6G networks, AI - driven services will become mainstream, and these services have strict requirements for network quality. For example, services such as multi - sensory extended reality, autonomous driving, and holographic video streaming require high bandwidth, low latency, and high reliability. To meet these diverse needs, network slicing technology is used to create multiple virtual networks, and each virtual network can be optimized for a specific service. However, AI services have unique requirements. For example, changes in parameters such as the amount of training data and the number of training cycles (epochs) will affect their performance. In addition, changes in user behaviors and network environments also complicate resource allocation. Therefore, traditional static optimization methods and reinforcement learning methods are difficult to adapt to such dynamic changes. ### Core Problems of the Paper The core problem of the paper is to design an online - learning framework that can dynamically adapt to user behaviors and network changes to optimize the resource allocation of AI services. Specifically, the paper hopes to solve the following key problems: 1. **Resource Allocation Optimization**: How to maximize the overall performance (such as accuracy) while meeting the QoS requirements of different AI services. 2. **Dynamic Adaptability**: How to make the system adapt to the dynamic changes of user behaviors and network environments. 3. **Complexity Management**: How to reduce the decision - making space and improve the time complexity of the algorithm while ensuring performance. ### Mathematical Model The paper formalizes the problem as an optimization problem and proves its NP - hard property. Specifically, the objective is to optimize the following performance function within each time slot \(t\): \[ f^{(t)}(\mathbf{a}^{(t)})=\frac{\sum_{i = 1}^{I}\alpha_{i}\cdot q_{i}^{(t)}(L_{i}^{(t)},m_{i}^{(t)})}{\sum_{i = 1}^{I}\alpha_{i}} \] where: - \(\mathbf{a}^{(t)}=(L_{1}^{(t)},m_{1}^{(t)},\psi_{1}^{(t)},\lambda_{1}^{(t)},\ldots,L_{I}^{(t)},m_{I}^{(t)},\psi_{I}^{(t)},\lambda_{I}^{(t)})\) is the resource allocation decision vector in the \(t\) - th time slot. - \(q_{i}^{(t)}(L_{i}^{(t)},m_{i}^{(t)})\) is the inference accuracy of the \(i\) - th AI model in the \(t\) - th time slot. - \(\alpha_{i}\) is the priority weight of the \(i\) - th AI service. The constraints of the optimization problem include: - The sum of computing resources does not exceed the maximum available computing resources: \(\sum_{i = 1}^{I}\psi_{i}\leq\Psi_{\text{max}}\) - The sum of communication resources does not exceed the maximum available communication resources: \(\sum_{i = 1}^{I}\lambda_{i}\leq\Lambda_{\text{max}}\) - The learning latency does not exceed the maximum allowable latency: \(D(L_{i},m_{i},\psi_{i},\lambda_{i})\leq D_{\text{max},i}\) - The cost does not exceed the maximum allowable cost: \(C(\psi_{i},\lambda_{i})\leq C_{\text{max},i}\) ### Proposed Solutions The paper proposes an online - learning framework, which specifically includes the following algorithm variants: 1. **Basic Online Learning Algorithm (OLS)**: Directly apply the online - learning method to solve the resource allocation problem. 2. **With Hyper -...

Slicing for AI: An Online Learning Framework for Network Slicing Supporting AI Services

Learn to Slice, Slice to Learn: Unveiling Online Optimization and Reinforcement Learning for Slicing AI Services

Toward Scalable and Efficient Hierarchical Deep Reinforcement Learning for 5G RAN Slicing

OnSlicing: Online End-to-End Network Slicing with Reinforcement Learning

Safe and Fast Reinforcement Learning for Network Slicing Resource Allocation

Learning to Slice Wi-Fi Networks: A State-Augmented Primal-Dual Approach

Fast and Scalable Network Slicing by Integrating Deep Learning with Lagrangian Methods

Customized Slicing for 6G: Enforcing Artificial Intelligence on Resource Management

Prediction-based Hybrid Slicing Framework for Service Level Agreement Guarantee in Mobility Scenarios: A Deep Learning Approach

Artificial Intelligence-Assisted Network Slicing: Network Assurance and Service Provisioning in 6G

Deep Reinforcement Learning for Network Slicing with Heterogeneous Resource Requirements and Time Varying Traffic Dynamics

Online Slice Reconfiguration for End-to-End QoE in 6G Applications

Multi-objective Optimization of Space-Air-Ground Integrated Network Slicing Relying on a Pair of Central and Distributed Learning Algorithms

Reinforcement Learning-Based Network Slicing Scheme for Optimized UE-QoS in Future Networks

Analysis of Network Slicing for Management of 5G Networks Using Machine Learning Techniques

DRL‐based customised resource allocation for sub‐slices in 6G network slicing

Zero-touch Continuous Network Slicing Control via Scalable Actor-Critic Learning

Deep Reinforcement Learning for Adaptive Network Slicing in 5G for Intelligent Vehicular Systems and Smart Cities

Deep Reinforcement Learning Approaches to Network Slice Scaling and Placement: A Survey

Digital Twin-Enhanced Deep Reinforcement Learning for Resource Management in Networks Slicing

AI-Assisted Slicing-Based Resource Management for Two-Tier Radio Access Networks