Abstract:The forthcoming 6G networks will embrace a new realm of AI-driven services that requires innovative network slicing strategies, namely slicing for AI, which involves the creation of customized network slices to meet Quality of service (QoS) requirements of diverse AI services. This poses challenges due to time-varying dynamics of users' behavior and mobile networks. Thus, this paper proposes an online learning framework to optimize the allocation of computational and communication resources to AI services, while considering their unique key performance indicators (KPIs), such as accuracy, latency, and cost. We define a problem of optimizing the total accuracy while balancing conflicting KPIs, prove its NP-hardness, and propose an online learning framework for solving it in dynamic environments. We present a basic online solution and two variations employing a pre-learning elimination method for reducing the decision space to expedite the learning. Furthermore, we propose a biased decision space subset selection by incorporating prior knowledge to enhance the learning speed without compromising performance and present two alternatives of handling the selected subset. Our results depict the efficiency of the proposed solutions in converging to the optimal decisions, while reducing decision space and improving time complexity.
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to create customized network slices for AI services through network slicing technology in the future 6G network to meet the quality - of - service (QoS) requirements of different AI services and optimize the allocation of computing and communication resources. Specifically, the paper aims to address the challenges brought by the dynamic changes of user behaviors and mobile networks, and proposes an online - learning framework to optimize resource allocation while balancing conflicting key performance indicators (KPIs) such as accuracy, latency, and cost.
### Problem Background
With the development of 6G networks, AI - driven services will become mainstream, and these services have strict requirements for network quality. For example, services such as multi - sensory extended reality, autonomous driving, and holographic video streaming require high bandwidth, low latency, and high reliability. To meet these diverse needs, network slicing technology is used to create multiple virtual networks, and each virtual network can be optimized for a specific service.
However, AI services have unique requirements. For example, changes in parameters such as the amount of training data and the number of training cycles (epochs) will affect their performance. In addition, changes in user behaviors and network environments also complicate resource allocation. Therefore, traditional static optimization methods and reinforcement learning methods are difficult to adapt to such dynamic changes.
### Core Problems of the Paper
The core problem of the paper is to design an online - learning framework that can dynamically adapt to user behaviors and network changes to optimize the resource allocation of AI services. Specifically, the paper hopes to solve the following key problems:
1. **Resource Allocation Optimization**: How to maximize the overall performance (such as accuracy) while meeting the QoS requirements of different AI services.
2. **Dynamic Adaptability**: How to make the system adapt to the dynamic changes of user behaviors and network environments.
3. **Complexity Management**: How to reduce the decision - making space and improve the time complexity of the algorithm while ensuring performance.
### Mathematical Model
The paper formalizes the problem as an optimization problem and proves its NP - hard property. Specifically, the objective is to optimize the following performance function within each time slot \(t\):
\[
f^{(t)}(\mathbf{a}^{(t)})=\frac{\sum_{i = 1}^{I}\alpha_{i}\cdot q_{i}^{(t)}(L_{i}^{(t)},m_{i}^{(t)})}{\sum_{i = 1}^{I}\alpha_{i}}
\]
where:
- \(\mathbf{a}^{(t)}=(L_{1}^{(t)},m_{1}^{(t)},\psi_{1}^{(t)},\lambda_{1}^{(t)},\ldots,L_{I}^{(t)},m_{I}^{(t)},\psi_{I}^{(t)},\lambda_{I}^{(t)})\) is the resource allocation decision vector in the \(t\) - th time slot.
- \(q_{i}^{(t)}(L_{i}^{(t)},m_{i}^{(t)})\) is the inference accuracy of the \(i\) - th AI model in the \(t\) - th time slot.
- \(\alpha_{i}\) is the priority weight of the \(i\) - th AI service.
The constraints of the optimization problem include:
- The sum of computing resources does not exceed the maximum available computing resources: \(\sum_{i = 1}^{I}\psi_{i}\leq\Psi_{\text{max}}\)
- The sum of communication resources does not exceed the maximum available communication resources: \(\sum_{i = 1}^{I}\lambda_{i}\leq\Lambda_{\text{max}}\)
- The learning latency does not exceed the maximum allowable latency: \(D(L_{i},m_{i},\psi_{i},\lambda_{i})\leq D_{\text{max},i}\)
- The cost does not exceed the maximum allowable cost: \(C(\psi_{i},\lambda_{i})\leq C_{\text{max},i}\)
### Proposed Solutions
The paper proposes an online - learning framework, which specifically includes the following algorithm variants:
1. **Basic Online Learning Algorithm (OLS)**: Directly apply the online - learning method to solve the resource allocation problem.
2. **With Hyper -...