Multi-Agent RL-Based Industrial AIGC Service Offloading over Wireless Edge Networks

Siyuan Li,Xi Lin,Hansong Xu,Kun Hua,Xiaomin Jin,Gaolei Li,Jianhua Li
2024-05-05
Abstract:Currently, the generative model has garnered considerable attention due to its application in addressing the challenge of scarcity of abnormal samples in the industrial Internet of Things (IoT). However, challenges persist regarding the edge deployment of generative models and the optimization of joint edge AI-generated content (AIGC) tasks. In this paper, we focus on the edge optimization of AIGC task execution and propose GMEL, a generative model-driven industrial AIGC collaborative edge learning framework. This framework aims to facilitate efficient few-shot learning by leveraging realistic sample synthesis and edge-based optimization capabilities. First, a multi-task AIGC computational offloading model is presented to ensure the efficient execution of heterogeneous AIGC tasks on edge servers. Then, we propose an attention-enhanced multi-agent reinforcement learning (AMARL) algorithm aimed at refining offloading policies within the IoT system, thereby supporting generative model-driven edge learning. Finally, our experimental results demonstrate the effectiveness of the proposed algorithm in optimizing the total system latency of the edge-based AIGC task completion.
Networking and Internet Architecture,Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the optimization problem of generative - model - driven edge - computing task offloading in the Industrial Internet of Things (IoT). Specifically, the paper focuses on how to efficiently execute heterogeneous AI - generated content (AIGC) tasks in wireless edge networks and proposes a new multi - agent - reinforced - learning (AMARL) algorithm to optimize the offloading strategies of these tasks. The following are the core problems and solutions in the paper: ### Research Background and Problems 1. **Application Challenges of Generative Models**: - Generative models are used in the industrial IoT to solve the problem of scarce abnormal samples, but there are still challenges in edge - deployment and joint - edge - AI - generated - content (AIGC) - task optimization. - Traditional discriminative models are prone to produce biased learning results due to limited abnormal samples, making few - sample learning a key challenge. 2. **Limitations of Existing Methods**: - Conventional simulation methods are difficult to update frequently to adapt to the constantly changing real environment. - When existing multi - agent - reinforced - learning (MARL) methods deal with complex mixed - cooperation - competition interaction problems, the central - critic network cannot effectively handle large - scale joint - state spaces and a large amount of useless information. ### Solutions 1. **GMEL Framework**: - A generative - model - driven industrial - AIGC - collaborative - edge - learning framework (GMEL) is proposed, aiming to promote efficient few - sample learning through real - sample synthesis and edge - optimization capabilities. - The framework includes a multi - task - AIGC - computing - offloading model to ensure the efficient execution of heterogeneous - AIGC tasks on edge servers. 2. **AMARL Algorithm**: - An attention - enhanced multi - agent - reinforced - learning (AMARL) algorithm is proposed to optimize the offloading strategies in IoT systems. - By introducing the multi - head - attention mechanism, the dimension of the joint - observation space is effectively compressed, invalid information is avoided, and resource utilization is improved. ### Experimental Verification - **Experimental Setup**: - An industrial IoT system of 100m × 100m is simulated, which contains 50 industrial - edge - devices (IED) and 5 edge - servers (ES). - Parameters such as the data size, GPU - cycle requirements, and deadline of AIGC tasks are set. - The AMARL model is used for training, and hyper - parameters such as the learning rate, discount factor, and batch size are set. - **Performance Evaluation**: - Compared with existing DRL algorithms (such as MADDPG, Actor - Critic, DQN), the results show that the AMARL algorithm has a faster convergence speed and higher reward value. - In cases of different task - generation probabilities and maximum deadlines, the AMARL algorithm shows a higher task - completion rate and lower transmission delay. - As the number of IEDs increases, the AMARL algorithm still maintains the best performance, indicating its adaptability in large - scale scenarios. ### Conclusion The GMEL framework and AMARL algorithm proposed in the paper effectively solve the generative - model - driven - AIGC - task - offloading - optimization problem in the industrial IoT. By introducing the multi - head - attention mechanism, the resource - utilization rate and task - completion rate of the system are significantly improved. The experimental results verify the effectiveness and superiority of this method.