Accelerating Mobile Edge Generation (MEG) by Constrained Learning

Xiaoxia Xu,Yuanwei Liu,Xidong Mu,Hong Xing,Arumugam Nallanathan
2024-08-07
Abstract:A novel accelerated mobile edge generation (MEG) framework is proposed for generating high-resolution images on mobile devices. Exploiting a large-scale latent diffusion model (LDM) distributed across edge server (ES) and user equipment (UE), cost-efficient artificial intelligence generated content (AIGC) is achieved by transmitting low-dimensional features between ES and UE. To reduce overheads of both distributed computations and transmissions, a dynamic diffusion and feature merging scheme is conceived. By jointly optimizing the denoising steps and feature merging ratio, the image generation quality is maximized subject to latency and energy consumption constraints. To address this problem and tailor LDM sub-models, a low-complexity MEG acceleration protocol is developed. Particularly, a backbone meta-architecture is trained via offline distillation. Then, dynamic diffusion and feature merging are determined in online channel environment, which can be viewed as a constrained Markov Decision Process (MDP). A constrained variational policy optimization (CVPO) based MEG algorithm is further proposed for constraint-guaranteed learning, namely MEG-CVPO. Numerical results verify that: 1) The proposed framework can generate 1024$\times$1024 high-quality images over noisy channels while reducing over $40\%$ latency compared to conventional generation schemes. 2) The developed MEG-CVPO effectively mitigates constraint violations, thus flexibly controlling the trade-off between image distortion and generation costs.
Systems and Control,Networking and Internet Architecture,Signal Processing
What problem does this paper attempt to address?
The main problem this paper attempts to address is how to accelerate the generation of high-resolution images within the Mobile Edge Generation (MEG) framework while reducing computational and transmission overhead, and meeting latency and energy consumption constraints. Specifically, the paper proposes a novel accelerated MEG framework by distributing large-scale Latent Diffusion Models (LDM) between edge servers (ES) and user equipment (UE), transmitting only low-dimensional features to achieve efficient Artificial Intelligence Generated Content (AIGC). To further reduce computational and transmission overhead, the paper designs a dynamic diffusion and feature compression scheme, and maximizes image quality through joint optimization of denoising steps and feature fusion ratios. Additionally, the paper proposes a MEG algorithm based on Constrained Variational Policy Optimization (CVPO) (MEG-CVPO) to solve the resulting constrained Markov Decision Process (MDP), ensuring the satisfaction of constraints and achieving a flexible trade-off between image quality and generation cost. ### Main Contributions: 1. **Proposed a novel accelerated MEG framework**: By distributing large-scale LDM between UE and ES, achieving efficient and low-cost high-resolution image generation. 2. **Developed a low-complexity dynamic MEG acceleration protocol**: First, the backbone architecture of the LDM sub-model is learned through offline distillation, then in an online environment, the denoising steps and feature fusion ratios are dynamically predicted based on channel observations, reformulating the problem as a constrained MDP. 3. **Proposed a CVPO-based MEG algorithm (MEG-CVPO)**: By training strategies through variational inference, it improves system rewards while achieving feasible policy distribution, effectively mitigating constraint violation issues, and solving the constrained MDP. 4. **Provided numerical results to verify effectiveness**: Compared to traditional generation schemes, the proposed dynamic diffusion and feature fusion scheme can reconstruct high-quality 1024×1024 images through noisy channels within 3-7 seconds, reducing latency by over 40%, and the MEG-CVPO algorithm outperforms traditional Lagrangian methods in constraint assurance, achieving a controllable trade-off between generation quality and cost.