Abstract: Wireless powered mobile edge computing (MEC) networks, in which wireless devices (WDs) can offload parts of computation-intensive tasks to remote servers and charge their built-in batteries over the air, have been envisaged as a promising technology to meet the ultra-low-power requirement and extend the continuous operating capability of WDs. However, when multiple WDs coexist in the network, minimizing the total task delay is non-trivial because the optimization variables are intrinsically coupled. Moreover, the channels vary dynamically over time and the tasks are unpredictable, which makes a closed-form solution even harder to obtain. Although reinforcement learning (RL) has proven effective for such complex optimization problems, training the neural networks remains time-consuming. This paper considers a challenging hybrid task offloading scenario, in which tasks can be partially executed locally and remotely in parallel, and each WD can use both active RF transmission and passive backscatter communication (BackCom) to offload tasks to the remote server. Furthermore, a game-combined multi-agent deep deterministic policy gradient (MADDPG) algorithm is proposed to minimize the total task delay while accounting for fairness among the WDs: a potential game handles the offloading decision, and MADDPG handles time scheduling and harvested energy splitting. The potential game, which can be proved to converge in a finite number of iterations, helps to accelerate training and reduce the computational complexity. Owing to the feature of ‘centralized training with decentralized execution,’ once well trained, each agent in MADDPG can determine the proper time scheduling and harvested energy splitting independently, without sharing information with the others. Together with the unilateral contention among WDs over the offloading decision in the potential game, a fully decentralized framework is finally designed for the proposed algorithm. Numerical results demonstrate that the game-combined MADDPG algorithm achieves near-optimal performance relative to existing centralized approaches and reduces the convergence time relative to other learning approaches that do not use the game.
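The finite-iteration convergence attributed to the potential game can be illustrated with a generic congestion game, a standard example of a potential game. The following is a minimal sketch under stated assumptions, not the paper's model: the three options (`local`, `active_rf`, `backscatter`), the delay functions in `DELAYS`, and all numeric values are hypothetical. Each WD in turn switches to the option that strictly lowers its own delay, and the Rosenthal potential recorded in `trace` strictly decreases with every switch, so the loop must stop after finitely many moves.

```python
from typing import Callable, Dict, List, Tuple

DelayFn = Callable[[int], float]

# Hypothetical delay models (NOT from the paper): the delay of an option as a
# function of how many WDs currently choose it.
DELAYS: Dict[str, DelayFn] = {
    "local": lambda n: 5.0,                  # local computing, no contention
    "active_rf": lambda n: 1.0 + 1.5 * n,    # shared channel, delay grows with load
    "backscatter": lambda n: 2.0 + 1.0 * n,  # shared channel, delay grows with load
}


def load(choices: List[str], option: str) -> int:
    """Number of WDs currently choosing `option`."""
    return sum(1 for c in choices if c == option)


def potential(choices: List[str], delays: Dict[str, DelayFn]) -> float:
    """Rosenthal potential: for each option, sum its delay at loads 1..n."""
    return sum(
        delays[o](k) for o in delays for k in range(1, load(choices, o) + 1)
    )


def best_response(i: int, choices: List[str], delays: Dict[str, DelayFn]) -> str:
    """Option minimizing WD i's delay, given the other WDs' current choices."""
    def cost(option: str) -> float:
        others = sum(1 for j, c in enumerate(choices) if j != i and c == option)
        return delays[option](others + 1)

    best = min(delays, key=cost)
    # Switch only on a strict improvement, so every move lowers the potential.
    return best if cost(best) < cost(choices[i]) - 1e-12 else choices[i]


def best_response_dynamics(
    num_wds: int, delays: Dict[str, DelayFn], start: str = "active_rf"
) -> Tuple[List[str], List[float]]:
    """Run round-robin best responses until no WD wants to deviate."""
    choices = [start] * num_wds
    trace = [potential(choices, delays)]
    changed = True
    while changed:
        changed = False
        for i in range(num_wds):
            new = best_response(i, choices, delays)
            if new != choices[i]:
                choices[i] = new
                trace.append(potential(choices, delays))
                changed = True
    return choices, trace


if __name__ == "__main__":
    final_choices, potential_trace = best_response_dynamics(4, DELAYS)
    print(final_choices)
    print(potential_trace)
```

Each WD only needs the current load on each option to compute its move, which is the sense in which this kind of contention is unilateral; how the paper's game defines its utility and potential functions is not stated in the abstract.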

Game-Combined Multi-Agent DRL for Tasks Offloading in Wireless Powered MEC Networks
