Abstract:In heterogeneous wireless networks, massive mobile terminals randomly generate a large number of computation tasks (payloads). How to better manage these mobile terminals located in wireless networks to achieve acceptable quality of service (QoS) such as latency minimization, energy consumption minimization is crucial. A multi-access edge computing (MEC) server can be leveraged to execute the offloaded payloads generated from mobile terminals owing to its powerful processing power and location proximity features. However, an MEC server cannot tackle all offloaded tasks from multiple mobile terminals, and its energy consumption needs further consideration. We introduce an edge server model combined with the unmanned aerial vehicle (UAV) and equipped with the macro base station (MBS-MEC) to process the arrival payloads, and all UAVs and MBS-MECs can harvest renewable energy by using energy harvesting equipment. Furthermore, we model the computation offloading as a deep reinforcement learning scheme without priori knowledge. Considering the infeasibility of deep-reinforcement learning-based centralized learning for the proposed edge computing framework, we propose a distributed computation offloading scheme based on deep reinforcement learning (DCODRL) to minimize the weighted average cost, including the latency cost and the energy cost. Each mobile terminal can be regarded as a learning agent for the DCODRL. To compensate for the lack of cooperation of the DCODRL, we propose a gated-recurrent-unit-assisted multi-agent computation offloading scheme based on deep reinforcement learning (MCODRL) to improve the offloading policy by obtaining global observation information and designing a common reward for all agents. Comprehensive numerical results reflect the convergence and effectiveness of the DCODRL and MCODRL, and the efficacy of the proposed algorithms is further verified through comparisons with two baseline algorithms.

A Hybrid Deep Reinforcement Learning Approach for Dynamic Task Offloading in NOMA-MEC System.

Two-Tier Multi-Access Partial Computation Offloading Via NOMA: A Hybrid Deep Learning Approach for Energy Minimization

Computation Offloading and Resource Allocation in NOMA-MEC: A Deep Reinforcement Learning Approach

Dynamic Computation Offloading With Energy Harvesting Devices: A Hybrid-Decision-Based Deep Reinforcement Learning Approach

NOMA-Based Multi-User Mobile Edge Computation Offloading via Cooperative Multi-Agent Deep Reinforcement Learning

Multi-agent Reinforcement Learning for Task Offloading with Hybrid Decision Space in Multi-Access Edge Computing

Energy-Efficient Collaborative Multi-Access Edge Computing Via Deep Reinforcement Learning

Towards Efficient Task Offloading at the Edge Based on Meta-Reinforcement Learning with Hybrid Action Space.

Distance-aware Multi-Agent Reinforcement Learning for Task Offloading in MEC Network.

Deep reinforcement learning-based task scheduling and resource allocation for NOMA-MEC in Industrial Internet of Things

Multi-Queue-Based Offloading Strategy for Deep Reinforcement Learning Tasks

Decentralized Computation Offloading for Multi-User Mobile Edge Computing: A Deep Reinforcement Learning Approach

DMADRL: A Distributed Multi-agent Deep Reinforcement Learning Algorithm for Cognitive Offloading in Dynamic MEC Networks

Optimized Computation Offloading Performance in Virtual Edge Computing Systems via Deep Reinforcement Learning

An Efficient Online Computation Offloading Approach for Large-Scale Mobile Edge Computing via Deep Reinforcement Learning

Adaptive Computation Offloading Policy for Multi-Access Edge Computing in Heterogeneous Wireless Networks

Double Riss Assisted Task Offloading for NOMA-MEC with Action-Constrained Deep Reinforcement Learning

Energy Saving Computation Offloading for Dynamic CR-MEC Systems with NOMA Via Decomposition Based Soft Actor–critic

Dynamic Offloading for Multiuser Muti-CAP MEC Networks: A Deep Reinforcement Learning Approach

Fast Adaptive Task Offloading in Edge Computing based on Meta Reinforcement Learning

DRL-Assisted Energy Minimization for NOMA-Based Dynamic Multi-User Multi-Access MEC Networks