Abstract: Mobile Edge Computing (MEC) has been regarded as a promising paradigm for reducing service latency for data processing in the Internet of Things (IoT) by provisioning computing resources at the network edge. In this work, we jointly optimize task partitioning and computational power allocation for computation offloading in a dynamic environment with multiple IoT devices and multiple edge servers. We formulate the problem as a Markov decision process with a constrained hybrid action space, which existing deep reinforcement learning (DRL) algorithms cannot handle well. We therefore develop a novel DRL algorithm called Dirichlet Deep Deterministic Policy Gradient (D3PG), built on Deep Deterministic Policy Gradient (DDPG), to solve the problem. The developed model learns to solve a multi-objective optimization: maximizing the number of tasks processed before expiration while minimizing energy cost and service latency. More importantly, D3PG can effectively handle a constrained distribution-continuous hybrid action space, in which the distribution variables govern task partitioning and offloading and the continuous variables govern computational frequency control. Moreover, D3PG can address many similar issues in MEC and in general reinforcement learning problems. Extensive simulation results show that the proposed D3PG outperforms state-of-the-art methods.
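
To make the idea of a constrained distribution-continuous hybrid action concrete, the following is a minimal sketch and not the authors' implementation. It assumes an actor network has already produced unconstrained real-valued outputs; the helper names (`softplus`, `sigmoid`, `sample_dirichlet`, `hybrid_action`), the softplus mapping to Dirichlet concentrations, the sigmoid squashing of frequencies, and the bounds `f_min` and `f_max` are all illustrative assumptions. The sketch shows only how the two constraints can be satisfied by construction: the partition fractions are non-negative and sum to 1, and each frequency stays within its bounds.

```python
import math
import random


def softplus(x: float) -> float:
    """Numerically stable softplus; maps any real number to a positive value."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def sigmoid(x: float) -> float:
    """Numerically stable logistic function with output in (0, 1)."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def sample_dirichlet(alphas, rng):
    """Sample from Dirichlet(alphas) by normalizing independent Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(draws)
    if total == 0.0:  # guard against underflow when all alphas are tiny
        return [1.0 / len(alphas)] * len(alphas)
    return [d / total for d in draws]


def hybrid_action(raw_partition, raw_freq, f_min, f_max, rng):
    """Map unconstrained actor outputs to a constrained hybrid action.

    raw_partition: one real value per execution target (hypothetical layout,
                   e.g. local device plus each edge server).
    raw_freq:      one real value per controllable CPU frequency.
    Returns (partition, freqs), where partition lies on the probability
    simplex and every frequency lies in [f_min, f_max].
    """
    # Assumption: concentrations are kept positive via softplus plus a floor.
    alphas = [softplus(z) + 1e-2 for z in raw_partition]
    partition = sample_dirichlet(alphas, rng)
    # Assumption: frequencies are squashed into their box with a sigmoid.
    freqs = [f_min + (f_max - f_min) * sigmoid(z) for z in raw_freq]
    return partition, freqs


if __name__ == "__main__":
    rng = random.Random(0)
    partition, freqs = hybrid_action(
        raw_partition=[0.5, -1.0, 2.0],  # e.g. local, server 1, server 2
        raw_freq=[0.0, 3.0],
        f_min=0.5e9,
        f_max=2.0e9,
        rng=rng,
    )
    print("task fractions:", partition)
    print("CPU frequencies (Hz):", freqs)
```

Sampling the partition from a Dirichlet distribution means the sum-to-one constraint never has to be enforced by clipping or by a penalty term in the reward, which is the property the abstract attributes to the distribution part of the action space.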

Parameterized Deep Reinforcement Learning with Hybrid Action Space for Edge Task Offloading

Towards Efficient Task Offloading at the Edge Based on Meta-Reinforcement Learning with Hybrid Action Space

A Novel Hybrid-ARPPO Algorithm for Dynamic Computation Offloading in Edge Computing

Multi-agent Reinforcement Learning for Task Offloading with Hybrid Decision Space in Multi-Access Edge Computing

Deep Reinforcement Learning-Based Dynamical Task Offloading for Mobile Edge Computing

An Efficient Computation Offloading Approach in Multi-access Edge Computing Using Deep Reinforcement Learning

Security-Aware Task Offloading Using Deep Reinforcement Learning in Mobile Edge Computing Systems

Deep Reinforcement Learning-Based Offloading Decision Optimization in Mobile Edge Computing

Decentralized Computation Offloading for Multi-User Mobile Edge Computing: A Deep Reinforcement Learning Approach

Pruning-based Deep Reinforcement Learning for Task Offloading in End-Edge-Cloud Collaborative Mobile Edge Computing

Energy-Efficient Collaborative Multi-Access Edge Computing Via Deep Reinforcement Learning

Multi-Queue-Based Offloading Strategy for Deep Reinforcement Learning Tasks

Deep Reinforcement Learning Method for Task Offloading in Mobile Edge Computing Networks Based on Parallel Exploration with Asynchronous Training

Policy network-based dual-agent deep reinforcement learning for multi-resource task offloading in multi-access edge cloud networks

A Hybrid Deep Reinforcement Learning Approach for Dynamic Task Offloading in NOMA-MEC System

DRL-Based Dependent Task Offloading Strategies with Multi-Server Collaboration in Multi-Access Edge Computing

Fast Adaptive Task Offloading in Edge Computing based on Meta Reinforcement Learning

D3PG: Dirichlet DDPG for Task Partitioning and Offloading with Constrained Hybrid Action Space in Mobile Edge Computing

Hybrid Deep Reinforcement Learning-Based Task Offloading for D2D-Assisted Cloud-Edge-Device Collaborative Networks

Cloud-Edge-End Collaborative Task Offloading in Vehicular Edge Networks: A Multi-Layer Deep Reinforcement Learning Approach

A3C-DO: A Regional Resource Scheduling Framework Based on Deep Reinforcement Learning in Edge Scenario