Abstract:Multi-access edge computing has been considered as a promising solution for enabling computation-intensive yet latency-sensitive applications at resource-constrained wireless devices (WDs). To improve the spectrum efficiency for multi-WD computation offloading, this paper considers nonorthogonal multiple access (NOMA) assisted two-tier multiaccess edge computing scenario, which exploits the computation resources of both the edge servers (ESs) and the cloudlet server (CS) deployed at different tiers. In particular, the WDs can offload partial workloads to different ESs simultaneously via NOMA, and the ESs can form a NOMA-group to further offload partially received workloads to the CS for processing. We investigate the total energy consumption minimization problem by jointly optimizing the two-tier offloading decisions, the NOMA transmission duration, and the computation resource allocation. Due to the successive interference cancellation in the NOMA and the coupling effect in two-tier offloading, the formulated optimization problem is strictly non-convex. To address this difficulty, we exploit the hierarchical relationship among the joint optimization variables, and then propose a hybrid deep reinforcement learning (HDRL) algorithm to learn two policies that determine the coupled variables, i.e., the ESs' offloading decisions and the NOMA transmission duration, respectively. Then, the remaining decision variables can be jointly optimized by using the convex optimization methods directly based on the results provided by the HDRL algorithm. Specifically, the HDRL algorithm that uses different policies to determine the coupled variables can converge faster than the existing solutions that learn a single policy to determine all variables. Experimental results are provided to validate the performance of our proposed HDRL algorithm in comparison with two other learning-based algorithms.

Energy Efficiency Resource Management for D2D-NOMA Enabled Network: A Dinkelbach Combined Twin Delayed Deterministic Policy Gradient Approach

Two-Tier Multi-Access Partial Computation Offloading Via NOMA: A Hybrid Deep Learning Approach for Energy Minimization

A Dynamic Power Allocation Scheme in Power-Domain NOMA Using Actor-Critic Reinforcement Learning.

Energy-Efficient Resource Allocation with Imperfect CSI in NOMA-based D2D Networks with SWIPT

Energy-Efficient D2D Communications Underlaying NOMA-Based Networks with Energy Harvesting.

Double Deep Q-Network Based Distributed Resource Matching Algorithm for D2D Communication

Power Allocation for Full-Duplex Communication Systems Based on Deep Deterministic Policy Gradient

Downlink Non-Orthogonal Multiple Access Power Allocation Algorithm Based on Double Deep Q Network for Ensuring User's Quality of Service

Deep Reinforcement Learning Based Multidimensional Resource Management for Energy Harvesting Cognitive NOMA Communications

Hybrid Centralized-Distributed Resource Allocation Based on Deep Reinforcement Learning for Cooperative D2D Communications

Energy-Efficient Resource Allocation in D2D Underlaid Cellular Uplinks.

Deep Learning Based Power Optimizing for NOMA Based Relay Aided D2D Transmissions

Spectrum-efficient user grouping and resource allocation based on deep reinforcement learning for mmWave massive MIMO-NOMA systems

Energy-efficient access point clustering and power allocation in cell-free massive MIMO networks: a hierarchical deep reinforcement learning approach

Joint EH Time and Transmit Power Optimization Based on DDPG for EH Communications

DDPG-Based Joint Resource Management for Latency Minimization in NOMA-MEC Networks.

Deep Reinforcement Learning for Joint Channel Selection and Power Control in D2D Networks

Resource Allocation for NOMA-MEC Systems in Ultra-Dense Networks: A Learning Aided Mean-Field Game Approach

Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning

User Pairing for Delay-Limited NOMA-Based Satellite Networks with Deep Reinforcement Learning

Resource Allocation for Uplink NOMA-Based D2D Communication in Energy Harvesting Scenario: A Two-Stage Game Approach