Abstract:Multi-access edge computing has been considered as a promising solution for enabling computation-intensive yet latency-sensitive applications at resource-constrained wireless devices (WDs). To improve the spectrum efficiency for multi-WD computation offloading, this paper considers nonorthogonal multiple access (NOMA) assisted two-tier multiaccess edge computing scenario, which exploits the computation resources of both the edge servers (ESs) and the cloudlet server (CS) deployed at different tiers. In particular, the WDs can offload partial workloads to different ESs simultaneously via NOMA, and the ESs can form a NOMA-group to further offload partially received workloads to the CS for processing. We investigate the total energy consumption minimization problem by jointly optimizing the two-tier offloading decisions, the NOMA transmission duration, and the computation resource allocation. Due to the successive interference cancellation in the NOMA and the coupling effect in two-tier offloading, the formulated optimization problem is strictly non-convex. To address this difficulty, we exploit the hierarchical relationship among the joint optimization variables, and then propose a hybrid deep reinforcement learning (HDRL) algorithm to learn two policies that determine the coupled variables, i.e., the ESs' offloading decisions and the NOMA transmission duration, respectively. Then, the remaining decision variables can be jointly optimized by using the convex optimization methods directly based on the results provided by the HDRL algorithm. Specifically, the HDRL algorithm that uses different policies to determine the coupled variables can converge faster than the existing solutions that learn a single policy to determine all variables. Experimental results are provided to validate the performance of our proposed HDRL algorithm in comparison with two other learning-based algorithms.

Energy Efficient Transmission in Underlay CR-NOMA Networks Enabled by Reinforcement Learning

A Dynamic Power Allocation Scheme in Power-Domain NOMA Using Actor-Critic Reinforcement Learning.

DDPG with Transfer Learning and Meta Learning Framework for Resource Allocation in Underlay Cognitive Radio Network

Two-Tier Multi-Access Partial Computation Offloading Via NOMA: A Hybrid Deep Learning Approach for Energy Minimization

AoI-Oriented Resource Allocation for NOMA-Based Wireless Powered Cognitive Radio Networks Based on Multi-Agent Deep Reinforcement Learning

Deep Reinforcement Learning Based Multidimensional Resource Management for Energy Harvesting Cognitive NOMA Communications

Robust and Outage-Constrained Energy Efficiency Optimization in RIS-Assisted NOMA Networks

Reinforcement Learning Enhanced Iterative Power Allocation in Stochastic Cognitive Wireless Mesh Networks

Deep Reinforcement Learning Based Massive Access Management for Ultra-Reliable Low-Latency Communications

Resource Allocation in Uplink NOMA-IoT Networks: A Reinforcement-Learning Approach

Power Allocation for Cognitive Wireless Mesh Networks by Applying Multi-agent Q-learning Approach

A Robust Adaptive Objective Power Allocation in Cognitive NOMA Networks

Dual Dynamic Scheduling for Hierarchical QoS in Uplink-NOMA: A Reinforcement Learning Approach

Collaborative Multi-BS Power Management for Dense Radio Access Network using Deep Reinforcement Learning

Decentralized Power Allocation for MIMO-NOMA Vehicular Edge Computing Based on Deep Reinforcement Learning

RL-Assisted Power Allocation for Covert Communication in Distributed NOMA Networks

Cognitive network management with optimization using network protocol and machine learning model

AI Empowered RIS-Assisted NOMA Networks: Deep Learning or Reinforcement Learning?

Adaptive Coordinated Direct and Relay Transmission for NOMA Networks: A Joint Downlink-Uplink Scheme

Conjectural Variations in Multi-Agent Reinforcement Learning for Energy-Efficient Cognitive Wireless Mesh Networks.

Deep Reinforcement Learning-Based Power Allocation for Minimizing Age of Information and Energy Consumption in Multi-Input Multi-Output and Non-Orthogonal Multiple Access Internet of Things Systems