Abstract:The combination of energy harvesting (EH), cognitive radio (CR), and non-orthogonal multiple access (NOMA) is a promising solution to improve energy efficiency and spectral efficiency of the upcoming beyond fifth generation network (B5G), especially for support the wireless sensor communications in Internet of things (IoT) system. However, how to realize intelligent frequency, time, and energy resource allocation to support better performances is an important problem to be solved. In this paper, we study joint spectrum, energy, and time resource management for the EH-CR-NOMA IoT systems. Our goal is to minimize the number of data packets losses for all secondary sensing users (SSU), while satisfying the constraints on the maximum charging battery capacity, maximum transmitting power, maximum buffer capacity, and minimum data rate of primary users (PU) and SSUs. Due to the non-convexity of this optimization problem and the stochastic nature of the wireless environment, we propose a distributed multidimensional resource management algorithm based on deep reinforcement learning (DRL). Considering the continuity of the resources to be managed, the deep deterministic policy gradient (DDPG) algorithm is adopted, based on which each agent (SSU) can manage its own multidimensional resources without collaboration. In addition, a simplified but practical action adjuster (AA) is introduced for improving the training efficiency and battery performance protection. The provided results show that the convergence speed of the proposed algorithm is about 4 times faster than that of DDPG, and the average number of packet losses (ANPL) is about 8 times lower than that of the greedy algorithm.

Dynamic User Pairing and Power Allocation for NOMA with Deep Reinforcement Learning

A Dynamic Power Allocation Scheme in Power-Domain NOMA Using Actor-Critic Reinforcement Learning.

User Pairing for Delay-Limited NOMA-Based Satellite Networks with Deep Reinforcement Learning

Resource Optimisation for Downlink Non-Orthogonal Multiple Access Systems: a Joint Channel Bandwidth and Power Allocations Approach.

Downlink Non-Orthogonal Multiple Access Power Allocation Algorithm Based on Double Deep Q Network for Ensuring User's Quality of Service

Joint User Pairing and Association for Multicell NOMA: A Pointer Network-based Approach

Power Control, User Scheduling And Resource Allocation For Downlink Noma Systems With Imperfect Channel State Information

A Novel User Pairing in Downlink Non-Orthogonal Multiple Access

Dual Dynamic Scheduling for Hierarchical QoS in Uplink-NOMA: A Reinforcement Learning Approach

Spectrum-efficient user grouping and resource allocation based on deep reinforcement learning for mmWave massive MIMO-NOMA systems

Joint User Clustering And Passive Beamforming For Downlink Noma System With Reconfigurable Intelligent Surface

User Pairing for Downlink Non-Orthogonal Multiple Access Networks Using Matching Algorithm

Deep Reinforcement Learning-Based User Pairing In Full-Duplex Communication Systems

Resource Allocation for D2D Communications Underlaying a NOMA-Based Cellular Network

Joint User Pairing and Beamforming Design for NOMA-Aided CFMM-ISAC Systems

Secure User Pairing and Power Allocation for Downlink Non-Orthogonal Multiple Access against External Eavesdropping

Deep Learning Based Radio Resource Management in NOMA Networks: User Association, Subchannel and Power Allocation

Power Allocation in Multi-User Cellular Networks: Deep Reinforcement Learning Approaches

Joint Resource Management for MC-NOMA: A Deep Reinforcement Learning Approach

Joint User Pairing and Power Allocation for NOMA-Based GEO and LEO Satellite Network

Deep Reinforcement Learning Based Multidimensional Resource Management for Energy Harvesting Cognitive NOMA Communications