Abstract: To improve the energy efficiency (EE) of underlay cognitive radio (CR) networks, a power allocation strategy based on actor-critic reinforcement learning is proposed, in which a cluster of cognitive users (CUs) simultaneously accesses the same primary spectrum band under the interference constraints of the primary user (PU) by employing the non-orthogonal multiple access (NOMA) technique. In the proposed scheme, power allocation is formulated as a non-convex optimization problem. The power allocation for the different CUs is based on the actor-critic reinforcement learning model, in which the weighted data rate is set as the reward function and the generated action strategy (i.e., the power allocation) is iteratively criticized and updated. Both the CUs' spectral efficiency and the PU's interference constraints are considered in the training of the actor-critic model. Furthermore, a first-order Taylor approximation and other manipulations are adopted to solve the power allocation optimization problem so that conventional channel conditions are taken into account. Simulation results show that the proposed scheme achieves a higher spectral efficiency for the CUs than both a benchmark scheme without a learning process and an existing Q-learning based method, while the resulting interference to the PU transmission is kept within a given tolerable limit.
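As an illustration of the reward described above, the following minimal Python sketch evaluates a weighted data rate for a NOMA cluster and penalizes power allocations that violate the PU interference limit. It is not the scheme's actual formulation: the successive interference cancellation (SIC) order at a common receiver, the linear penalty, and all names (`noma_rates`, `reward`, `pu_gains`, `i_max`, `penalty`) are assumptions made for this example only.

```python
import math


def noma_rates(powers, gains, noise):
    """Per-CU rates in bit/s/Hz, assuming SIC at a common receiver (illustrative)."""
    received = [p * g for p, g in zip(powers, gains)]
    # Assumed decoding order: strongest received signal first; signals not yet
    # decoded remain as interference for the one being decoded.
    order = sorted(range(len(received)), key=lambda i: received[i], reverse=True)
    rates = [0.0] * len(received)
    residual = sum(received)
    for i in order:
        residual -= received[i]
        rates[i] = math.log2(1.0 + received[i] / (residual + noise))
    return rates


def reward(powers, gains, pu_gains, weights, noise, i_max, penalty=1.0):
    """Weighted data rate, or a negative penalty if the PU limit is exceeded."""
    # Aggregate interference caused at the PU receiver by all CUs.
    interference = sum(p * h for p, h in zip(powers, pu_gains))
    if interference > i_max:
        # Hypothetical penalty shape: proportional to the constraint violation.
        return -penalty * (interference - i_max)
    rates = noma_rates(powers, gains, noise)
    return sum(w * r for w, r in zip(weights, rates))


if __name__ == "__main__":
    # Two CUs with unit transmit power; the values are arbitrary placeholders.
    print(reward([1.0, 1.0], [3.0, 1.0], [0.1, 0.1], [1.0, 1.0], 1.0, 0.5))
```

In an actor-critic loop, the actor would propose the `powers` vector as its action and the critic would be trained on the scalar returned by `reward`; the learning update itself is omitted here.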

Reinforcement learning based spectrum-aware routing in multi-hop cognitive radio networks

Multi-hop routing algorithm with spectrum assignment for cognitive radio networks

Exploiting Spectrum Availability And Quality In Routing For Multi-Hop Cognitive Radio Networks

Artificial intelligence based cognitive routing for cognitive radio networks

Multi-metric cross layer routing protocol for cognitive radio ad hoc networks

Spectrum Aware Routing for Multi-Hop Cognitive Radio Networks with a Single Transceiver

Multi-objective Reinforcement Learning Based Routing in Cognitive Radio Networks: Walking in a Random Maze

Spectrum Aware On-Demand Routing In Cognitive Radio Networks

DDPG with Transfer Learning and Meta Learning Framework for Resource Allocation in Underlay Cognitive Radio Network

Reliable Link Routing In Cognitive Radio Networks

Spectrum-aware Cluster-based Routing Protocol for Multiple-Hop Cognitive Wireless Network

Local Coordination Based Routing and Spectrum Assignment in Multi-hop Cognitive Radio Networks

Scalable Deep Reinforcement Learning for Routing and Spectrum Access in Physical Layer

Energy Efficient Transmission in Underlay CR-NOMA Networks Enabled by Reinforcement Learning

A selection region based multi-hop routing protocol for cognitive radio Ad Hoc networks

Multichannel Non-Persistent CSMA MAC Schemes with Reinforcement Learning for Cognitive Radio Networks

Routing and QoS Provisioning in Cognitive Radio Networks

Dynamic Channel Selection and Transmission Scheduling for Cognitive Radio Networks

Spectrum-Aware Routing for Reliable End-to-End Communications in Cognitive Sensor Network

Spectrum-Aware Anypath Routing in Multi-Hop Cognitive Radio Networks

Multi-layer Based Multi-Path Routing Algorithm for Maximizing Spectrum Availability