Abstract:Reconfigurable wireless network can flexibly provide efficient spectrum access service and keep stable operation in highly dynamic environment. In this paper, a primary-prioritized recurrent deep reinforcement learning algorithm for dynamic spectrum access based on cognitive radio (CR) technology is proposed. The spectrum Markov state is modeled to capture the evolution behavior to achieve the priority queuing of the primary users and the secondary users. According to the spectrum access strategies of the secondary users under different optimal criteria, we can obtain the best tradeoff benefits of spectrum access fairness and throughput. Furthermore, we proposed a learning-based algorithm for dynamic spectrum access, which allows the secondary users to modify their parameters to select the optimal access policy to maximize network throughput utilization. The Dueling Deep <span class="mjpage"><svg xmlns:xlink="http://www.w3.org/1999/xlink" width="1.838ex" height="2.509ex" style="vertical-align: -0.671ex;" viewBox="0 -791.3 791.5 1080.4" role="img" focusable="false" xmlns="http://www.w3.org/2000/svg"><g stroke="currentColor" fill="currentColor" stroke-width="0" transform="matrix(1 0 0 -1 0 0)"> <use xlink:href="#MJMATHI-51" x="0" y="0"></use></g></svg></span>-Network (Dueling DQN) with prioritized experience replay combined with recurrent neural network is used to improve the convergence speed. Extensive experimental results demonstrate that the proposed RDRL scheme outperforms the existing Dueling DQN and DQN schemes in terms of convergence speed and channel throughput.<svg xmlns="http://www.w3.org/2000/svg" style="display: none;"><defs id="MathJax_SVG_glyphs"><path stroke-width="1" id="MJMATHI-51" d="M399 -80Q399 -47 400 -30T402 -11V-7L387 -11Q341 -22 303 -22Q208 -22 138 35T51 201Q50 209 50 244Q50 346 98 438T227 601Q351 704 476 704Q514 704 524 703Q621 689 680 617T740 435Q740 255 592 107Q529 47 461 16L444 8V3Q444 2 449 -24T470 -66T516 -82Q551 -82 583 -60T625 -3Q631 11 638 11Q647 11 649 2Q649 -6 639 -34T611 -100T557 -165T481 -194Q399 -194 399 -87V-80ZM636 468Q636 523 621 564T580 625T530 655T477 665Q429 665 379 640Q277 591 215 464T153 216Q153 110 207 59Q231 38 236 38V46Q236 86 269 120T347 155Q372 155 390 144T417 114T429 82T435 55L448 64Q512 108 557 185T619 334T636 468ZM314 18Q362 18 404 39L403 49Q399 104 366 115Q354 117 347 117Q344 117 341 117T337 118Q317 118 296 98T274 52Q274 18 314 18Z"></path></defs></svg>

Reinforcement Learning‐based Spectrum Handoff Scheme with Measured PDR in Cognitive Radio Networks

A Q-Learning Based Spectrum Handoff Scheme with SINR-MOS over Cognitive Radio Networks

DQN-Based Predictive Spectrum Handoff via Hybrid Priority Queuing Model

On the Queue Dynamics of Multiuser Multichannel Cognitive Radio Networks

DDPG with Transfer Learning and Meta Learning Framework for Resource Allocation in Underlay Cognitive Radio Network

Energy Efficient Transmission in Underlay CR-NOMA Networks Enabled by Reinforcement Learning

Spectrum Handoff Scheme Based on Recommended Channel Sensing Sequence

Reinforcement Learning Enabled Cooperative Spectrum Sensing in Cognitive Radio Networks.

Q-Learning-Based Spectrum Access for Multimedia Transmission Over Cognitive Radio Networks

Variable bandwidth spectrum access for secondary user in cognitive radio networks.

Dynamic Spectrum Access for Multimedia Transmission over Multi-User, Multi-Channel Cognitive Radio Networks

Modeling For Spectrum Handoff Based On Secondary Users With Different Priorities In Cognitive Radio Networks

Traffic-Adaptive Proactive Spectrum Handoff Strategy for Graded Secondary Users in Cognitive Radio Networks

Spectrum Handoff for Anti-interference in Cognitive Radio Networks

Dynamic Channel Selection and Transmission Scheduling for Cognitive Radio Networks.

Deep Reinforcement Learning-based Distributed Dynamic Spectrum Access in Multi-User Multi-channel Cognitive Radio Internet of Things Networks

RDRL: A Recurrent Deep Reinforcement Learning Scheme for Dynamic Spectrum Access in Reconfigurable Wireless Networks

Deep Reinforcement Learning-Based RIS-assisted Cooperative Spectrum Sensing in Cognitive Radio Network

Adaptive Power Control Based Spectrum Handover for Cognitive Radio Networks

An Opportunistic Relaying Protocol Exploiting Distributed Beamforming and Token Passing in Cognitive Radios

Sensing-Transmission Tradeoff for Multimedia Transmission in Cognitive Radio Networks