Abstract: In this paper, we consider the design of efficient strategies that allow multiple secondary users to choose which channels to sense and access, when the channels' idle probabilities are unknown and there is no centralized control. Because of the limited sensing capability of cognitive radio (CR), a secondary user cannot sense all channels simultaneously, so an intelligent sensing strategy is crucial for tracking the varying spectrum opportunities. However, the availability probability of each channel is not known a priori, and the need to learn this information creates a fundamental trade-off between exploration and exploitation. First, we consider the scenario in which a single cognitive user wishes to opportunistically exploit idle spectrum. In this case, an index-based strategy from the classical multi-armed bandit problem achieves asymptotically optimal performance. We then consider the multiuser case. The single-user index-based strategy cannot be applied directly to the multiuser scenario: doing so causes collisions among secondary users that greatly degrade overall network performance. We find that randomized selection of the channel to sense is essential to avoid collisions. We extend the exploration and exploitation idea to the multiuser scenario and propose a randomization-based mixed strategy that takes the activity of other secondary users into account during learning while still balancing exploration and exploitation. Numerical simulation results show that, without any information exchange among cognitive users, the proposed scheme achieves total network performance close to that of the centralized scenario.
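
The single-user index-based idea can be illustrated with a minimal sketch. The abstract does not say which index policy is used, so the code below assumes the standard UCB1 index (empirical idle rate plus an exploration bonus) and models each channel as an independent Bernoulli process. The function name ucb1_channel_sensing, the example idle probabilities, and the horizon are all hypothetical choices made for illustration, not the paper's setup.

```python
import math
import random


def ucb1_channel_sensing(idle_probs, horizon, seed=0):
    """Simulate one secondary user sensing one channel per slot with UCB1.

    idle_probs: true idle probabilities (unknown to the user, used only
                to simulate sensing outcomes). Illustrative assumption.
    horizon:    number of time slots to simulate.
    Returns (counts, idle_seen): how often each channel was sensed and
    how often it was found idle.
    """
    rng = random.Random(seed)
    n = len(idle_probs)
    counts = [0] * n      # times each channel has been sensed
    idle_seen = [0] * n   # times each channel was observed idle

    for t in range(1, horizon + 1):
        if t <= n:
            # Initialization: sense every channel once.
            ch = t - 1
        else:
            # UCB1 index: empirical idle rate + exploration bonus.
            ch = max(
                range(n),
                key=lambda i: idle_seen[i] / counts[i]
                + math.sqrt(2.0 * math.log(t) / counts[i]),
            )
        # Simulated sensing outcome for the chosen channel.
        is_idle = rng.random() < idle_probs[ch]
        counts[ch] += 1
        idle_seen[ch] += int(is_idle)

    return counts, idle_seen


counts, idle_seen = ucb1_channel_sensing([0.2, 0.5, 0.9], horizon=5000)
print("times sensed per channel:", counts)
```

In this sketch the exploration bonus shrinks as a channel is sensed more often, so the user gradually concentrates on the channel with the highest idle probability while still occasionally revisiting the others. If several users ran this same deterministic rule independently, they would tend to converge on the same channel, which is the collision problem that motivates the randomized multiuser strategy described in the abstract.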

Multi-user Dynamic Spectrum Access Based on Reinforcement Learning

Dynamic Cooperative Spectrum Sensing Based on Deep Multi-User Reinforcement Learning

Dynamic Spectrum Access in Cognitive Radio Networks Using Deep Reinforcement Learning and Evolutionary Game

Dynamic multiple access based on deep reinforcement learning for Internet of Things

Deep Reinforcement Learning for Dynamic Spectrum Sensing and Aggregation in Multi-Channel Wireless Networks

Dynamic Spectrum Access for Multimedia Transmission over Multi-User, Multi-Channel Cognitive Radio Networks

Dynamic Spectrum Sharing Based on Deep Reinforcement Learning in Mobile Communication Systems

Joint Spectrum Sensing and Access for Stable Dynamic Spectrum Aggregation.

Deep Reinforcement Learning for Dynamic Multichannel Access in Multi-Cognitive Radio Networks

Dynamic Spectrum Access Based on Deep Reinforcement Learning for Multiple Access in Cognitive Radio

A Novel Dynamic Spectrum Access Framework Based on Reinforcement Learning for Cognitive Radio Sensor Networks.

Multi-Agent Reinforcement Learning for Dynamic Spectrum Access.

Dynamic Multichannel Sensing in Cognitive Radio: Hierarchical Reinforcement Learning

Deep Reinforcement Learning-based Distributed Dynamic Spectrum Access in Multi-User Multi-channel Cognitive Radio Internet of Things Networks

Deep Reinforcement Learning for Spectrum Sharing in Future Mobile Communication System.

Dynamic Spectrum Access Based on Prior Knowledge Enabled Reinforcement Learning with Double Actions in Complex Electromagnetic Environment

Traffic Priority-Aware Multi-User Distributed Dynamic Spectrum Access: A Multi-Agent Deep RL Approach

Exploration Vs Exploitation for Distributed Channel Access in Cognitive Radio Networks: A Multi-User Case Study.

Deep Q-Network Based Dynamic Spectrum Access for Cognitive Networks with Limited Spectrum Sensing Capability SUs

Dynamic Access Approach to Multiple Channels in Pervasive Wireless Multimedia Communications for Technology Enhanced Learning

Dynamic Spectrum Access Algorithm Based on Q-learning