Abstract: In this paper, we investigate cost-aware joint learning and optimization for multi-channel opportunistic spectrum access in a cognitive radio system. We consider a discrete-time model where the time axis is partitioned into frames. Each frame consists of a sensing phase followed by a transmission phase. During the sensing phase, the user can sense a subset of channels sequentially before deciding to use one of them in the subsequent transmission phase. We assume the channel states alternate between busy and idle according to independent Bernoulli random processes from frame to frame. To capture the inherent uncertainty in channel sensing, we assume the reward of each transmission when the channel is idle is a random variable. We also associate random costs with sensing and transmission actions. Our objective is to understand how the costs and reward of the actions affect the optimal behavior of the user in both offline and online settings, and to design the corresponding opportunistic spectrum access strategies to maximize the expected cumulative net reward (i.e., reward minus cost). We start with an offline setting where the statistics of the channel status, costs, and reward are known beforehand. We show that the optimal policy exhibits a recursive double-threshold structure, and the user needs to compare the channel statistics with those thresholds sequentially in order to decide its actions. With such insights, we then study the online setting, where the statistical information of the channels, costs, and reward is unknown a priori. We judiciously balance exploration and exploitation, and show that the cumulative regret scales as O(log T). We also establish a matching lower bound, which implies that our online algorithm is order-optimal. Simulation results corroborate our theoretical analysis.

Dynamic Spectrum Access in Time-varying Environment: Distributed Learning Beyond Expectation Optimization

Dynamic channel selection in unknown environment based on graphical game and multi-Q learning

Dynamic Adaptation in Wireless Networks Under Comprehensive Interference via Carrier Sense

Joint Spectrum Sensing and Access for Stable Dynamic Spectrum Aggregation

Interference-aware spectrum resource management in dynamic environment: strategic learning with higher-order statistic optimization

Privacy-Preserving Database Assisted Spectrum Access for Industrial Internet of Things: A Distributed Learning Approach

Joint Spectrum Sensing and Access Evolutionary Game in Cognitive Radio Networks

Dynamic Spectrum Access in Cognitive Radio Networks Using Deep Reinforcement Learning and Evolutionary Game

Distributed Learning for Optimal Spectrum Access in Dense Device-to-Device Ad-Hoc Networks

A Multi-Stage Dynamic Spectrum Sharing Framework in Cognitive Radio Networks

Toward Order Optimal Channel Access in Unknown Environments: an Online Learning Method

Almost Optimal Dynamically-Ordered Channel Sensing and Accessing for Cognitive Networks

Exploiting User Demand Diversity in Heterogeneous Wireless Networks

Learning for Dynamic Bidding in Cognitive Radio Resources

Cost-Aware Learning and Optimization for Opportunistic Spectrum Access

A Game-Theoretic Learning Approach for Anti-Jamming Dynamic Spectrum Access in Dense Wireless Networks

Dynamic Spectrum Negotiation with Asymmetric Information

Multi-Channel Sensing and Access Game: Bayesian Social Learning with Negative Network Externality

Dynamic Spectrum Negotiation with Asymmetric Information: Technical Report

Distributed Learning over Markovian Fading Channels for Stable Spectrum Access

Learning distributed channel access policies for networked estimation: data-driven optimization in the mean-field regime