Abstract:In this paper, we investigate cost-aware joint learning and optimization for multi-channel opportunistic spectrum access in a cognitive radio system. We investigate a discrete time model where the time axis is partitioned into frames. Each frame consists of a sensing phase, followed by a transmission phase. During the sensing phase, the user is able to sense a subset of channels sequentially before it decides to use one of them in the following transmission phase. We assume the channel states alternate between busy and idle according to independent Bernoulli random processes from frame to frame. To capture the inherent uncertainty in channel sensing, we assume the reward of each transmission when the channel is idle is a random variable. We also associate random costs with sensing and transmission actions. Our objective is to understand how the costs and reward of the actions would affect the optimal behavior of the user in both offline and online settings, and design the corresponding opportunistic spectrum access strategies to maximize the expected cumulative net reward (i.e., reward-minus-cost). We start with an offline setting where the statistics of the channel status, costs and reward are known beforehand. We show that the the optimal policy exhibits a recursive double threshold structure, and the user needs to compare the channel statistics with those thresholds sequentially in order to decide its actions. With such insights, we then study the online setting, where the statistical information of the channels, costs and reward are unknown a priori. We judiciously balance exploration and exploitation, and show that the cumulative regret scales in O(log T). We also establish a matched lower bound, which implies that our online algorithm is order-optimal. Simulation results corroborate our theoretical analysis.

One Step Beyond Myopic Probing Policy: A Heuristic Lookahead Policy for Multi-Channel Opportunistic Access

Multi-channel opportunistic spectrum access: A mixed-scale decision perspective

Tunable Probing: Towards Timely Channel Selection Mechanism in Dynamic Multi-channel Wireless Networks

Energy Efficiency and Contact Opportunities Tradeoff in Opportunistic Mobile Networks

Online Sequential Channel Accessing Control: A Double Exploration Vs. Exploitation Problem

Multiple access algorithm based on channel sensing and prediction

SPA : Almost Optimal Sequential Channel Sensing , Probing , Accessing in Cognitive Radio Networks

Exploration Vs Exploitation for Distributed Channel Access in Cognitive Radio Networks: A Multi-User Case Study.

Toward Order Optimal Channel Access in Unknown Environments: an Online Learning Method

A POMDP-based optimal spectrum sensing and access scheme for cognitive radio networks with hardware limitation

Optimality of Myopic Sensing in Multi-Channel Opportunistic Access

An order optimal policy for exploiting idle spectrum in cognitive radio networks

A Rollout-Based Joint Spectrum Sensing and Access Policy for Cognitive Radio Networks with Hardware Limitations

Observation Vs Statistics: Near Optimal Online Channel Access In Cognitive Radio Networks

Cost-Aware Learning and Optimization for Opportunistic Spectrum Access

Sensing-Transmission Tradeoff for Multimedia Transmission in Cognitive Radio Networks

Exploiting Channel Correlation and PU Traffic Memory for Opportunistic Spectrum Scheduling

Almost Optimal Dynamically-Ordered Channel Sensing and Accessing for Cognitive Networks.

Hopping-Based Channel Access in Cognitive Radio Systems

On Design of Opportunistic Spectrum Access in the Presence of Reactive Primary Users

Optimality of Multichannel Myopic Sensing in the Presence of Sensing Error for Opportunistic Spectrum Access