Deep Reinforcement Learning for Dynamic Multichannel Access in Wireless Networks

Shangxing Wang,Hanpeng Liu,Pedro Henrique Gomes,Bhaskar Krishnamachari
DOI: https://doi.org/10.1109/tccn.2018.2809722
IF: 6.359
2018-06-01
IEEE Transactions on Cognitive Communications and Networking
Abstract:We consider a dynamic multichannel access problem, where multiple correlated channels follow an unknown joint Markov model and users select the channel to transmit data. The objective is to find a policy that maximizes the expected long-term number of successful transmissions. The problem is formulated as a partially observable Markov decision process with unknown system dynamics. To overcome the challenges of unknown dynamics and prohibitive computation, we apply the concept of reinforcement learning and implement a deep Q-network (DQN). We first study the optimal policy for fixed-pattern channel switching with known system dynamics and show through simulations that DQN can achieve the same optimal performance without knowing the system statistics. We then compare the performance of DQN with a Myopic policy and a Whittle Index-based heuristic through both more general simulations as well as real data trace and show that DQN achieves near-optimal performance in more complex situations. Finally, we propose an adaptive DQN approach with the capability to adapt its learning in time-varying scenarios.
telecommunications
What problem does this paper attempt to address?