Abstract:The advent of the Internet of Things (IoT) has triggered an increased demand for sensing devices with multiple integrated wireless transceivers. These platforms often support the advantageous use of multiple radio technologies to exploit their differing characteristics. Intelligent radio selection techniques allow these systems to become highly adaptive, ensuring more robust and reliable communications under dynamic channel conditions. In this paper, we focus on the wireless links between devices equipped by deployed operating personnel and intermediary access-point infrastructure. We use multi-radio platforms and wireless devices with multiple and diverse transceiver technologies to produce robust and reliable links through the adaptive control of available transceivers. In this work, the term ‘robust’ refers to communications that can be maintained despite changes in the environmental and radio conditions, i.e., during periods of interference caused by non-cooperative actors or multi-path or fading conditions in the physical environment. In this paper, a multi-objective reinforcement learning (MORL) framework is applied to address a multi-radio selection and power control problem. We propose independent reward functions to manage the trade-off between the conflicting objectives of minimised power consumption and maximised bit rate. We also adopt an adaptive exploration strategy for learning a robust behaviour policy and compare its online performance to conventional methods. An extension to the multi-objective state–action–reward–state–action (SARSA) algorithm is proposed to implement this adaptive exploration strategy. When applying adaptive exploration to the extended multi-objective SARSA algorithm, we achieve a 20% increase in the F1 score in comparison to one with decayed exploration policies.

Reinforcement-learning-based Wireless Resource Allocation

Delay-Aware Stochastic Resource Management for Mobile Edge Computing Systems Via Constrained Reinforcement Learning

DDPG with Transfer Learning and Meta Learning Framework for Resource Allocation in Underlay Cognitive Radio Network

To RL or not to RL? An Algorithmic Cheat-Sheet for AI-Based Radio Resource Management

Graph Reinforcement Learning for Radio Resource Allocation

Deep Reinforcement Learning-based Power Control and Bandwidth Allocation Policy for Weighted Cost Minimization in Wireless Networks

Sequential Learning And Decision-Making In Wireless Resource Management Preface

Secure Deep Reinforcement Learning for Dynamic Resource Allocation in Wireless MEC Networks

Adaptive transmission scheduling over fading channels for energy-efficient cognitive radio networks by reinforcement learning

An MDP approach for radio resource allocation in urban Future Railway Mobile Communication System (FRMCS) scenarios

Joint Resource Management for MC-NOMA: A Deep Reinforcement Learning Approach

Intelligent Multi-Radio Access Based on Markov Decision Process.

Reinforcement-Learning-Based Robust Resource Management for Multi-Radio Systems

Deep Reinforcement Learning for Radio Resource Allocation in NOMA-based Remote State Estimation

Wireless Resource Scheduling in Virtualized Radio Access Networks Using Stochastic Learning.

Age of Information-Aware Radio Resource Management in Vehicular Networks: A Proactive Deep Reinforcement Learning Perspective

Optimization Theory Based Deep Reinforcement Learning for Resource Allocation in Ultra-Reliable Wireless Networked Control Systems

Offline Reinforcement Learning for Wireless Network Optimization with Mixture Datasets

S-MFRL: Spiking Mean Field Reinforcement Learning for Dynamic Resource Allocation of D2D Networks

Green Deep Reinforcement Learning for Radio Resource Management: Architecture, Algorithm Compression and Challenge