Abstract:To accommodate the explosive wireless traffics, massive multiple-input multiple-output (MIMO) is regarded as one of the key enabling technologies for next-generation communication systems. In massive MIMO cellular networks, coordinated beamforming (CBF), which jointly designs the beamformers of multiple base stations (BSs), is an efficient method to enhance the network performance. In this paper, we investigate the sum rate maximization problem in a massive MIMO mobile cellular network, where in each cell a multi-antenna BS serves multiple mobile users simultaneously via downlink beamforming. Although existing optimization-based CBF algorithms can provide near-optimal solutions, they require realtime and global channel state information (CSI), in addition to their high computation complexity. It is almost impossible to apply them in practical wireless networks, especially highly dynamic mobile cellular networks. Motivated by this, we propose a deep reinforcement learning based distributed dynamic coordinated beamforming (DDCBF) framework, which enables each BS to determine the beamformers with only local CSI and some historical information from other BSs.Besides, the beamformers can be calculated with a considerably lower computational complexity by exploiting neural networks and expert knowledge, i.e., a solution structure observed from the iterative procedure of the weighted minimum mean square error (WMMSE) algorithm. Moreover, we provide extensive numerical simulations to validate the effectiveness of the proposed DRL-based approach. With lower computational complexity and less required information, the results show that the proposed approach can achieve comparable performance to the centralized iterative optimization algorithms.

Beam Hopping Scheduling Based on Deep Reinforcement Learning

Dynamic Beam Pattern and Bandwidth Allocation Based on Multi-Agent Deep Reinforcement Learning for Beam Hopping Satellite Systems

Satellite-Terrestrial Coordinated Multi-Satellite Beam Hopping Scheduling Based on Multi-Agent Deep Reinforcement Learning

DRL-Based Dynamic Resource Allocation for Multi-Beam Satellite Systems

Efficient Initial Access with Deep Reinforcement Learning Based Beam Sweeping in Wireless Cellular Communication Systems

Buffer-Aware Wireless Scheduling Based On Deep Reinforcement Learning

Dynamic Beam Hopping for DVB-S2X GEO Satellite: A DRL-Powered GA Approach

Deep-Reinforcement-Learning-Based Scheduling with Contiguous Resource Allocation for Next-Generation Cellular Systems

Deep Reinforcement Learning Based Dynamic Beam Selection in Dual-Band Communication Systems

Semantic-aware Transmission Scheduling: a Monotonicity-driven Deep Reinforcement Learning Approach

Dynamic Resource Allocation With Deep Reinforcement Learning in Multibeam Satellite Communication

Traffic-Aware Hierarchical Beam Selection for Cell-Free Massive MIMO

Deep Reinforcement Learning for Beam Management in UAV Relay mmWave Networks

Deep Reinforcement Learning for Energy-Efficient Beamforming Design in Cell-Free Networks

Deep Reinforcement Learning-Based Resource Management in Maritime Communication Systems

Beam management optimization for V2V communications based on deep reinforcement learning

Joint Band Assignment and Beam Management using Hierarchical Reinforcement Learning for Multi-Band Communication

Multi-User Delay-Constrained Scheduling With Deep Recurrent Reinforcement Learning

Digital twin‐enabled deep reinforcement learning for joint scheduling of ultra‐reliable low latency communication and enhanced mobile broad band: A reliability‐guaranteed approach

Deep Reinforcement Learning for Distributed Dynamic Coordinated Beamforming in Massive MIMO Cellular Networks