Learning to Continuously Optimize Wireless Resource in a Dynamic Environment: A Bilevel Optimization Perspective
Haoran Sun, Wenqiang Pu, Xiao Fu, Tsung-Hui Chang, Mingyi Hong
DOI: https://doi.org/10.1109/tsp.2022.3143372
IF: 4.875
2022-01-01
IEEE Transactions on Signal Processing
Abstract: There has been growing interest in developing data-driven methods, in particular deep neural network (DNN) based methods, for modern communication tasks. These methods achieve state-of-the-art performance on a few popular wireless resource allocation problems while requiring less computational effort, fewer resources for acquiring channel state information (CSI), etc. However, it is often challenging for these approaches to learn in a dynamic environment. This work develops a new approach that enables data-driven methods to continuously learn and optimize wireless resource allocation in a dynamic environment. Specifically, we consider an episodically dynamic setting in which the environment statistics change from episode to episode, while within each episode the environment is stationary. We propose to build the notion of continual learning (CL) into wireless system design, so that the learning model can incrementally adapt to new episodes without forgetting knowledge learned from previous episodes. We demonstrate the effectiveness of the CL approach by integrating it with three popular DNN-based models for power control, beamforming, and multi-user MIMO, respectively, and testing on both synthetic and ray-tracing-based data sets. The numerical results show that the proposed CL approach not only adapts to new scenarios quickly and seamlessly but also maintains high performance on previously encountered scenarios.
engineering, electrical & electronic