Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient

Vinicius Lima Silva,Mark Eisen,Konstantinos Gatsis,Alejandro Ribeiro
DOI: https://doi.org/10.48550/arXiv.1910.11900
2019-10-26
Abstract:In wireless control systems, remote control of plants is achieved through closing of the control loop over a wireless channel. As wireless communication is noisy and subject to packet dropouts, proper allocation of limited resources, e.g. transmission power, across plants is critical for maintaining reliable operation. In this paper, we formulate the design of an optimal resource allocation policy that uses current plant states and wireless channel states to assign resources used to send control actuation information back to plants. While this problem is challenging due to its infinite dimensionality and need for explicit system model and state knowledge, we propose the use of deep reinforcement learning techniques to find neural network-based resource allocation policies. In particular, we use model-free policy gradient methods to directly learn continuous power allocation policies without knowledge of plant dynamics or communication models. Numerical simulations demonstrate the strong performance of learned policies relative to baseline resource allocation methods in settings where state information is available both with and without noise.
Systems and Control,Signal Processing
What problem does this paper attempt to address?