Stochastic Graph Neural Network-Based Value Decomposition for Multi-Agent Reinforcement Learning in Urban Traffic Control.

Baidi Xiao, Rongpeng Li, Fei Wang, Chenghui Peng, Jianjun Wu, Zhifeng Zhao, Honggang Zhang
DOI: https://doi.org/10.1109/vtc2023-spring57618.2023.10199985
2023-01-01
Abstract: Multi-Agent Reinforcement Learning (MARL) has achieved remarkable results in various fields, such as the traffic control of vehicles in a wirelessly connected environment. In MARL, how to effectively decompose a global feedback signal into the relative contributions of individual agents is one of the most fundamental problems. However, the volatility of the environment (e.g., vehicle movement and wireless disturbance) can significantly shape the time-varying topological relationships among agents, making Value Decomposition (VD) challenging. To cope with this volatility, a dynamic VD framework is needed. In this paper, we propose a novel Stochastic VMIX (SVMIX) methodology that embeds dynamic topological features into the VD and incorporates the corresponding components into a multi-agent actor-critic architecture. In particular, a Stochastic Graph Neural Network (SGNN) is leveraged to extract the underlying dynamics embedded in the topological features and to improve the flexibility of VD against environment volatility. Finally, the superiority of SVMIX is verified through extensive simulations.
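To make the idea of value decomposition over a volatile topology more concrete, the following is a minimal sketch of a stochastic graph filter applied to per-agent values. It is not the SVMIX architecture from the paper: the function names, the independent link-drop model with probability `keep_prob`, the filter coefficients, and the final summation in `mix_values` are all assumptions made for illustration.

```python
import random


def sample_topology(edges, keep_prob, rng):
    """Keep each undirected edge independently with probability keep_prob
    (assumed model of links dropping due to movement or wireless disturbance)."""
    return [e for e in edges if rng.random() < keep_prob]


def graph_shift(values, edges):
    """One graph shift: each agent sums the values of its current neighbors."""
    out = [0.0] * len(values)
    for i, j in edges:
        out[i] += values[j]
        out[j] += values[i]
    return out


def stochastic_graph_filter(values, edges, coeffs, keep_prob, rng):
    """Compute sum_k coeffs[k] * S_k ... S_1 x, drawing a fresh random
    topology S_k for every hop."""
    shifted = list(values)
    out = [coeffs[0] * v for v in shifted]
    for h in coeffs[1:]:
        shifted = graph_shift(shifted, sample_topology(edges, keep_prob, rng))
        out = [o + h * s for o, s in zip(out, shifted)]
    return out


def mix_values(local_values, edges, coeffs, keep_prob, rng):
    """Hypothetical mixing step: the global value is the sum of the
    filtered per-agent values."""
    return sum(stochastic_graph_filter(local_values, edges, coeffs, keep_prob, rng))


if __name__ == "__main__":
    rng = random.Random(0)
    local_values = [1.0, 2.0, 3.0]   # illustrative per-agent values
    edges = [(0, 1), (1, 2)]         # illustrative line topology of three agents
    print(mix_values(local_values, edges, [1.0, 0.5], 0.8, rng))
```

With `keep_prob=1.0` the filter reduces to an ordinary deterministic graph filter, and with `keep_prob=0.0` every agent's value passes through unmixed, so the parameter interpolates between a fixed topology and fully isolated agents.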