Stable Resource Allocation Based on Multi-agent Reinforcement Learning for Edge Computing

Song Jiang,Xuejie Zhang
DOI: https://doi.org/10.1109/icfeict57213.2022.00083
2022-01-01
Abstract:To address the needs of distributed decision making and maintaining long-term system stability in edge computing resource allocation, we propose a distributed resource allocation framework DLMAO, and multi-agent deep reinforcement learning-based allocation algorithms QSLSQP and IDSAC deployed for cooperative and non-cooperative scenarios, respectively. Considering the stochastic nature of the environment and the mobility of terminals, the resource allocation problem is modeled as a mixed-integer nonlinear programming (MINLP) problem on sequential time series. The DLMAO framework is based on Lyapunov optimization, which decouples the programming into multiple independent MINLPs and ensures the system's long-term stability, and then QSLSQP and IDSAC are used to solve binary offloading and resource allocation problems in the hybrid action space to maximize the system's weighted data processing rate with limited resource constraints. Both algorithms allow every agent to make autonomous decisions. Experimental results demonstrate that the QSLSQP algorithm, which utilizes global information in a cooperative scenario, has the best performance, and the fully autonomous IDSAC algorithm outperforms the single-agent global optimization approach. The extremely short decision latency of both algorithms makes them appropriate to be applied in edge computing environments.
What problem does this paper attempt to address?