Multi-Agent Reinforcement Learning for Cooperative Task Offloading in Internet-of-Vehicles

Yuchen Lei,Kai Jiang,Zhenning Wang,Yue Cao,Hai Lin,Liang Chen
DOI: https://doi.org/10.1109/wcnc57260.2024.10571109
2024-01-01
Abstract:The Internet of Vehicles (IoV) has witnessed a significant growth in the number of participants. This rapid expansion has increased demands for computing resources and quality of service (QoS), posing challenges for mobile edge computing (MEC) in the IoV domain. Efficiently allocating computing power to meet these service demands has become a crucial concern. Therefore, joint optimization of offloading decisions and power allocation is required to achieve the tradeoff between task latency and energy consumption. To address the above challenge, we propose a multi-agent reinforcement learning (MARL) method called multi-agent twin delayed deep deterministic policy gradient (MA-TD3) in this paper. Compared to its predecessor, multi-agent deep deterministic policy gradient (MADDPG), this algorithm improves performance and execution speed. It solves the slow convergence problem caused by Q-value overestimation and reduces the computational cost. The experimental results illustrate that the proposed algorithm reaches an observable performance improvement.
What problem does this paper attempt to address?