Robust Computation Offloading and Trajectory Optimization for Multi-UAV-Assisted MEC: A Multi-Agent DRL Approach

Bin Li, Rongrong Yang, Lei Liu, Junyi Wang, Ning Zhang, Mianxiong Dong
DOI: https://doi.org/10.1109/JIOT.2023.3300718
2023-08-24
Abstract: For multiple Unmanned-Aerial-Vehicle (UAV) assisted Mobile Edge Computing (MEC) networks, we study the joint computation and communication problem for user equipments running multiple task types. Specifically, we consider an MEC network with both communication and computation uncertainties, where only partial channel state information and inaccurate estimates of task complexity are available. We introduce a robust design that accounts for these uncertainties and minimizes the total weighted energy consumption by jointly optimizing the UAV trajectory, the task partition, and the computation and communication resource allocation in the multi-UAV scenario. The formulated problem is challenging to solve because of the coupled optimization variables and the high uncertainties. To overcome this issue, we reformulate the problem as a multi-agent Markov decision process and propose a multi-agent proximal policy optimization framework with a Beta distribution to achieve a flexible learning policy. Numerical results demonstrate the effectiveness and robustness of the proposed algorithm for the multi-UAV-assisted MEC network, which outperforms representative deep reinforcement learning and heuristic benchmarks.
Signal Processing
What problem does this paper attempt to address?
The paper addresses computation offloading and trajectory optimization in multi-UAV (Unmanned Aerial Vehicle) assisted Mobile Edge Computing (MEC) networks. Specifically, the research aims to:
1. **Handle Communication and Computation Uncertainty**: MEC networks face uncertainties such as partial channel state information and inaccurate estimates of task complexity. The research proposes a robust design that accounts for these uncertainties and minimizes the total weighted energy consumption of the system by jointly optimizing UAV trajectories, task partition, and computation and communication resource allocation.
2. **Multi-UAV Collaboration**: A single UAV, with its limited coverage and computational capacity, struggles to serve a large number of user equipments (UEs) efficiently. The research therefore explores collaboration among multiple UAVs to provide more flexible services to users.
3. **Introduce a Multi-Agent Deep Reinforcement Learning Algorithm**: To cope with coupled optimization variables and high uncertainty, the paper proposes an algorithm based on the Multi-Agent Proximal Policy Optimization (MAPPO) framework. It uses the Beta distribution to mitigate the boundary effect in the original MAPPO algorithm, which allows flexible learning of UAV trajectories and task offloading decisions.
In summary, by combining robust design with multi-agent deep reinforcement learning, the paper aims to improve the energy efficiency and reliability of multi-UAV-assisted MEC networks.
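The boundary effect mentioned in the summary can be illustrated with a small sketch. The code below is not the paper's implementation. It is a minimal NumPy example, under assumed parameter values, of why a Beta-distributed policy suits bounded actions such as a task-partition ratio in [0, 1]: a Gaussian policy has to be clipped, which piles probability mass onto the bounds, while a Beta sample already lies inside the interval. The function names, the softplus parameterization, and all numbers are illustrative assumptions.

```python
import math
import numpy as np


def beta_params(raw_alpha, raw_beta):
    """Map unconstrained outputs (e.g. from a policy network) to Beta
    parameters greater than 1, so the density is unimodal.
    Assumption: softplus + 1 is one common choice, not the paper's."""
    softplus = lambda x: np.log1p(np.exp(x))
    return 1.0 + softplus(raw_alpha), 1.0 + softplus(raw_beta)


def sample_action(rng, alpha, beta, low, high):
    """Draw x ~ Beta(alpha, beta) on [0, 1] and rescale it to [low, high]."""
    x = rng.beta(alpha, beta)
    return low + (high - low) * x, x


def beta_log_prob(x, alpha, beta):
    """Log-density of Beta(alpha, beta) at x in (0, 1), as needed for the
    probability ratio in a PPO-style update."""
    log_norm = math.lgamma(alpha) + math.lgamma(beta) - math.lgamma(alpha + beta)
    return (alpha - 1) * math.log(x) + (beta - 1) * math.log(1 - x) - log_norm


rng = np.random.default_rng(0)

# Gaussian policy with mean near the upper bound, clipped to [0, 1].
gauss = np.clip(rng.normal(0.9, 0.3, size=10_000), 0.0, 1.0)
frac_clipped = np.mean((gauss == 0.0) | (gauss == 1.0))

# Beta policy with its mass near the upper bound (hypothetical raw outputs).
alpha, beta = beta_params(2.0, -2.0)
actions = np.array(
    [sample_action(rng, alpha, beta, 0.0, 1.0)[0] for _ in range(10_000)]
)

print("fraction of clipped Gaussian samples on a bound:", frac_clipped)
print("Beta samples min/max:", actions.min(), actions.max())
```

The same rescaling step in `sample_action` would apply to any bounded action, for example a UAV heading angle, by changing `low` and `high`.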