Abstract:Multi-access edge computing (MEC) presents computing services at the edge of networks to address the enormous processing requirements of intelligent applications. Due to the maneuverability of unmanned aerial vehicles (UAVs), they can be used as temporal aerial edge nodes for providing edge services to ground users in MEC. However, MEC environment is usually dynamic and complicated. It is a challenge for multiple UAVs to select appropriate service strategies. Besides, most of existing works study UAV-MEC with the assumption that the flight heights of UAVs are fixed; i.e., the flying is considered to occur with reference to a two-dimensional plane, which neglects the importance of the height. In this paper, with consideration of the co-channel interference, an optimization problem of energy efficiency is investigated to maximize the number of fulfilled tasks, where multiple UAVs in a three-dimensional space collaboratively fulfill the task computation of ground users. In the formulated problem, we try to obtain the optimal flight and sub-channel selection strategies for UAVs and schedule strategies for tasks. Based on the multi-agent deep deterministic policy gradient (MADDPG) algorithm, we propose a curiosity-driven and twin-networks-structured MADDPG (CTMADDPG) algorithm to solve the formulated problem. It uses the inner reward to facilitate the state exploration of agents, avoiding convergence at the sub-optimal strategy. Furthermore, we adopt the twin critic networks for update stabilization to reduce the probability of Q value overestimation. The simulation results show that CTMADDPG is outstanding in maximizing the energy efficiency of the whole system and outperforms the other benchmarks.

Energy Constrained Multi-Agent Reinforcement Learning for Coverage Path Planning

Multi-region Coverage Path Planning for Heterogeneous Unmanned Aerial Vehicles Systems

A Balanced Shadow-Following Coverage Path Planning Approach under Energy Constraints

Multi-UAV Path Planning for Multi-Region Coverage by Multi-Objective Genetic Method

Dense Multi-Agent Reinforcement Learning Aided Multi-UAV Information Coverage for Vehicular Networks

Deep Reinforcement Learning-based Collaborative Multi-UAV Coverage Path Planning

Multi-UAV Coverage Path Planning: A Distributed Online Cooperation Method

Multi-Agent Path Planning for Unmanned Aerial Vehicle Based on Threats Analysis

Balanced Multi-Region Coverage Path Planning for Unmanned Aerial Vehicles.

Trace Pheromone-Based Energy-Efficient UAV Dynamic Coverage Using Deep Reinforcement Learning

Energy-aware Multi-UAV Coverage Mission Planning with Optimal Speed of Flight

Multi-UAV Path Planning and Following Based on Multi-Agent Reinforcement Learning

Multi-UAV Trajectory Planning for Energy-Efficient Content Coverage: A Decentralized Learning-Based Approach

Communication and Energy-Aware Multi-UAV Coverage Path Planning for Networked Operations

Coverage Path Planning Methods Focusing on Energy Efficient and Cooperative Strategies for Unmanned Aerial Vehicles

Multi-UAV Coverage Path Assignment Algorithm Considering Flight Time and Energy Consumption

UAV Path Planning Based on Multi-agent Deep Reinforcement Learning

Collaborative Coverage Path Planning of UAV Cluster based on Deep Reinforcement Learning

Learning-Based UAV Coverage-Aware Path Planning in Large-scale Urban Environments

Multi-UAV Disaster Environment Coverage Planning with Limited-Endurance

A Multi-Agent Collaboration Scheme for Energy-Efficient Task Scheduling in a 3D UAV-MEC Space