Abstract:Here, we present a deep reinforcement learning‐based scheduling method for minimizing energy usage in multi‐unmanned aerial vehicle networks with a focus on optimizing age of information in disaster‐affected areas. By introducing a hierarchical unmanned aerial vehicle deployment strategy for cooperative path planning, we provide a solution for disaster response to collect fresh data with up to 74.20% energy savings. This letter introduces an innovative approach for minimizing energy consumption in multi‐unmanned aerial vehicles (multi‐UAV) networks using deep reinforcement learning, with a focus on optimizing the age of information (AoI) in disaster environments. A hierarchical UAV deployment strategy that facilitates cooperative trajectory planning, ensuring timely data collection and transmission while minimizing energy consumption is proposed. By formulating the inter‐UAV network path planning problem as a Markov decision process, a deep Q‐network (DQN) strategy is applied to enable real‐time decision making that accounts for dynamic environmental changes, obstacles, and UAV battery constraints. The extensive simulation results, conducted in both rural and urban scenarios, demonstrate the effectiveness of employing a memory access approach within the DQN framework, significantly reducing energy consumption up to 33.25% in rural settings and 74.20% in urban environments compared to non‐memory approaches. By integrating AoI considerations with energy‐efficient UAV control, this work offers a robust solution for maintaining fresh data in critical applications, such as disaster response, where ground‐based communication infrastructures are compromised. The use of replay memory approach, particularly the online history approach, proves crucial in adapting to changing conditions and optimizing UAV operations for both data freshness and energy consumption.

Actor-Critic Deep Reinforcement Learning for Energy Minimization in UAV-Aided Networks

An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments

Penalized Reinforcement Learning-Based Energy-Efficient UAV-RIS Assisted Maritime Uplink Communications Against Jamming

SREC: Proactive Self-Remedy of Energy-Constrained UAV-Based Networks via Deep Reinforcement Learning

Energy optimization and age of information enhancement in multi‐UAV networks using deep reinforcement learning

Blocklength Allocation and Power Control in UAV-Assisted URLLC System via Multi-agent Deep Reinforcement Learning

Deep Reinforcement Learning for Online Routing of Unmanned Aerial Vehicles with Wireless Power Transfer

Computation Offloading and Trajectory Control for UAV-Assisted Edge Computing Using Deep Reinforcement Learning

UAV Trajectory Planning in Wireless Sensor Networks for Energy Consumption Minimization by Deep Reinforcement Learning

Deep Reinforcement Learning-Based Energy Minimization Task Offloading and Resource Allocation for Air Ground Integrated Heterogeneous Networks

Deep Reinforcement Learning for Aerial Data Collection in Hybrid-Powered NOMA-IoT Networks

A Novel Joint DRL-Based Utility Optimization for UAV Data Services

Resource Allocation in UAV-Assisted Networks: A Clustering-Aided Reinforcement Learning Approach

Resource Scheduling Based on Deep Reinforcement Learning in UAV Assisted Emergency Communication Networks

Toward Optimal Resource Allocation: A Multi-Agent DRL Based Task Offloading Approach in Multi-UAV-Assisted MEC Networks

Joint 3D trajectory and phase shift optimization via deep reinforcement learning for RIS-assisted UAV communication systems

DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV

Optimization for Master-UAV-powered Auxiliary-Aerial-IRS-assisted IoT Networks: An Option-based Multi-agent Hierarchical Deep Reinforcement Learning Approach

Task offloading and trajectory scheduling for UAV-enabled MEC networks: An MADRL algorithm with prioritized experience replay

Reinforcement-Learning-Based Optimization on Energy Efficiency in UAV Networks for IoT

Multi-Agent Deep Reinforcement Learning For Optimising Energy Efficiency of Fixed-Wing UAV Cellular Access Points