Abstract:In emergency scenarios, strong mobility and serious interference cause unstable transmission of on-site information such as close-up photos and high resolution videos, which requires a robust temporary communication network. In this paper, we focus on a UAV-assisted wireless cooperative communication and coded caching network, where emergency command vehicles and a UAV serve as content providers (CPs) to cache and transmit coded fragments or complete files for rescuers regarded as content requesters (CRs). The delivery success probability and content hit ratio are theoretically derived by incorporating the physical connectivity and social relationship between CPs and CRs. Aiming at maximizing the overall content hit ratio, we propose a multiagent two-timescale deep reinforcement learning (MA2T-DRL) algorithm to jointly optimize the transmission power and caching strategies for CPs. Specifically, we develop a two tier deep-Q networks (DQNs) framework integrating a slow-timescale DQN (ST-DQN) and a fast-timescale DQN (FT-DQN) for caching decision-making and power decision-making respectively, and then the QMIX framework is leveraged to aggregate all the outputs from local ST-DQNs. Considering the cooperative characteristics of coded caching, we further propose a novel clustering method for CPs such that CPs in the same cluster have the same willingness to serve CRs, and each cluster is regarded as the agent for training which further reduces the aggregation scale of the mixing network. Simulation results show that the proposed MA2T-DRL algorithm is efficient in model training, and presents the advantages in performance and complexity compared with the single-agent centralized training and the multiagent independent distributed training.

Double Coded Caching in Ultra Dense Networks: Caching and Multicast Scheduling Via Deep Reinforcement Learning.

Deep Reinforcement Learning for Cooperative Coded Caching Strategy in Fog Radio Access Network

Reinforcement Learning Based Cooperative Coded Caching under Dynamic Popularities in Ultra-Dense Networks

Multi-Agent Reinforcement Learning for Cooperative Coded Caching via Homotopy Optimization

Distributed Cache Replacement for Caching-Enable Base Stations in Cellular Networks.

Coded Caching Schemes for Two-dimensional Caching-aided Ultra-dense Networks

Mean-Field Game Theoretic Edge Caching in Ultra-Dense Networks

Delay-Aware Cache-Enabled Cooperative D2D Transmission in Mobile Cellular Networks

Caching Placement Optimization in UAV-assisted Cellular Networks: A Deep Reinforcement Learning based Framework

Dynamic Coded Caching in Wireless Networks Using Multi-Agent Reinforcement Learning

Deep Reinforcement Learning Approaches for Content Caching in Cache-Enabled D2D Networks

Distributed Caching in Converged Networks: A Deep Reinforcement Learning Approach

Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks

Dynamic Content Update for Wireless Edge Caching via Deep Reinforcement Learning

Reinforcement Learning-Based Optimal Computing and Caching in Mobile Edge Network

Cooperative Cache in Cognitive Radio Networks: A Heterogeneous Multi-Agent Learning Approach

UAV-Assisted Wireless Cooperative Communication and Coded Caching: A Multiagent Two-Timescale DRL Approach

Exploiting Deep Reinforcement Learning for Edge Caching in Cell-Free Massive MIMO Systems

Distributed Caching Popular Services by Using Deep Q-Learning in Converged Networks

Joint Pushing and Caching Based on Physical Layer Multicasting and Network Coding.

Joint Content Caching, Recommendation, and Transmission for Layered Scalable Videos Over Dynamic Cellular Networks: A Dueling Deep Q-Learning Approach