Abstract:The integration of terrestrial and satellite wireless communication networks offers a practical solution to enhance network coverage, connectivity, and cost-effectiveness. Moreover, in today’s interconnected world, connectivity’s reliable and widespread availability is increasingly important across various domains. This is especially more crucial for applications like the Internet of Things (IoT), remote sensing, disaster management, and bridging the digital divide. However, allocating the limited network resources efficiently and ensuring seamless handover between satellite and terrestrial networks present significant challenges. Therefore, this study introduces a resource allocation framework for integrated satellite–terrestrial networks to address these challenges. The framework leverages local cache pool deployments and non-orthogonal multiple access (NOMA) to reduce time delays and improve energy efficiency. Our proposed approach utilizes a multi-agent enabled deep deterministic policy gradient algorithm (MADDPG) to optimize user association, cache design, and transmission power control, resulting in enhanced energy efficiency. The approach comprises two phases: User Association and Power Control, where users are treated as agents, and Cache Optimization, where the satellite (Bs) is considered the agent. Through extensive simulations, we demonstrate that our approach surpasses conventional single-agent deep reinforcement learning algorithms in addressing cache design and resource allocation challenges in integrated terrestrial–satellite networks. Specifically, our proposed approach achieves significantly higher energy efficiency and reduced time delays compared to existing methods. This research highlights the importance and addresses the need for efficient resource allocation and cache design in integrated terrestrial–satellite networks, paving the way for enhanced connectivity and improved network performance in various applications.

Multi-Agent Deep Reinforcement Learning-Based Flexible Satellite Payload for Mobile Terminals.

Dynamic Beam Pattern and Bandwidth Allocation Based on Multi-Agent Deep Reinforcement Learning for Beam Hopping Satellite Systems

Multi-Agent Reinforcement Learning Based UAV Swarm Communications Against Jamming

Multi-Agent Deep Reinforcement Learning Based Channel Allocation for Networked Satellite Telemetry System.

Satellite-Terrestrial Coordinated Multi-Satellite Beam Hopping Scheduling Based on Multi-Agent Deep Reinforcement Learning

Double-Timescale Multi-Agent Deep Reinforcement Learning for Flexible Payload in VHTS Systems

Efficient Resource Allocation for Multi-Beam Satellite-Terrestrial Vehicular Networks: A Multi-Agent Actor-Critic Method With Attention Mechanism

Load-Aware Satellite Handover Strategy Based on Multi-Agent Reinforcement Learning

Flexible Robust Beamforming for Multibeam Satellite Downlink using Reinforcement Learning

DRL-Based Dynamic Resource Allocation for Multi-Beam Satellite Systems

Multi-Agent Reinforcement Learning Based Unlicensed Resource Sharing for LTE-U Networks.

Traffic Optimization in Satellites Communications: A Multi-agent Reinforcement Learning Approach

Multi-objective deep reinforcement learning based time-frequency resource allocation for multi-beam satellite communications

Multi - Agent Deep Deterministic Policy Gradient Based Satellite Spectrum/Code Resource Scheduling with Multi-constraint

Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

+2cmFlexible Resource Management in High-throughput Satellite Communication Systems: A Two-stage Machine Learning Framework

Flexible Payload Configuration for Satellites using Machine Learning

Multi-Agent Deep Reinforcement Learning for Dynamic Laser Inter-Satellite Link Scheduling

Dynamic Resource Management in Integrated NOMA Terrestrial-Satellite Networks using Multi-Agent Reinforcement Learning

Dynamic resource management in integrated NOMA terrestrial–satellite networks using multi-agent reinforcement learning

Collaborative Ground-Space Communications via Evolutionary Multi-objective Deep Reinforcement Learning