Abstract:The rising demand for sustainable information delivery and efficient data transmission in next-generation cloud-enabled non-terrestrial networks necessitates advanced optimization techniques. This paper introduces a novel framework that integrates Power-Domain Non-Orthogonal Multiple Access (PD-NOMA) with Deep Reinforcement Learning (DRL) to optimize satellite constellations and dynamically manage communication links. By leveraging a multi-objective optimization approach, the framework aims to balance key performance indicators, such as link utilization, latency, power efficiency, and network throughput, in satellite communication networks. The proposed methodology is structured into four key stages: (1) Analyzing network data, including user demands in the Mobile Ad Hoc Network, traffic patterns, and satellite positions, to predict future network requirements. (2) Utilizing PD-NOMA for efficient link utilization, enabling multiple users to share the same communication resources, thereby maximizing throughput and minimizing power consumption. (3) Introducing dynamic DRL for adaptive resource allocation and multi-objective optimization of constellation parameters, including satellite positions and communication links. (4) Dynamically adjusting both resource allocation and network configurations in response to real-time network conditions to ensure sustained and optimized performance. Extensive simulations validate the effectiveness of the proposed framework, demonstrating significant improvements in network data rate, energy efficiency, and overall performance. The results indicate that the integration of dynamic link utilization with multi-objective constellation optimization offers an efficient solution for enhancing hierarchical satellite communication systems.

A DRL Resource Allocation for Downlink NOMA Multi-beam Satellite Communications.

DRL-Based Dynamic Resource Allocation for Multi-Beam Satellite Systems

A Dynamic Power Allocation Scheme in Power-Domain NOMA Using Actor-Critic Reinforcement Learning.

A Q-Learning-Based Resource Allocation for Downlink Non-Orthogonal Multiple Access Systems Considering QoS.

Resource Allocation for Multi-service NOMA System Based on Deep Reinforcement Learning

DRL-based Joint Optimization for Energy Efficiency Maximization in DAV-NOMA Networks.

Joint User Clustering And Passive Beamforming For Downlink Noma System With Reconfigurable Intelligent Surface

Improved Satellite Resource Allocation Algorithm Based on DRL and MOP

Dynamic Resource Allocation With Deep Reinforcement Learning in Multibeam Satellite Communication

Multi-Satellite Beam Hopping and Power Allocation Using Deep Reinforcement Learning

Resource Allocation Using Deep Reinforcement Learning in GEO Multibeam Satellite System.

Deep Reinforcement Learning for Resource Allocation With Mixed Traffic in NOMA System.

A Deep Q-Network Based-Resource Allocation Scheme for Massive MIMO-NOMA

DDQN Based Beamwidth and Subcarrier Allocation Strategy for LEO Satellite Communication System with Multi-Beam Capability

Multi objective constellation optimization and dynamic link utilization for sustainable information delivery using PD-NOMA deep reinforcement learning

Multi-objective deep reinforcement learning based time-frequency resource allocation for multi-beam satellite communications

Downlink Non-Orthogonal Multiple Access Power Allocation Algorithm Based on Double Deep Q Network for Ensuring User's Quality of Service

Deep Reinforcement Learning Based Resource Allocation for RSMA in LEO Satellite-Terrestrial Networks

Resource Allocation in Uplink NOMA Systems: A Hybrid-Decision-Based Multi-Agent Deep Reinforcement Learning Approach

Admission Control and Power Allocation for NOMA-Based Satellite Multi-Beam Network