Abstract:Networked robots have become crucial for unmanned applications since they can collaborate to complete complex tasks in remote/hazardous/depopulated areas. Due to the cost inefficiency of deploying cellular network infrastructure in these areas, hybrid satellite-UAV networks emerge as a promising solution. These networks provide seamless and on-demand connectivity for multiple robots with various task requirements, and support computation-intensive and latency-sensitive services through mobile edge computing (MEC)-based offloading. However, to complete tasks in limited times, the rapid collective movement of mobile robots may cause frequent service migration, and a large number of gathered robots may compete for limited bandwidth resources in satellite and UAV communications. As a result, offloading latency may increase significantly. To address this issue, the average completion time of multi-robot offloading in task-oriented satellite-UAV networks with MEC is formulated as an optimization problem. Unlike conventional mobility-aware MEC-based offloading schemes, joint optimization of mobility control, data offloading, and resource allocation is proposed using velocity control of multiple robots. According to Lyapunov optimization, the original optimization problem is simplified into minimizing the average completion time of offloading for all robots within UAV and satellite coverage. A multi-agent $Q$ -learning algorithm, including multi-group dual-agent $Q$ -learning, is proposed based on local state observation and global reward calculation. In each dual-agent $Q$ -learning, one agent is responsible for velocity control and communication resource allocation, while the other is responsible for data offloading and computational resource allocation. The convergence of the proposed multi-agent $Q$ -learning algorithm is also theoretically analyzed. Simulation results show that the proposed scheme can effectively reduce the offloading latency by up to 35% in the multi-robot environment over its conventional counterparts.

Traffic Optimization in Satellites Communications: A Multi-agent Reinforcement Learning Approach

Efficient Resource Allocation for Multi-Beam Satellite-Terrestrial Vehicular Networks: A Multi-Agent Actor-Critic Method With Attention Mechanism

Load-Aware Satellite Handover Strategy Based on Multi-Agent Reinforcement Learning

Satellite-Terrestrial Coordinated Multi-Satellite Beam Hopping Scheduling Based on Multi-Agent Deep Reinforcement Learning

Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning

Shaping Rewards, Shaping Routes: On Multi-Agent Deep Q-Networks for Routing in Satellite Constellation Networks

Multi-Agent Deep Reinforcement Learning Based Channel Allocation for Networked Satellite Telemetry System.

Task-Oriented Satellite-UAV Networks with Mobile-Edge Computing

Towards Multi-agent Reinforcement Learning based Traffic Signal Control through Spatio-temporal Hypergraphs

Task Offloading in MEC-Aided Satellite-Terrestrial Networks: A Reinforcement Learning Approach.

A Multi-agent Reinforcement Learning Perspective on Distributed Traffic Engineering

Joint Optimization of Traffic Signal Control and Vehicle Routing in Signalized Road Networks using Multi-Agent Deep Reinforcement Learning

Stigmergy and Hierarchical Learning for Routing Optimization in Multi-domain Collaborative Satellite Networks

Dynamic Routing Planning Method for Large-Scale Low-Orbit Satellite Networks Based on Location Guided and Multi-Agent DQN Network

Dynamic Routing for Integrated Satellite-Terrestrial Networks: A Constrained Multi-Agent Reinforcement Learning Approach

Multi-agent Assessment with QoS Enhancement for HD Map Updates in a Vehicular Network

Agent-Agnostic Centralized Training for Decentralized Multi-Agent Cooperative Driving

Batch-Augmented Multi-Agent Reinforcement Learning for Efficient Traffic Signal Optimization

Nash Soft Actor-Critic LEO Satellite Handover Management Algorithm for Flying Vehicles

Dynamic Resource Management in Integrated NOMA Terrestrial-Satellite Networks using Multi-Agent Reinforcement Learning

Coverage-aware and Reinforcement Learning Using Multi-agent Approach for HD Map QoS in a Realistic Environment