Abstract: This article investigates the multirobot cooperative navigation problem based on raw visual observations. A fully end-to-end learning framework is presented, which leverages graph neural networks to learn local motion coordination and uses deep reinforcement learning to generate a visuomotor policy that enables each robot to move to its goal without an environment map or global positioning information. Experimental results show that, with a few tens of robots, our approach achieves performance comparable to state-of-the-art imitation learning-based approaches that take bird's-eye-view state inputs. We also illustrate the generalizability of our approach to crowded and large environments and its scalability to ten times the number of robots used in training. In addition, we demonstrate that our model, trained for the multirobot case, can also improve the success rate of single-robot navigation in unseen environments.

Note to Practitioners: With the development of intelligent industrial and logistic systems, robotic transportation systems are widely deployed. However, existing multirobot path coordination and navigation approaches generally rest on unrealistic assumptions that make them very hard to implement in practical scenarios. This article aims to promote the real-world application of learning-based multirobot cooperative navigation through the following contributions. First, we introduce an end-to-end reinforcement learning framework instead of the commonly used imitation learning strategy, because the latter needs exhaustive training data to cover all scenarios and lacks the required generalizability. Second, we directly use raw sensor data instead of the commonly used bird's-eye-view semantic observations, because the latter is generally not representative of practical application scenarios from the robot's perspective and cannot handle occlusion. Third, we interpret our learned model to illustrate which parts of the input and shared observations contribute most to the robots' final actions. This interpretability ensures the predictability (and thus safety) of our visuomotor policy in practical applications. Our learned visuomotor policy can coordinate dozens of robots in unknown environments using only raw visual observations, with neither a map nor global localization information, which is a first in the literature. Our future work includes solving the sim-to-real issue and conducting physical experiments.

Multi-robot Cooperative Navigation Method based on Multi-agent Reinforcement Learning in Sparse Reward Tasks

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding

Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Safe Multi-Agent Reinforcement Learning for Behavior-Based Cooperative Navigation

Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

Multi-robot Social-aware Cooperative Planning in Pedestrian Environments Using Multi-agent Reinforcement Learning

Visuomotor Reinforcement Learning for Multirobot Cooperative Navigation

Multiple Ships Cooperative Navigation and Collision Avoidance using Multi-agent Reinforcement Learning with Communication

Edge-conditioned vector basis functions for the analysis and optimization of rectangular waveguide dual-mode filters

Multi-Robot Informative Path Planning for Efficient Target Mapping using Deep Reinforcement Learning

Multi-robot social-aware cooperative planning in pedestrian environments using attention-based actor-critic

Safe Multi-Agent Reinforcement Learning for Multi-Robot Control

Enhancing Robotic Navigation: An Evaluation of Single and Multi-Objective Reinforcement Learning Strategies

Multi-Robot Collaborative Navigation with Formation Adaptation

Cooperative Reward Shaping for Multi-Agent Pathfinding

Improving multi-UAV cooperative path-finding through multiagent experience learning

Implantable cardioverter defibrillators after acute myocardial infarction

Co-NavGPT: Multi-Robot Cooperative Visual Semantic Navigation using Large Language Models

Attention-Cooperated Reinforcement Learning for Multi-agent Path Planning