Abstract: This article investigates the multirobot cooperative navigation problem based on raw visual observations. A fully end-to-end learning framework is presented, which leverages graph neural networks to learn local motion coordination and uses deep reinforcement learning to generate a visuomotor policy that enables each robot to move to its goal without an environment map or global positioning information. Experimental results show that, with a few tens of robots, our approach achieves performance comparable to state-of-the-art imitation learning-based approaches that use bird's-eye-view state inputs. We also illustrate its generalizability to crowded and large environments and its scalability to ten times the number of robots used in training. In addition, we demonstrate that our model, trained for the multirobot case, can also improve the success rate of single-robot navigation in unseen environments.

Note to Practitioners: With the development of intelligent industrial and logistic systems, robotic transportation systems are widely deployed. However, existing multirobot path coordination and navigation approaches generally rely on unrealistic assumptions that make them very hard to implement in practical scenarios. This article aims to promote the real-world application of learning-based multirobot cooperative navigation through the following contributions. First, we introduce an end-to-end reinforcement learning framework instead of the commonly used imitation learning strategy, because the latter needs exhaustive training data to cover all scenarios and does not have the required generalizability. Second, we directly use raw sensor data instead of the commonly used bird's-eye-view semantic observations, because the latter is generally not representative of practical application scenarios from the robot's perspective and cannot handle occlusion. Third, we interpret our learned model to illustrate which parts of the input and shared observations contribute most to the robots' final actions. This interpretability supports the predictability (and thus safety) of our visuomotor policy in practical applications. Our learned visuomotor policy can coordinate dozens of robots using only raw visual observations in unknown environments, without a map or global localization information, which is a first in the literature. Our future work includes solving the sim-to-real issue and conducting physical experiments.
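The idea of learning local motion coordination over a graph of robots can be illustrated with a single round of neighbor aggregation. The sketch below is not the article's architecture, which the abstract does not specify. It assumes a hypothetical communication radius, hand-written feature vectors standing in for encoded visual observations, and simple mean aggregation in place of a learned graph neural network layer. The positions are used only by the hypothetical simulator to decide which robots can communicate; they are not passed to any policy.

```python
import math


def neighbors_within(positions, radius):
    """Build adjacency lists: robots i and j are linked if their distance <= radius.

    Assumption: `positions` is known only to the simulator, not to the robots.
    """
    n = len(positions)
    adj = [[] for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(positions[i], positions[j]) <= radius:
                adj[i].append(j)
                adj[j].append(i)
    return adj


def aggregate(features, adj):
    """One message-passing round (assumed mean aggregation, no learned weights).

    Each robot's output is its own feature concatenated with the mean of its
    neighbors' features; robots with no neighbors receive a zero vector.
    """
    dim = len(features[0])
    fused = []
    for i, own in enumerate(features):
        if adj[i]:
            mean = [
                sum(features[j][k] for j in adj[i]) / len(adj[i])
                for k in range(dim)
            ]
        else:
            mean = [0.0] * dim
        fused.append(list(own) + mean)
    return fused


# Hypothetical example: three robots, two of them within communication range.
positions = [(0.0, 0.0), (1.0, 0.0), (5.0, 5.0)]
features = [[1.0, 0.0], [0.0, 1.0], [2.0, 2.0]]  # stand-ins for visual encodings

adj = neighbors_within(positions, radius=2.0)
fused = aggregate(features, adj)
```

In a learned framework of this kind, the fused vector for each robot would typically be fed to a policy network trained with reinforcement learning; that step is omitted here because its details are not given in the abstract.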

Multi-robots Formation and Navigation Based Reinforcement Learning

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Multi-Robot Learning Dynamic Obstacle Avoidance in Formation with Information-Directed Exploration

Multi-Robot Collaborative Navigation with Formation Adaptation

Moving Forward in Formation: A Decentralized Hierarchical Learning Approach to Multi-Agent Moving Together

Hierarchical Multi-Robot Navigation and Formation in Unknown Environments Via Deep Reinforcement Learning and Distributed Optimization

Multi-robot formation control using reinforcement learning method

Learning-Based Multi-Robot Formation Control With Obstacle Avoidance

Formation Rotation and Assignment: Avoiding Obstacles in Multi-Robot Scenarios

Underwater Multi-agent Cooperative Formation Hunting Based on Deep Reinforcement Learning

Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

A Deep Reinforcement Learning Approach Combined with Model-Based Paradigms for Multiagent Formation Control with Collision Avoidance

Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning

Visuomotor Reinforcement Learning for Multirobot Cooperative Navigation

Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

The Multi-Dimensional Actions Control Approach for Obstacle Avoidance Based on Reinforcement Learning

Multirobot Unknown Environment Exploration and Obstacle Avoidance Based on a Voronoi Diagram and Reinforcement Learning

Reinforcement Learning Based Multi-robot Formation Control under Separation Bearing Orientation Scheme

Obstacle avoidance of multi mobile robots based on hierarchical reinforcement learning

Decentralized Multi-Robot Formation Control Using Reinforcement Learning

Multi-Robot Cooperative Socially-Aware Navigation Using Multi-Agent Reinforcement Learning