Abstract: This article investigates the multirobot cooperative navigation problem based on raw visual observations. A fully end-to-end learning framework is presented, which leverages graph neural networks to learn local motion coordination and uses deep reinforcement learning to generate a visuomotor policy that enables each robot to move to its goal without requiring an environment map or global positioning information. Experimental results show that, with a few tens of robots, our approach achieves performance comparable to state-of-the-art imitation-learning-based approaches that use bird's-eye-view state inputs. We also demonstrate that the approach generalizes to crowded and large environments and scales to ten times the number of robots used in training. In addition, we show that our model, trained for the multirobot case, also improves the success rate in the single-robot navigation task in unseen environments.

Note to Practitioners: With the development of intelligent industrial and logistic systems, robotic transportation systems are widely deployed. However, existing multirobot path coordination and navigation approaches rely on assumptions that are hard to satisfy in practical scenarios. This article aims to advance the practical application of learning-based multirobot cooperative navigation in the following ways. First, we introduce an end-to-end reinforcement learning framework instead of the commonly used imitation learning strategy, because the latter needs exhaustive training data to cover all scenarios and does not provide the required generalizability. Second, we directly use raw sensor data instead of the commonly used bird's-eye-view semantic observations, because the latter are generally not representative of practical application scenarios from the robot's perspective and cannot handle occlusion. Third, we interpret our learned model to illustrate which parts of the input and shared observations contribute most to the robots' final actions. This interpretability supports the predictability (and thus safety) of our visuomotor policy in practical applications. Our learned visuomotor policy can coordinate dozens of robots using only raw visual observations in unknown environments, without a map or global localization information, which is a first in the literature. Our future work includes solving the sim-to-real issue and conducting physical experiments.

Learning to Navigate using Visual Sensor Networks

See What the Robot Can't See: Learning Cooperative Perception for Visual Navigation

A LiDAR Based End to End Controller for Robot Navigation Using Deep Neural Network

A Navigation Cognitive System Driven by Hierarchical Spiking Neural Network

Vision and Language Navigation in the Real World via Online Visual Language Mapping

Sensor-based Autonomous Robot Navigation under Unknown Environments with Grid Map Representation

Learning to Navigate from Simulation via Spatial and Semantic Information Synthesis with Noise Model Embedding

Learning Social Navigation from Demonstrations with Deep Neural Networks

Learning Autonomous Exploration and Mapping with Semantic Vision

Sparse Image based Navigation Architecture to Mitigate the need of precise Localization in Mobile Robots

A Neural Network-Based Navigation Approach for Autonomous Mobile Robot Systems

Visuomotor Reinforcement Learning for Multirobot Cooperative Navigation

Robot visual navigation estimation and target localization based on neural network

StereoNavNet: Learning to Navigate using Stereo Cameras with Auxiliary Occupancy Voxels

Towards Target-Driven Visual Navigation in Indoor Scenes via Generative Imitation Learning

Object-Based Reliable Visual Navigation for Mobile Robot

End-to-End Navigation in Unknown Environments using Neural Networks

Learning with a Mole: Transferable latent spatial representations for navigation without reconstruction

Navigating to objects in the real world

Vision-based mobile robot navigation through deep convolutional neural networks and end-to-end learning

MGRL: Graph neural network based inference in a Markov network with reinforcement learning for visual navigation