Abstract:In this paper, we present a decentralized sensor-level collision avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent's steering commands in terms of the movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to learn an optimal policy. The policy is trained over a large number of robots in rich, complex environments simultaneously using a policy gradient based reinforcement learning algorithm. The learning algorithm is also integrated into a hybrid control framework to further improve the policy's robustness and effectiveness. We validate the learned sensor-level collision avoidance policy in a variety of simulated and real-world scenarios with thorough performance evaluations for large-scale multi-robot systems. The generalization of the learned policy is verified in a set of unseen scenarios including the navigation of a group of heterogeneous robots and a large-scale scenario with 100 robots. Although the policy is trained using simulation data only, we have successfully deployed it on physical robots with shapes and dynamics characteristics that are different from the simulated agents, in order to demonstrate the controller's robustness against the sim-to-real modeling error. Finally, we show that the collision-avoidance policy learned from multi-robot navigation tasks provides an excellent solution to the safe and effective autonomous navigation for a single robot working in a dense real human crowd. Our learned policy enables a robot to make effective progress in a crowd without getting stuck. Videos are available at <a class="link-external link-https" href="https://sites.google.com/view/hybridmrca" rel="external noopener nofollow">this https URL</a>

Multi-Robot Learning Dynamic Obstacle Avoidance in Formation with Information-Directed Exploration.

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Moving Forward in Formation: A Decentralized Hierarchical Learning Approach to Multi-Agent Moving Together

Cooperative Flocking And Learning In Multi-Robot Systems For Predator Avoidance

Multi-Robot Collaborative Navigation with Formation Adaptation

Learning-Based Multi-Robot Formation Control With Obstacle Avoidance

Reinforcement Learned Distributed Multi-Robot Navigation With Reciprocal Velocity Obstacle Shaped Rewards

Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios

Obstacle Avoidance in Multi-Agent Formation Process Based on Deep Reinforcement Learning

Distributed deep reinforcement learning based on bi-objective framework for multi-robot formation

Multi-UAV Behavior-based Formation with Static and Dynamic Obstacles Avoidance via Reinforcement Learning

Distributed Formation Navigation of Constrained Second-Order Multiagent Systems with Collision Avoidance and Connectivity Maintenance

Adaptive Leader-Follower Formation Control and Obstacle Avoidance via Deep Reinforcement Learning

Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning

Multi-robot consensus formation based on virtual spring obstacle avoidance

Relative Distributed Formation and Obstacle Avoidance with Multi-agent Reinforcement Learning

Distributed Robust Learning based Formation Control of Mobile Robots based on Bioinspired Neural Dynamics

Collision-Free Robot Navigation in Crowded Environments using Learning based Convex Model Predictive Control

Realtime Collision Avoidance for Mobile Robots in Dense Crowds using Implicit Multi-sensor Fusion and Deep Reinforcement Learning

Sequential Neural Barriers for Scalable Dynamic Obstacle Avoidance