Abstract:In this paper, we present a decentralized sensor-level collision avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent's steering commands in terms of the movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to learn an optimal policy. The policy is trained over a large number of robots in rich, complex environments simultaneously using a policy gradient based reinforcement learning algorithm. The learning algorithm is also integrated into a hybrid control framework to further improve the policy's robustness and effectiveness. We validate the learned sensor-level collision avoidance policy in a variety of simulated and real-world scenarios with thorough performance evaluations for large-scale multi-robot systems. The generalization of the learned policy is verified in a set of unseen scenarios including the navigation of a group of heterogeneous robots and a large-scale scenario with 100 robots. Although the policy is trained using simulation data only, we have successfully deployed it on physical robots with shapes and dynamics characteristics that are different from the simulated agents, in order to demonstrate the controller's robustness against the sim-to-real modeling error. Finally, we show that the collision-avoidance policy learned from multi-robot navigation tasks provides an excellent solution to the safe and effective autonomous navigation for a single robot working in a dense real human crowd. Our learned policy enables a robot to make effective progress in a crowd without getting stuck. Videos are available at <a class="link-external link-https" href="https://sites.google.com/view/hybridmrca" rel="external noopener nofollow">this https URL</a>

Multi-Robot Cooperative Target Encirclement Through Learning Distributed Transferable Policy

Cooperative Flocking And Learning In Multi-Robot Systems For Predator Avoidance

Multi-Robot Learning Dynamic Obstacle Avoidance in Formation with Information-Directed Exploration.

Large Scale Pursuit-Evasion under Collision Avoidance Using Deep Reinforcement Learning.

Multi-robot Target Encirclement Control with Collision Avoidance via Deep Reinforcement Learning

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Learning Hierarchical Graph-Based Policy for Goal-Reaching in Unknown Environments

Socially-Aware Multi-Agent Following with 2D Laser Scans Via Deep Reinforcement Learning and Potential Field

Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios

Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay

Multi-robot Social-aware Cooperative Planning in Pedestrian Environments Using Multi-agent Reinforcement Learning

Reinforcement Learned Distributed Multi-Robot Navigation With Reciprocal Velocity Obstacle Shaped Rewards

Multi-robot social-aware cooperative planning in pedestrian environments using attention-based actor-critic

Multi-Robot Informative Path Planning for Efficient Target Mapping using Deep Reinforcement Learning

C3F: Constant Collaboration and Communication Framework for Graph-Representation Dynamic Multi-Robotic Systems

Cooperative Encirclement Strategy for Multiple Drones Based on ATT-MADDPG

Decentralized Multi-Robot Collision Avoidance in Complex Scenarios With Selective Communication

Learning-Based Multi-Robot Formation Control With Obstacle Avoidance

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Efficient Multi-agent Navigation with Lightweight DRL Policy

Multi-Robot Environmental Coverage With a Two-Stage Coordination Strategy via Deep Reinforcement Learning