Abstract: In this paper, we present a decentralized sensor-level collision avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent's steering commands in terms of movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to learn an optimal policy. The policy is trained simultaneously over a large number of robots in rich, complex environments using a policy-gradient-based reinforcement learning algorithm. The learned policy is also integrated into a hybrid control framework to further improve its robustness and effectiveness. We validate the learned sensor-level collision avoidance policy in a variety of simulated and real-world scenarios, with thorough performance evaluations for large-scale multi-robot systems. The generalization of the learned policy is verified in a set of unseen scenarios, including the navigation of a group of heterogeneous robots and a large-scale scenario with 100 robots. Although the policy is trained using simulation data only, we have successfully deployed it on physical robots whose shapes and dynamics differ from those of the simulated agents, demonstrating the controller's robustness against sim-to-real modeling error. Finally, we show that the collision avoidance policy learned from multi-robot navigation tasks provides an excellent solution for safe and effective autonomous navigation of a single robot working in a dense crowd of real humans. Our learned policy enables a robot to make effective progress in a crowd without getting stuck. Videos are available at https://sites.google.com/view/hybridmrca
