Abstract:In this paper, we present a decentralized sensor-level collision avoidance policy for multi-robot systems, which shows promising results in practical applications. In particular, our policy directly maps raw sensor measurements to an agent's steering commands in terms of the movement velocity. As a first step toward reducing the performance gap between decentralized and centralized methods, we present a multi-scenario multi-stage training framework to learn an optimal policy. The policy is trained over a large number of robots in rich, complex environments simultaneously using a policy gradient based reinforcement learning algorithm. The learning algorithm is also integrated into a hybrid control framework to further improve the policy's robustness and effectiveness. We validate the learned sensor-level collision avoidance policy in a variety of simulated and real-world scenarios with thorough performance evaluations for large-scale multi-robot systems. The generalization of the learned policy is verified in a set of unseen scenarios including the navigation of a group of heterogeneous robots and a large-scale scenario with 100 robots. Although the policy is trained using simulation data only, we have successfully deployed it on physical robots with shapes and dynamics characteristics that are different from the simulated agents, in order to demonstrate the controller's robustness against the sim-to-real modeling error. Finally, we show that the collision-avoidance policy learned from multi-robot navigation tasks provides an excellent solution to the safe and effective autonomous navigation for a single robot working in a dense real human crowd. Our learned policy enables a robot to make effective progress in a crowd without getting stuck. Videos are available at <a class="link-external link-https" href="https://sites.google.com/view/hybridmrca" rel="external noopener nofollow">this https URL</a>

Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

Multi-Robot Learning Dynamic Obstacle Avoidance in Formation with Information-Directed Exploration.

Safe Sim-to-Real Robot Exploration with Constrained Bayesian Optimization

Safe Multi-Agent Reinforcement Learning for Behavior-Based Cooperative Navigation

A safe reinforcement learning approach for autonomous navigation of mobile robots in dynamic environments

Neural Control Barrier Functions for Safe Navigation

Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach

Fully Distributed Multi-Robot Collision Avoidance via Deep Reinforcement Learning for Safe and Efficient Navigation in Complex Scenarios

Sensor-Based Distributionally Robust Control for Safe Robot Navigation in Dynamic Environments

Safe Multi-Agent Reinforcement Learning for Multi-Robot Control

Learning Adaptive Safety for Multi-Agent Systems

Model-free Neural Lyapunov Control for Safe Robot Navigation

Deadlock-free, Safe, and Decentralized Multi-Robot Navigation in Social Mini-Games via Discrete-Time Control Barrier Functions

SafeCrowdNav: safety evaluation of robot crowd navigation in complex scenes

Learning-Based Control Barrier Function with Provably Safe Guarantees: Reducing Conservatism with Heading-Aware Safety Margin

Safety-Aware Preference-Based Learning for Safety-Critical Control

Barrier-Certified Adaptive Reinforcement Learning with Applications to Brushbot Navigation

Data-efficient safe learning and control with on-board sensors: Bayesian meta-learning and barrier function based approach

Safety Correction from Baseline: Towards the Risk-aware Policy in Robotics Via Dual-agent Reinforcement Learning

Collision-Free Robot Navigation in Crowded Environments using Learning based Convex Model Predictive Control

Learning Safe, Generalizable Perception-Based Hybrid Control With Certificates