Abstract: Centralized multi-robot path planning is a prevalent approach involving a global planner computing feasible paths for each robot using shared information. Nonetheless, this approach encounters limitations due to communication constraints and computational complexity. To address these challenges, we introduce a novel decentralized multi-robot path planning approach that eliminates the need for sharing the states and intentions of robots. Our approach harnesses deep reinforcement learning and features an asynchronous multi-critic twin delayed deep deterministic policy gradient (AMC-TD3) algorithm, which enhances the original GRU-Attention-based TD3 algorithm by incorporating a multi-critic network and employing an asynchronous training mechanism. By training each critic with a unique reward function, our learned policy enables each robot to navigate towards its long-term objective without colliding with other robots in complex environments. Furthermore, our reward function, grounded in social norms, allows the robots to naturally avoid each other in congested situations. Specifically, we train three critics to encourage each robot to achieve its long-term navigation goal, maintain its moving direction, and prevent collisions with other robots. Our model can learn an end-to-end navigation policy without relying on an accurate map or any localization information, rendering it highly adaptable to various environments. Simulation results reveal that our proposed approach surpasses baselines in several environments with different levels of complexity and robot populations.
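The three-critic idea can be illustrated with a minimal sketch. The abstract does not give the reward formulas, so everything below is an assumption made for illustration: the `Step` fields, the three reward shapes, the `safe_dist` threshold, and the premise that each critic keeps twin Q-estimates and uses the standard TD3 clipped double-Q target. It is not the paper's implementation.

```python
import math
from dataclasses import dataclass


@dataclass
class Step:
    """One transition for one robot (hypothetical fields, not from the paper)."""
    prev_goal_dist: float     # distance to goal before the action (m)
    goal_dist: float          # distance to goal after the action (m)
    prev_heading: float       # heading before the action (rad)
    heading: float            # heading after the action (rad)
    min_robot_dist: float     # distance to the nearest other robot (m)


def goal_reward(s: Step) -> float:
    # Assumed shape: positive when the robot gets closer to its goal.
    return s.prev_goal_dist - s.goal_dist


def heading_reward(s: Step) -> float:
    # Assumed shape: penalize the wrapped change in heading, so that
    # keeping the current moving direction scores highest (zero).
    diff = s.heading - s.prev_heading
    wrapped = math.atan2(math.sin(diff), math.cos(diff))
    return -abs(wrapped)


def collision_reward(s: Step, safe_dist: float = 0.5) -> float:
    # Assumed shape: linear penalty once another robot is inside safe_dist.
    if s.min_robot_dist >= safe_dist:
        return 0.0
    return -(safe_dist - s.min_robot_dist) / safe_dist


def critic_rewards(s: Step) -> tuple:
    """One scalar reward per critic: (goal, heading, collision)."""
    return (goal_reward(s), heading_reward(s), collision_reward(s))


def td_targets(rewards, next_q_pairs, gamma=0.99, done=False):
    """Per-critic TD3 target: r_k + gamma * min(Q1_k', Q2_k').

    next_q_pairs holds one (Q1', Q2') pair of target-network estimates
    per critic; taking the minimum is TD3's clipped double-Q trick.
    """
    bootstrap = 0.0 if done else gamma
    return [r + bootstrap * min(q1, q2)
            for r, (q1, q2) in zip(rewards, next_q_pairs)]
```

Because each critic sees only its own reward signal, the goal, direction, and collision objectives stay separate during value learning. How the actor combines the three critics' estimates is not described in the abstract, so it is left out of this sketch.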

Deep Reinforcement Learning with Multi-Critic TD3 for Decentralized Multi-Robot Path Planning

Mapless Collaborative Navigation for a Multi-Robot System Based on the Deep Reinforcement Learning

Multi-objective Path Planning Based on Deep Reinforcement Learning

Decentralized Motion Planning for Multi-Robot Navigation using Deep Reinforcement Learning

Multi-Robot Path Planning Method Using Reinforcement Learning

Navigation Based on Hybrid Decentralized and Centralized Training and Execution Strategy for Multiple Mobile Robots Reinforcement Learning

Multi-Robot Informative Path Planning for Efficient Target Mapping using Deep Reinforcement Learning

A Mapless Local Path Planning Approach Using Deep Reinforcement Learning Framework

A decentralized path planning model based on deep reinforcement learning

TD3 Based Collision Free Motion Planning for Robot Navigation

Novel task decomposed multi-agent twin delayed deep deterministic policy gradient algorithm for multi-UAV autonomous path planning

Multi-robot social-aware cooperative planning in pedestrian environments using attention-based actor-critic

Path Planning of Autonomous Mobile Robot in Comprehensive Unknown Environment Using Deep Reinforcement Learning

Decentralized Task and Path Planning for Multi-Robot Systems

Multi-robot Social-aware Cooperative Planning in Pedestrian Environments Using Multi-agent Reinforcement Learning

Improved Robot Path Planning Method Based on Deep Reinforcement Learning

Deep Reinforcement Learning-based Collaborative Multi-UAV Coverage Path Planning

Deep Reinforcement Learning for Indoor Mobile Robot Path Planning

Path Planning Method for Manipulators Based on Improved Twin Delayed Deep Deterministic Policy Gradient and RRT*

Deep Reinforcement Learning for Decentralized Multi-Robot Control: A DQN Approach to Robustness and Information Integration