Abstract: Intelligent agents and multi-agent systems play important roles in applications such as the control of drone swarms, and multi-agent navigation and obstacle avoidance, a foundational capability for more advanced applications, is therefore of great importance. In these tasks, the interacting decisions of agents and their dynamic changes become difficult for traditional route planning algorithms or reinforcement learning algorithms to handle as the environment grows more complex. The classical multi-agent reinforcement learning algorithm, multi-agent deep deterministic policy gradient (MADDPG), addressed two problems of earlier algorithms: a non-stationary training process and an inability to cope with environmental randomness. However, MADDPG ignores the temporal information hidden in agents' interactions with the environment. Moreover, because its centralized training with decentralized execution (CTDE) scheme has each agent's critic network compute over all agents' actions and the full environment information, it scales poorly to larger numbers of agents. To address MADDPG's neglect of temporal information, this article proposes a new algorithm, MADDPG-LSTMactor, which combines MADDPG with long short-term memory (LSTM). By feeding an agent's observations over consecutive timesteps into its policy network, it lets the LSTM layer process the hidden temporal information. Experimental results show that this algorithm performs better in scenarios with a small number of agents. To address MADDPG's inefficiency in scenarios with many agents, this article also proposes a lightweight MADDPG (MADDPG-L) algorithm, which simplifies the input of the critic network. Experimental results show that this algorithm outperforms MADDPG when the number of agents is large.
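The two ideas in the abstract can be illustrated with a minimal Python sketch; it is not the authors' implementation. The first part shows one way an agent's observations over consecutive timesteps might be gathered into a fixed-length sequence for a recurrent policy network. The second shows why a critic that sees every agent's observation and action grows with the number of agents. The names `ObservationWindow` and `centralized_critic_input_dim`, the zero-padding at the start of an episode, and all dimensions are assumptions made for illustration.

```python
from collections import deque
from typing import Deque, List, Sequence


class ObservationWindow:
    """Keeps the last `seq_len` observations of one agent (hypothetical helper)."""

    def __init__(self, seq_len: int, obs_dim: int) -> None:
        self.seq_len = seq_len
        self.obs_dim = obs_dim
        # Assumption: zero-pad so the window is full length from the first step.
        self._buf: Deque[List[float]] = deque(
            ([0.0] * obs_dim for _ in range(seq_len)), maxlen=seq_len
        )

    def push(self, obs: Sequence[float]) -> List[List[float]]:
        """Add the newest observation and return the window, oldest first."""
        if len(obs) != self.obs_dim:
            raise ValueError("unexpected observation size")
        self._buf.append(list(obs))
        # Shape (seq_len, obs_dim): the layout a recurrent actor would consume.
        return [row[:] for row in self._buf]


def centralized_critic_input_dim(
    obs_dims: Sequence[int], act_dims: Sequence[int]
) -> int:
    """Input width of a critic that sees every agent's observation and action."""
    return sum(obs_dims) + sum(act_dims)


if __name__ == "__main__":
    window = ObservationWindow(seq_len=3, obs_dim=2)
    window.push([1.0, 2.0])
    print(window.push([3.0, 4.0]))

    # Illustrative sizes only: 10-dimensional observations, 2-dimensional actions.
    for n_agents in (3, 6, 12):
        print(n_agents, centralized_critic_input_dim([10] * n_agents, [2] * n_agents))
```

With these illustrative sizes, the critic input width grows linearly with the number of agents, and each agent has its own critic. This is the scaling pressure that simplifying the critic input is meant to relieve.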
