Abstract:Intelligence agents and multi-agent systems play important roles in scenes like the control system of grouped drones, and multi-agent navigation and obstacle avoidance which is the foundational function of advanced application has great importance. In multi-agent navigation and obstacle avoidance tasks, the decision-making interactions and dynamic changes of agents are difficult for traditional route planning algorithms or reinforcement learning algorithms with the increased complexity of the environment. The classical multi-agent reinforcement learning algorithm, Multi-agent deep deterministic policy gradient(MADDPG), solved precedent algorithms' problems of having unstationary training process and unable to deal with environment randomness. However, MADDPG ignored the temporal message hidden beneath agents' interaction with the environment. Besides, due to its CTDE technique which let each agent's critic network to calculate over all agents' action and the whole environment information, it lacks ability to scale to larger amount of agents. To deal with MADDPG's ignorance of the temporal information of the data, this article proposes a new algorithm called MADDPG-LSTMactor, which combines MADDPG with Long short term memory (LSTM). By using agent's observations of continuous timesteps as the input of its policy network, it allows the LSTM layer to process the hidden temporal message. Experimental result demonstrated that this algorithm had better performance in scenarios where the amount of agents is small. Besides, to solve MADDPG's drawback of not being efficient in scenarios where agents are too many, this article puts forward a light-weight MADDPG (MADDPG-L) algorithm, which simplifies the input of critic network. The result of experiments showed that this algorithm had better performance than MADDPG when the amount of agents was large.

Multi-agent action strategy learning method and device, medium and computing equipment

Adaptive algorithm for multi-agent learning optimal cooperative pursuit strategy based on Markov game

A Plan Recognition Approach for Agent in Adversarial Multi-Agent System

A multi-agent planning approach integrated with learning mechanism

Cooperative multi-agent target searching: a deep reinforcement learning approach based on parallel hindsight experience replay

Multi-Agent Deep Reinforcement Learning with Human Strategies

Cooperative Behavior Acquisition Based Modular Q Learning in Multi-Agent System

Multi-Robot Real-time Game Strategy Learning Based on Deep Reinforcement Learning.

A MULTI-AGENT BASED COOPERATIVE INTELLIGENT FRAME FOR PROBLEM SOLVING METHOD

Multi-Agent Behavior Retrieval: Retrieval-Augmented Policy Training for Cooperative Push Manipulation by Mobile Robots

A Novel Multi-Agent Reinforcement Learning Approach

A new multi-agent reinforcement learning approach

A multiagent reinforcement learning approach based on different states

Learning-based Formal Synthesis of Cooperative Multi-agent Systems

Study on Multi-agent Based Simulation of Team Machine Learning

The Design and Realization of Multi-agent Obstacle Avoidance based on Reinforcement Learning

A muti-agent defensive strategy based on monte carlo method

Embodied Multi-Agent Task Planning from Ambiguous Instruction

Learning Attention-Based Strategies to Cooperate for Multi-Agent Path Finding

Strategy Extraction in Single-Agent Games

Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning