Abstract:There is a concerted effort to build intelligent sea and numerous artificial intelligence technologies have been explored. At present, more and more people are engaged in the research of deep reinforcement learning algorithm, and its mainstream application is in the field of games. Reinforcement learning has conquered chess belonging to complete information game, and Texas poker belonging to incomplete information games. And it reached or even surpassed the highest player level of mankind in E-sports games with huge state space and complex action space. However, reinforcement learning algorithm still has great challenges in fields such as automatic driving. The main reason is that the training of reinforcement learning needs to build an environment for interacting with agents. However, it is very difficult to construct realistic simulation scenes, and there is no guarantee that we will not encounter the state that the agent has not seen. Therefore, it is necessary to explore the simulation scene first. Based on this, this paper mainly studies reinforcement learning in simulation scenario. There are huge challenges in migrating them to real scenario applications, especially in sea missions. Aiming at the heterogeneous multi-agent game confrontation scenario, this paper proposes a sea battlefield game confrontation decision algorithm based on multi-agent deep deterministic policy gradient. The algorithm combines long short-term memory and actor-critic, which not only realizes the convergence of the algorithm in huge state space and action space, but also solves the problem of sparse real rewards. At the same time, imitation learning is integrated into the decision algorithm, which not only improves the convergence speed of the algorithm, but also greatly improves the effectiveness of the algorithm. The results show that the algorithm can deal with a variety of different tactical sea battlefield scenarios, make flexible decisions according to the changes of the enemy, and the average winning rate is close to 90%.

Mastering the Game of 3v3 Snakes with Rule-Enhanced Multi-Agent Reinforcement Learning

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding.

Large Scale Pursuit-Evasion under Collision Avoidance Using Deep Reinforcement Learning.

Zonation Method for Efficient Training of Collaborative Multi-Agent Reinforcement Learning in Double Snake Game

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

Training Interactive Agent in Large FPS Game Map with Rule-enhanced Reinforcement Learning

Deep reinforcement learning algorithm based on multi-agent parallelism and its application in game environment

Hierarchical Reinforcement Learning for Multi-agent MOBA Game

NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks

Mastering Asymmetrical Multiplayer Game with Multi-Agent Asymmetric-Evolution Reinforcement Learning

Evolutionary reinforcement learning algorithm for large-scale multi-agent cooperation and confrontation applications

Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning

Building a 3-Player Mahjong AI using Deep Reinforcement Learning

Hierarchical Deep Reinforcement Learning Agent with Counter Self-play on Competitive Games

Mastering Complex Control in MOBA Games with Deep Reinforcement Learning

Towards Playing Full MOBA Games with Deep Reinforcement Learning

AlphaSnake: Policy Iteration on a Nondeterministic NP-hard Markov Decision Process (student Abstract)

Towards a Competitive 3-Player Mahjong AI Using Deep Reinforcement Learning

Mastering the game of Stratego with model-free multiagent reinforcement learning

A MADDPG-based multi-agent antagonistic algorithm for sea battlefield confrontation