Intuitionistic Fuzzy MADM in Wargame Leveraging with Deep Reinforcement Learning

Yuxiang Sun,Yuanbai Li,Huaxiong Li,Jiubing Liu,Xianzhong Zhou
DOI: https://doi.org/10.1109/tfuzz.2024.3435400
IF: 12.253
2024-01-01
IEEE Transactions on Fuzzy Systems
Abstract:Presently, intelligent games have emerged as a substantial research area. Nonetheless, the slow convergence of intelligent wargame training and the low success rates of agents against specific rules present challenges. In this article, we propose a game confrontation algorithm combining the multiple attribute decision making (MADM) approach from management science and reinforcement learning (RL) technology. This integration enables us to combine the strengths of both approaches and addresses the above issues effectively. This study conducts experiments using the algorithm that integrates MADM and RL techniques to gather confrontation data from the red and blue sides within the winning-first wargame platform. The data is then analyzed using the weight calculation method of intuitionistic fuzzy numbers to determine each intelligent opponent agent's threat level from the perspective of MADM. The threat level calculated by MADM is used to construct the reward function for the red side. The simulation results demonstrate that the algorithm combining MADM and RL proposed in this study outperforms classical RL algorithms regarding intelligence. This approach effectively addresses issues, such as the convergence difficulty, caused by random initialization and the sparse rewards for agent neural networks in wargame environments with large maps. Combining the MADM method from management with the RL algorithm in control can lead to cross-disciplinary innovation in academic fields, which provides innovative research values for intelligent wargame design and RL algorithm improvements.
What problem does this paper attempt to address?