Multi-Agent Reinforcement Learning Algorithm Based On Neural Networks

Lianggui Tang,Hu Yang,Bo An,Daijie Cheng
2006-01-01
Abstract:Because of the computational complexity about agent's state-action space, the general algorithm of agent's learning may be inefficient. Our work adopted Markov Decision-making Process as a framework of multi-agent reinforcement learning, designed a neural networks model of agent's learning action, and a faith modification mechanism of cooperative multi-agents was investigated. The neural networks model can approach any value function, and the higher dimension space of information can be transformed into lower dimension space by the mapping of value function, so the algorithm of Multi-Agent Reinforcement Learning based on Neural Networks (MARLNN) could reduce the VC-dimensions of the state-action space, and by using random gradient descent algorithm to minimize the square sum of Bellman residues, a higher convergence speed of the MARLNN algorithm was gained. Simulation experimental results validated that this work had very good performance and action impending ability.
What problem does this paper attempt to address?