Experimental Study on Decentralized Concurrent Learning for Multi-Agent System with Complex Dynamics

Ting Fei,Xin Chen,Min Wu,Chi Wang
DOI: https://doi.org/10.23919/chicc.2017.8028683
2017-01-01
Abstract:A cooperative multi-agent system entitles some independent agents to complete complex tasks through coordination and cooperation. Since the dynamics of physical agents are so complex that the environment of learning is indeed stochastic, the paper introduces the decentralized multi-agent reinforcement learning (MARL) algorithm, named as Decentralized Concurrent Learning with Cooperative Policy Exploration (DCL-CPE), in order to solve cooperative learning within stochastic environment. To investigate its feasibility in practical multi-agent systems, the box-pushing test with DCL-CPE is designed with a group of two-wheel driven robots acting as learning agents. Due to physical properties, such as nonholonomic dynamics, rolling and sliding frictions, unreliable sense, rigid body collision, etc., the cooperative learning is a high stochastic learning case. The simulation test in Webots shows that DCL-CPE is good at exploring best cooperative policy in a decentralized way, even as state transition and rewards are all stochastic.
What problem does this paper attempt to address?