SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning

Qian Long,Fangwei Zhong,Mingdong Wu,Yizhou Wang,Song-Chun Zhu

2024-05-03

Abstract:Multi-agent systems (MAS) need to adaptively cope with dynamic environments, changing agent populations, and diverse tasks. However, most of the multi-agent systems cannot easily handle them, due to the complexity of the state and task space. The social impact theory regards the complex influencing factors as forces acting on an agent, emanating from the environment, other agents, and the agent's intrinsic motivation, referring to the social force. Inspired by this concept, we propose a novel gradient-based state representation for multi-agent reinforcement learning. To non-trivially model the social forces, we further introduce a data-driven method, where we employ denoising score matching to learn the social gradient fields (SocialGFs) from offline samples, e.g., the attractive or repulsive outcomes of each force. During interactions, the agents take actions based on the multi-dimensional gradients to maximize their own rewards. In practice, we integrate SocialGFs into the widely used multi-agent reinforcement learning algorithms, e.g., MAPPO. The empirical results reveal that SocialGFs offer four advantages for multi-agent systems: 1) they can be learned without requiring online interaction, 2) they demonstrate transferability across diverse tasks, 3) they facilitate credit assignment in challenging reward settings, and 4) they are scalable with the increasing number of agents.

Artificial Intelligence,Multiagent Systems

What problem does this paper attempt to address?

The paper aims to address the adaptability issues of Multi-Agent Systems (MAS) in dynamic environments, varying tasks, and different numbers and roles of agents. Most current multi-agent systems struggle to handle these complex situations, primarily due to the complexity of the state space and task space. To tackle this challenge, the researchers propose a novel gradient-based state representation method—Social Gradient Fields (SocialGFs)—for multi-agent reinforcement learning. Specifically, this method is inspired by social force theory, viewing complex influencing factors as forces acting on agents, and learns these social gradient fields from offline samples using denoising score matching techniques. During interactions, agents take actions based on multidimensional gradients to maximize their own rewards. Experimental results show that SocialGFs have the following advantages: 1. Learning can be conducted without the need for online interactions; 2. Demonstrates good transferability between different tasks; 3. Aids in credit assignment in challenging reward settings; 4. Maintains scalability as the number of agents increases. In summary, the paper improves the adaptability and generality of multi-agent systems by introducing SocialGFs, enabling them to operate effectively in various environments.

SocialGFs: Learning Social Gradient Fields for Multi-Agent Reinforcement Learning

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

Multi-agent cooperation through learning-aware policy gradients

Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning

A Multi-agent Cooperative Learning System with Evolution of Social Roles

Emergent Social Learning via Multi-agent Reinforcement Learning

Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient

Soft-HGRNs: Soft Hierarchical Graph Recurrent Networks for Multi-Agent Partially Observable Environments

The Gradient Convergence Bound of Federated Multi-Agent Reinforcement Learning with Efficient Communication.

Social Behavior as a Key to Learning-based Multi-Agent Pathfinding Dilemmas

GAT-MF: Graph Attention Mean Field for Very Large Scale Multi-Agent Reinforcement Learning

AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making

Evolutionary Multi-agent Reinforcement Learning in Group Social Dilemmas

Multi-Agent Exploration Via Self-Learning and Social Learning

SPD: Synergy Pattern Diversifying Oriented Unsupervised Multi-agent Reinforcement Learning.

Mean-Field Multi-Agent Reinforcement Learning: A Decentralized Network Approach

A Collaborative Multiagent Reinforcement Learning Method Based on Policy Gradient Potential

An Optimal Rewiring Strategy for Reinforcement Social Learning in Cooperative Multiagent Systems

Partially Observable Mean Field Multi-Agent Reinforcement Learning Based on Graph-Attention

Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning

Scalable and Transferable Reinforcement Learning for Multi-Agent Mixed Cooperative–Competitive Environments Based on Hierarchical Graph Attention