Abstract:Multi-agent systems are trained to maximize shared cost objectives, which typically reflect system-level efficiency. However, in the resource-constrained environments of mobility and transportation systems, efficiency may be achieved at the expense of fairness -- certain agents may incur significantly greater costs or lower rewards compared to others. Tasks could be distributed inequitably, leading to some agents receiving an unfair advantage while others incur disproportionately high costs. It is important to consider the tradeoffs between efficiency and fairness. We consider the problem of fair multi-agent navigation for a group of decentralized agents using multi-agent reinforcement learning (MARL). We consider the reciprocal of the coefficient of variation of the distances traveled by different agents as a measure of fairness and investigate whether agents can learn to be fair without significantly sacrificing efficiency (i.e., increasing the total distance traveled). We find that by training agents using min-max fair distance goal assignments along with a reward term that incentivizes fairness as they move towards their goals, the agents (1) learn a fair assignment of goals and (2) achieve almost perfect goal coverage in navigation scenarios using only local observations. For goal coverage scenarios, we find that, on average, our model yields a 14% improvement in efficiency and a 5% improvement in fairness over a baseline trained using random assignments. Furthermore, an average of 21% improvement in fairness can be achieved compared to a model trained on optimally efficient assignments; this increase in fairness comes at the expense of only a 7% decrease in efficiency. Finally, we extend our method to environments in which agents must complete coverage tasks in prescribed formations and show that it is possible to do so without tailoring the models to specific formation shapes.

Cooperative Multi-Agent Fairness and Equivariant Policies

Learning Fairness in Multi-Agent Systems

Cooperation and Fairness in Multi-Agent Reinforcement Learning

Long-Term Fairness in Sequential Multi-Agent Selection with Positive Reinforcement

Welfare and Fairness in Multi-objective Reinforcement Learning

Multi-agent cooperation through learning-aware policy gradients

Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems

Fairness-Aware Meta-Learning via Nash Bargaining

Fairness Auditing with Multi-Agent Collaboration

Policy Diversity for Cooperative Agents

Individual Reward Assisted Multi-Agent Reinforcement Learning.

Egoism, utilitarianism and egalitarianism in multi-agent reinforcement learning

Cooperative Multi-Agent Policy Gradients with Sub-optimal Demonstration

Toward Finding Strong Pareto Optimal Policies in Multi-Agent Reinforcement Learning

Learning Fair Cooperation in Mixed-Motive Games with Indirect Reciprocity

Using Protected Attributes to Consider Fairness in Multi-Agent Systems

Learning Fair Policies for Multi-stage Selection Problems from Observational Data

Multi-agent Policy Optimization with Approximatively Synchronous Advantage Estimation

Normative Disagreement as a Challenge for Cooperative AI

Unfairness Despite Awareness: Group-Fair Classification with Strategic Agents

Algorithmic Fairness in Performative Policy Learning: Escaping the Impossibility of Group Fairness