Abstract:Multi-objective game (MOG) is a fundamental model for the decision-making problems in which each player must consider multi-dimensional payoffs that reflect different objectives. Typically, solving MOG involves refining the set of equilibrium strategies, which is also known as MOG strategy selection (MOGS). However, existing MOG algorithms only allow one metric for MOGS, which limits the application in real-world scenarios where the players may have different preferences over multiple metrics. In this paper, we first develop a preference-based MOGS framework to encompass multiple metrics with different preferences in MOGS. Based on the framework, we introduce the concept of comprehensive evaluation value (CEV) to evaluate the quality of a strategy set given the preference of each metric. Using CEV as a reward signal, we formulate the problem of finding the optimal strategy set as a Markov decision process, and use deep reinforcement learning to train a policy for MOG strategy selection given the metrics and the corresponding preferences. Specifically, we combine a rational strategy filtering procedure with a Transformer-based encoder–decoder policy network to refine the strategies given the preferences, and then we use a revised REINFORCE algorithm to train the policy network. Besides, we introduce variable beam search decoding to improve the quality of a rollout by keeping track of the most promising strategy sets and choosing the best one. We benchmark our algorithm on the MOG instances generated by GAMUT, and extensive experiments demonstrate that our algorithm can generate the strategy set significantly better than the state-of-the-art baselines with lower computational overhead given different preferences. Furthermore, we compare our approach on real-world problems, showing the great advantages in both performance and runtime.

Deep reinforcement learning for multi-objective combinatorial optimization: A case study on multi-objective traveling salesman problem

Multi-Objective Combinatorial Optimization Algorithm Based on Asynchronous Advantage Actor–Critic and Graph Transformer Networks

Deep Reinforcement Learning for Multiobjective Optimization

A deep reinforcement learning algorithm framework for solving multi-objective traveling salesman problem based on feature transformation

Multiobjective Combinatorial Optimization Using a Single Deep Reinforcement Learning Model

Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems

Hybridization of evolutionary algorithm and deep reinforcement learning for multi-objective orienteering optimization

A reinforcement learning approach for dynamic multi-objective optimization

Leader Reward for POMO-Based Neural Combinatorial Optimization

A compass-based hyper-heuristic for multi-objective optimization problems

Deep reinforcement learning assisted novelty search in Voronoi regions for constrained multi-objective optimization

A self-organizing assisted multi-task algorithm for constrained multi-objective optimization problems

DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems

Deep reinforcement learning for multi-objective game strategy selection

Adaptive Auxiliary Task Selection for Multitasking-Assisted Constrained Multi-Objective Optimization [Feature]

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization

The Collaborative Local Search Based on Dynamic-Constrained Decomposition With Grids for Combinatorial Multiobjective Optimization

A novel multimodal multi-objective optimization algorithm for multi-robot task allocation

Multi-Task Multi-Objective Evolutionary Search Based on Deep Reinforcement Learning for Multi-Objective Vehicle Routing Problems with Time Windows

Multi-Objective Neural Evolutionary Algorithm for Combinatorial Optimization Problems