Abstract:Multi-objective reinforcement learning (MORL) extends traditional RL by seeking policies making different compromises among conflicting objectives. The recent surge of interest in MORL has led to diverse studies and solving methods, often drawing from existing knowledge in multi-objective optimization based on decomposition (MOO/D). Yet, a clear categorization based on both RL and MOO/D is lacking in the existing literature. Consequently, MORL researchers face difficulties when trying to classify contributions within a broader context due to the absence of a standardized taxonomy. To tackle such an issue, this paper introduces multi-objective reinforcement learning based on decomposition (MORL/D), a novel methodology bridging the literature of RL and MOO. A comprehensive taxonomy for MORL/D is presented, providing a structured foundation for categorizing existing and potential MORL works. The introduced taxonomy is then used to scrutinize MORL research, enhancing clarity and conciseness through well-defined categorization. Moreover, a flexible framework derived from the taxonomy is introduced. This framework accommodates diverse instantiations using tools from both RL and MOO/D. Its versatility is demonstrated by implementing it in different configurations and assessing it on contrasting benchmark problems. Results indicate MORL/D instantiations achieve comparable performance to current state-of-the-art approaches on the studied problems. By presenting the taxonomy and framework, this paper offers a comprehensive perspective and a unified vocabulary for MORL. This not only facilitates the identification of algorithmic contributions but also lays the groundwork for novel research avenues in MORL.

Demonstration Guided Multi-Objective Reinforcement Learning

A Two-Stage Multi-Objective Deep Reinforcement Learning Framework.

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front

Multiobjective Reinforcement Learning: A Comprehensive Overview

Provable Multi-Objective Reinforcement Learning with Generative Models

Human-in-the-Loop Policy Optimization for Preference-Based Multi-Objective Reinforcement Learning

Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL

Continual Multi-Objective Reinforcement Learning Via Reward Model Rehearsal

Multi-Objective Reinforcement Learning Based on Decomposition: A Taxonomy and Framework

Combining a Gradient-Based Method and an Evolution Strategy for Multi-Objective Reinforcement Learning.

MORAL: Aligning AI with Human Norms through Multi-Objective Reinforced Active Learning

Enhancing Robotic Navigation: An Evaluation of Single and Multi-Objective Reinforcement Learning Strategies

"Notice of Violation of IEEE Publication Principles" Multiobjective Reinforcement Learning: A Comprehensive Overview.

Navigating Trade-offs: Policy Summarization for Multi-Objective Reinforcement Learning

Hyperparameter Optimization for Multi-Objective Reinforcement Learning

PA2D-MORL: Pareto Ascent Directional Decomposition Based Multi-Objective Reinforcement Learning

Learning Adaptive Multi-Objective Robot Navigation Incorporating Demonstrations

gTLO: A Generalized and Non-linear Multi-Objective Deep Reinforcement Learning Approach

Multi-objective Reinforcement Learning: A Tool for Pluralistic Alignment

Beyond Joint Demonstrations: Personalized Expert Guidance for Efficient Multi-Agent Reinforcement Learning