Abstract:Hierarchical model-based reinforcement learning (HMBRL) aims to combine the benefits of better sample efficiency of model based reinforcement learning (MBRL) with the abstraction capability of hierarchical reinforcement learning (HRL) to solve complex tasks efficiently. While HMBRL has great potential, it still lacks wide adoption. In this work we describe a novel HMBRL framework and evaluate it thoroughly. To complement the multi-layered decision making idiom characteristic for HRL, we construct hierarchical world models that simulate environment dynamics at various levels of temporal abstraction. These models are used to train a stack of agents that communicate in a top-down manner by proposing goals to their subordinate agents. A significant focus of this study is the exploration of a static and environment agnostic temporal abstraction, which allows concurrent training of models and agents throughout the hierarchy. Unlike most goal-conditioned H(MB)RL approaches, it also leads to comparatively low dimensional abstract actions. Although our HMBRL approach did not outperform traditional methods in terms of final episode returns, it successfully facilitated decision making across two levels of abstraction using compact, low dimensional abstract actions. A central challenge in enhancing our method's performance, as uncovered through comprehensive experimentation, is model exploitation on the abstract level of our world model stack. We provide an in depth examination of this issue, discussing its implications for the field and suggesting directions for future research to overcome this challenge. By sharing these findings, we aim to contribute to the broader discourse on refining HMBRL methodologies and to assist in the development of more effective autonomous learning systems for complex decision-making environments.

Erlang Planning Network: an Iterative Model-Based Reinforcement Learning with Multi-Perspective

Model-Based Reinforcement Learning with Automated Planning for Network Management

Network planning with deep reinforcement learning

Leveraging the Efficiency of Multi-Task Robot Manipulation Via Task-Evoked Planner and Reinforcement Learning

Multi-robot Social-aware Cooperative Planning in Pedestrian Environments Using Multi-agent Reinforcement Learning

LMRL: a Multi-Agent Reinforcement Learning Model and Algorithm

An Enhanced Hierarchical Planning Framework for Multi-Robot Autonomous Exploration

Multi-Agent Path Planning Using Deep Reinforcement Learning

Graph Neural Networks with Model-based Reinforcement Learning for Multi-agent Systems

Improving Planning with Large Language Models: A Modular Agentic Architecture

On the role of planning in model-based deep reinforcement learning

Planning-Augmented Hierarchical Reinforcement Learning

LIRL: Latent Imagination-Based Reinforcement Learning for Efficient Coverage Path Planning

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization

A Two-Layered Multi-Agent Reinforcement Learning Model and Algorithm

Efficient Multi-agent Reinforcement Learning by Planning

Exploring the limits of Hierarchical World Models in Reinforcement Learning

ReLEP: A Novel Framework for Real-world Long-horizon Embodied Planning

Learning Efficient Multi-Agent Cooperative Visual Exploration

ED2: Environment Dynamics Decomposition World Models for Continuous Control

Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning