Market-Based Reinforcement Learning in Partially Observable Worlds

Ivo Kwee,Marcus Hutter,Juergen Schmidhuber
DOI: https://doi.org/10.48550/arXiv.cs/0105025
2001-05-16
Abstract:Unlike traditional reinforcement learning (RL), market-based RL is in principle applicable to worlds described by partially observable Markov Decision Processes (POMDPs), where an agent needs to learn short-term memories of relevant previous events in order to execute optimal actions. Most previous work, however, has focused on reactive settings (MDPs) instead of POMDPs. Here we reimplement a recent approach to market-based RL and for the first time evaluate it in a toy POMDP setting.
Artificial Intelligence,Machine Learning,Multiagent Systems,Neural and Evolutionary Computing
What problem does this paper attempt to address?