PufferLib: Making Reinforcement Learning Libraries and Environments Play Nice

Joseph Suarez
2024-06-12
Abstract: You have an environment, a model, and a reinforcement learning library that are designed to work together but don't. PufferLib makes them play nice. The library provides one-line environment wrappers that eliminate common compatibility problems and fast vectorization to accelerate training. With PufferLib, you can use familiar libraries like CleanRL and SB3 to scale from classic benchmarks like Atari and Procgen to complex simulators like NetHack and Neural MMO. We release pip packages and prebuilt images with dependencies for dozens of environments. All of our code is free and open-source software under the MIT license, complete with baselines, documentation, and support at http://pufferai.github.io.
Machine Learning, Artificial Intelligence, Multiagent Systems
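
The abstract mentions two mechanisms: wrappers that smooth over compatibility problems, and vectorization. As a rough illustration of the general idea only, the sketch below flattens a structured observation into a flat list and steps several wrapped environments as a batch. This is not PufferLib's API. Every name here (`DictObsEnv`, `flatten`, `FlattenWrapper`, `SerialVecEnv`) is hypothetical, the toy environment is invented for the example, and the vectorization is a plain serial loop rather than the fast implementation the abstract describes.

```python
from typing import Any, Dict, List, Tuple


class DictObsEnv:
    """Hypothetical toy environment that returns a structured (dict) observation."""

    def __init__(self, offset: int = 0) -> None:
        self.offset = offset
        self.t = 0

    def _obs(self) -> Dict[str, Any]:
        return {"position": [self.offset, self.t], "health": 10 - self.t}

    def reset(self) -> Dict[str, Any]:
        self.t = 0
        return self._obs()

    def step(self, action: int) -> Tuple[Dict[str, Any], float, bool]:
        self.t += 1
        done = self.t >= 3  # episodes last three steps
        return self._obs(), float(action), done


def flatten(obs: Any) -> List[float]:
    """Recursively flatten nested dicts/lists into a flat list of floats.

    Dict keys are visited in sorted order so the layout is deterministic.
    """
    if isinstance(obs, dict):
        return [x for key in sorted(obs) for x in flatten(obs[key])]
    if isinstance(obs, (list, tuple)):
        return [x for item in obs for x in flatten(item)]
    return [float(obs)]


class FlattenWrapper:
    """Presents a structured-observation environment as a flat-observation one."""

    def __init__(self, env: DictObsEnv) -> None:
        self.env = env

    def reset(self) -> List[float]:
        return flatten(self.env.reset())

    def step(self, action: int) -> Tuple[List[float], float, bool]:
        obs, reward, done = self.env.step(action)
        return flatten(obs), reward, done


class SerialVecEnv:
    """Steps several environments in a loop and resets any that finish."""

    def __init__(self, envs: List[FlattenWrapper]) -> None:
        self.envs = envs

    def reset(self) -> List[List[float]]:
        return [env.reset() for env in self.envs]

    def step(self, actions: List[int]):
        obs_batch, rewards, dones = [], [], []
        for env, action in zip(self.envs, actions):
            obs, reward, done = env.step(action)
            if done:
                obs = env.reset()  # start the next episode immediately
            obs_batch.append(obs)
            rewards.append(reward)
            dones.append(done)
        return obs_batch, rewards, dones


vec = SerialVecEnv([FlattenWrapper(DictObsEnv(offset=i)) for i in range(2)])
first_obs = vec.reset()
next_obs, rewards, dones = vec.step([1, 0])
```

With this arrangement a learning library only ever sees fixed-length flat observations in a batch, which is the kind of compatibility the abstract is concerned with; the original structure could be recovered inside the model if the flattening order is known.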