Abstract: The last decade has seen a significant increase in interest in deep learning research, with many public successes that have demonstrated its potential. As a result, these systems are now being incorporated into commercial products. With this comes an additional challenge: how can we build AI systems that solve tasks where there is not a crisp, well-defined specification? While multiple solutions have been proposed, in this competition we focus on one in particular: learning from human feedback. Rather than training AI systems using a predefined reward function or a labeled dataset with a predefined set of categories, we instead train the AI system using a learning signal derived from some form of human feedback, which can evolve over time as the understanding of the task changes, or as the capabilities of the AI system improve. The MineRL BASALT competition aims to spur forward research on this important class of techniques. We design a suite of four tasks in Minecraft for which we expect it will be hard to write down hardcoded reward functions. Each task is defined by a paragraph of natural language: for example, "create a waterfall and take a scenic picture of it", with additional clarifying details. Participants must train a separate agent for each task, using any method they want. Agents are then evaluated by humans who have read the task description. To help participants get started, we provide a dataset of human demonstrations on each of the four tasks, as well as an imitation learning baseline that leverages these demonstrations. Our hope is that this competition will improve our ability to build AI systems that do what their designers intend them to do, even when the intent cannot be easily formalized. Besides allowing AI to solve more tasks, this can also enable more effective regulation of AI systems, as well as progress on the value alignment problem.

Towards Solving Fuzzy Tasks with Human Feedback: A Retrospective of the MineRL BASALT 2022 Competition

Retrospective on the 2021 BASALT Competition on Learning from Human Feedback

The MineRL BASALT Competition on Learning from Human Feedback

Towards robust and domain agnostic reinforcement learning competitions

Retrospective Analysis of the 2019 MineRL Competition on Sample Efficient Reinforcement Learning

BEDD: The MineRL BASALT Evaluation and Demonstrations Dataset for Training and Benchmarking Agents that Solve Fuzzy Tasks

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs

Fine-Grained Human Feedback Gives Better Rewards for Language Model Training

Breadcrumbs to the Goal: Goal-Conditioned Exploration from Human-in-the-Loop Feedback

Just Say What You Want: Only-prompting Self-rewarding Online Preference Optimization

The Future of Open Human Feedback

Learn 2 Rage: Experiencing The Emotional Roller Coaster That Is Reinforcement Learning

Improving Grounded Language Understanding in a Collaborative Environment by Interacting with Agents Through Help Feedback

A Retrospective on the Robot Air Hockey Challenge: Benchmarking Robust, Reliable, and Safe Learning Techniques for Real-world Robotics

RLSF: Reinforcement Learning via Symbolic Feedback

Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback

Fine-Tuning Language Models Using Formal Methods Feedback

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

Safe RLHF: Safe Reinforcement Learning from Human Feedback

Reinforcement Learning Friendly Vision-Language Model for Minecraft