Abstract:Sparse reward is one of the most challenging problems in reinforcement learning (RL). Hindsight Experience Replay (HER) attempts to address this issue by converting a failed experience to a successful one by relabelling the goals. In open-ended and changing environments, agents face a wide range of potential tasks that might not come with associated reward functions. Such autonomous learning agents must set their own tasks and build their own curriculum through an intrinsically motivated exploration. Because some tasks might prove easy and some impossible, agents must actively select which task to practice at any given moment, to maximize their overall mastery on the set of learnable tasks. The purpose of this technical report is two-fold. First, it introduces a suite of challenging continuous control tasks (integrated with OpenAI Gym) based on currently existing robotics hardware. The tasks include pushing, sliding and pick & place with a Fetch robotic arm as well as in-hand object manipulation with a Shadow Dexterous Hand. All tasks have sparse binary rewards and follow a Multi-Goal Reinforcement Learning (RL) framework in which an agent is told what to do using an additional input. The second part of the paper presents a set of concrete research ideas for improving RL algorithms, most of which are related to Multi-Goal RL and Hindsight Experience Replay. The Fetch environments are based on the 7-DoF Fetch robotics arm,2 which has a two-fingered parallel gripper. Agents focus on achievable tasks first and focus back on tasks that are being forgotten. Experiments conducted in a new multi-task multi-goal robotic environment show that our algorithm benefits from these two ideas and demonstrate properties of robustness to distracting tasks, forgetting and changes in body properties

ACDER: Augmented Curiosity-Driven Experience Replay

Efficient Diversity-based Experience Replay for Deep Reinforcement Learning

Soft Hindsight Experience Replay

ACTRCE: Augmenting Experience via Teacher's Advice For Multi-Goal Reinforcement Learning

Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards

Advances in Experience Replay

Exploration via Hindsight Goal Generation

Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency

Random curiosity-driven exploration in deep reinforcement learning

CIER: A Novel Experience Replay Approach with Causal Inference in Deep Reinforcement Learning

Consistent Experience Replay in High-Dimensional Continuous Control with Decayed Hindsights

Replay across Experiments: A Natural Extension of Off-Policy RL

Overcoming Exploration in Reinforcement Learning with Demonstrations

Adaptive trajectory-constrained exploration strategy for deep reinforcement learning

Reward Uncertainty for Exploration in Preference-based Reinforcement Learning

Contact Energy Based Hindsight Experience Prioritization

MRHER: Model-based Relay Hindsight Experience Replay for Sequential Object Manipulation Tasks with Sparse Rewards

HiER: Highlight Experience Replay for Boosting Off-Policy Reinforcement Learning Agents

Quantile Regression Hindsight Experience Replay

Intrinsically Motivated Multi-Goal Reinforcement Learning Using Robotics Environment Integrated with OpenAI Gym

ROER: Regularized Optimal Experience Replay