Abstract: Learning goal-conditioned control in the real world is a challenging open problem in robotics. Reinforcement learning systems have the potential to learn autonomously via trial and error, but in practice the costs of manual reward design, ensuring safe exploration, and hyperparameter tuning are often enough to preclude real-world deployment. Imitation learning approaches, on the other hand, offer a simple way to learn control in the real world, but typically require costly curated demonstration data and lack a mechanism for continuous improvement. Recently, iterative imitation techniques have been shown to learn goal-directed control from undirected demonstration data and to improve continuously via self-supervised goal reaching, but results thus far have been limited to simulated environments. In this work, we present evidence that iterative imitation learning can scale to goal-directed behavior on a real robot in a dynamic setting: high-speed, precision table tennis (e.g. "land the ball on this particular target"). We find that this approach offers a straightforward way to do continuous on-robot learning, without complexities such as reward design or sim-to-real transfer. It is also scalable: sample-efficient enough to train on a physical robot in just a few hours. In real-world evaluations, we find that the resulting policy can perform on par with or better than amateur humans (with players sampled randomly from a robotics lab) at the task of returning the ball to specific targets on the table. Finally, we analyze the effect of the size of an initial undirected bootstrap dataset on performance, finding that a modest amount of unstructured demonstration data provided up front drastically speeds up the convergence of a general-purpose goal-reaching policy. See https://sites.google.com/view/goals-eye for videos.

GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot
