Abstract:Learning goal conditioned control in the real world is a challenging open problem in robotics. Reinforcement learning systems have the potential to learn autonomously via trial-and-error, but in practice the costs of manual reward design, ensuring safe exploration, and hyperparameter tuning are often enough to preclude real world deployment. Imitation learning approaches, on the other hand, offer a simple way to learn control in the real world, but typically require costly curated demonstration data and lack a mechanism for continuous improvement. Recently, iterative imitation techniques have been shown to learn goal directed control from undirected demonstration data, and improve continuously via self-supervised goal reaching, but results thus far have been limited to simulated environments. In this work, we present evidence that iterative imitation learning can scale to goal-directed behavior on a real robot in a dynamic setting: high speed, precision table tennis (e.g. "land the ball on this particular target"). We find that this approach offers a straightforward way to do continuous on-robot learning, without complexities such as reward design or sim-to-real transfer. It is also scalable -- sample efficient enough to train on a physical robot in just a few hours. In real world evaluations, we find that the resulting policy can perform on par or better than amateur humans (with players sampled randomly from a robotics lab) at the task of returning the ball to specific targets on the table. Finally, we analyze the effect of an initial undirected bootstrap dataset size on performance, finding that a modest amount of unstructured demonstration data provided up-front drastically speeds up the convergence of a general purpose goal-reaching policy. See <a class="link-external link-https" href="https://sites.google.com/view/goals-eye" rel="external noopener nofollow">this https URL</a> for videos.

A Novel Ping-pong Task Strategy Based on Model-free Multi-dimensional Q-function Deep Reinforcement Learning

Towards High Level Skill Learning: Learn to Return Table Tennis Ball Using Monte-Carlo Based Policy Gradient Method.

An adaptive trajectory prediction method for ping-pong robots

Deep Reinforcement Learning in a Racket Sport for Player Evaluation With Technical and Tactical Contexts

Optimal stroke learning with policy gradient approach for robotic table tennis

Ball Motion Control in the Table Tennis Robot System Using Time-Series Deep Reinforcement Learning

Application of deep learning in automatic detection of technical and tactical indicators of table tennis

Multi-Robot Real-time Game Strategy Learning Based on Deep Reinforcement Learning.

Strategy and Skill Learning for Physics-based Table Tennis Animation

Deep Q-Network for AI Soccer

Stylized Table Tennis Robots Skill Learning with Incomplete Human Demonstrations

State of the Art Control of Atari Games Using Shallow Reinforcement Learning

A Reinforcement Learning Badminton Environment for Simulating Player Tactics (Student Abstract)

Sample-efficient Reinforcement Learning in Robotic Table Tennis

Learning to Play Football from Sports Domain Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework

Adaptive Learning Recommendation Strategy Based on Deep Q-learning

Model-based Credit Assignment for Model-free Deep Reinforcement Learning

Learning to Play Air Hockey with Model-Based Deep Reinforcement Learning

GoalsEye: Learning High Speed Precision Table Tennis on a Physical Robot

Prediction-based Hierarchical Reinforcement Learning for Robot Soccer