Abstract:Model bias is an inherent limitation of the current dominant approach to optimal quantum control, which relies on a system simulation for optimization of control policies. To overcome this limitation, we propose a circuit-based approach for training a reinforcement learning agent on quantum control tasks in a model-free way. Given a continuously parametrized control circuit, the agent learns its parameters through trial-and-error interaction with the quantum system, using measurement outcomes as the only source of information about the quantum state. Focusing on control of a harmonic oscillator coupled to an ancilla qubit, we show how to reward the learning agent with measurements of experimentally available observables. We train the agent to prepare various nonclassical states via both unitary control and control with adaptive measurement-based quantum feedback, and to execute logical gates on encoded qubits. The agent does not rely on averaging for state tomography or fidelity estimation, and significantly outperforms widely used model-free methods in terms of sample efficiency. Our numerical work is of immediate relevance to superconducting circuits and trapped ions platforms where such training can be implemented in experiment, allowing complete elimination of model bias and the adaptation of quantum control policies to the specific system in which they are deployed. https://doi.org/10.1103/PhysRevX.12.011059 Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI. Published by the American Physical Society

Model-Free Quantum Control with Reinforcement Learning

Sample-efficient Model-based Reinforcement Learning for Quantum Control

Non-Markovian Quantum Control via Model Maximum Likelihood Estimation and Reinforcement Learning

Realizing a deep reinforcement learning agent discovering real-time feedback control strategies for a quantum system

Robust Optimization for Quantum Reinforcement Learning Control Using Partial Observations

Controlling nonergodicity in quantum many-body systems by reinforcement learning

Experimentally Realizing Efficient Quantum Control with Reinforcement Learning

Universal Quantum Control through Deep Reinforcement Learning

The Quantum Cartpole: A benchmark environment for non-linear reinforcement learning

Molecular Quantum Control Algorithm Design by Reinforcement Learning

Model-free Distortion Canceling and Control of Quantum Devices

Experimentally Realizing Efficient Quantum Control with Reinforcement Learning

Re-exploring Control Strategies in a Non-Markovian Open Quantum System by Reinforcement Learning

Quantum optimal control of superconducting qubits based on machine-learning characterization

High-dimensional reinforcement learning for optimization and control of ultracold quantum gases

Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving

Realizing a deep reinforcement learning agent for real-time quantum feedback

Quantum Control based on Deep Reinforcement Learning

Automatic pulse-level calibration by tracking observables using iterative learning

Challenges for Reinforcement Learning in Quantum Circuit Design