Abstract:Model bias is an inherent limitation of the current dominant approach to optimal quantum control, which relies on a system simulation for optimization of control policies. To overcome this limitation, we propose a circuit-based approach for training a reinforcement learning agent on quantum control tasks in a model-free way. Given a continuously parametrized control circuit, the agent learns its parameters through trial-and-error interaction with the quantum system, using measurement outcomes as the only source of information about the quantum state. Focusing on control of a harmonic oscillator coupled to an ancilla qubit, we show how to reward the learning agent with measurements of experimentally available observables. We train the agent to prepare various nonclassical states via both unitary control and control with adaptive measurement-based quantum feedback, and to execute logical gates on encoded qubits. The agent does not rely on averaging for state tomography or fidelity estimation, and significantly outperforms widely used model-free methods in terms of sample efficiency. Our numerical work is of immediate relevance to superconducting circuits and trapped ions platforms where such training can be implemented in experiment, allowing complete elimination of model bias and the adaptation of quantum control policies to the specific system in which they are deployed. https://doi.org/10.1103/PhysRevX.12.011059 Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article's title, journal citation, and DOI. Published by the American Physical Society

Preparation of cavity Fock state superpositions by reinforcement learning exploiting measurement back-action

Faster State Preparation across Quantum Phase Transition Assisted by Reinforcement Learning

Reinforcement learning for autonomous preparation of Floquet-engineered states: Inverting the quantum Kapitza oscillator

A reinforcement learning approach for quantum state engineering

Fidelity-Based Probabilistic Q-Learning for Control of Quantum Systems

Arbitrary quantum states preparation aided by deep reinforcement learning

A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning

Preparing Quantum States by Measurement-feedback Control with Bayesian Optimization

Robust Optimization for Quantum Reinforcement Learning Control Using Partial Observations

Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving

A Quantum States Preparation Method Based on Difference-Driven Reinforcement Learning

Model-Free Quantum Control with Reinforcement Learning

Measurement-based Fast Quantum State Stabilization with Deep Reinforcement Learning

Sample-efficient Model-based Reinforcement Learning for Quantum Control

Reconstruction of a Photonic Qubit State with Reinforcement Learning

When does reinforcement learning stand out in quantum control? A comparative study on state preparation

Improving robustness of quantum feedback control with reinforcement learning

Experimentally Realizing Efficient Quantum Control with Reinforcement Learning

Re-exploring Control Strategies in a Non-Markovian Open Quantum System by Reinforcement Learning

Controlling nonergodicity in quantum many-body systems by reinforcement learning