Reinforcement Learning Based Quantum Circuit Optimization via ZX-Calculus

Jordi Riu,Jan Nogué,Gerard Vilaplana,Artur Garcia-Saez,Marta P. Estarellas

2024-06-04

Abstract:We propose a novel Reinforcement Learning (RL) method for optimizing quantum circuits using graph-theoretic simplification rules of ZX-diagrams. The agent, trained using the Proximal Policy Optimization (PPO) algorithm, employs Graph Neural Networks to approximate the policy and value functions. We demonstrate the capacity of our approach by comparing it against the best performing ZX-Calculus-based algorithm for the problem in hand. After training on small Clifford+T circuits of 5-qubits and few tenths of gates, the agent consistently improves the state-of-the-art for this type of circuits, for at least up to 80-qubit and 2100 gates, whilst remaining competitive in terms of computational performance. Additionally, we illustrate its versatility by targeting both total and two-qubit gate count reduction, conveying the potential of tailoring its reward function to the specific characteristics of each hardware backend. Our approach is ready to be used as a valuable tool for the implementation of quantum algorithms in the near-term intermediate-scale range (NISQ).

Quantum Physics

What problem does this paper attempt to address?

The problem that this paper attempts to solve is that in quantum circuit optimization, how to use reinforcement learning (RL) combined with ZX - Calculus graphical language to reduce the number of quantum gates, especially the number of two - qubit gates, thereby improving the execution efficiency and reliability of quantum circuits on current quantum devices. Specifically, the paper proposes a new RL method based on the Proximal Policy Optimization (PPO) algorithm, using Graph Neural Networks (GNNs) to approximate the policy and value functions in order to achieve effective optimization of quantum circuits. This method aims to overcome the exploration difficulties caused by the overly large action space in traditional algebraic simplification methods, and by using the simplification rules of ZX - Calculus to reduce the types of actions that need to be processed, enabling the RL agent to explore and utilize the optimal actions more effectively, thus significantly improving the optimization effect on Clifford + T circuits while maintaining competitive computational performance, and is applicable to circuits with up to 80 qubits and 2,100 gates. In addition, this method also demonstrates its flexibility and can be adjusted through the reward function to adapt to the specific characteristics of different hardware back - ends.

Reinforcement Learning Based Quantum Circuit Optimization via ZX-Calculus

Quarl: A Learning-Based Quantum Circuit Optimizer

On the optimality of quantum circuit initial mapping using reinforcement learning

Practical and efficient quantum circuit synthesis and transpiling with Reinforcement Learning

Cost Explosion for Efficient Reinforcement Learning Optimisation of Quantum Circuits

Reinforcement-Learning-Based Variational Quantum Circuits Optimization for Combinatorial Problems

Qubit-count optimization using ZX-calculus

A Reinforcement Learning Environment for Directed Quantum Circuit Synthesis

Causal flow preserving optimisation of quantum circuits in the ZX-calculus

Application of ZX-calculus to Quantum Architecture Search

Graph-theoretic Simplification of Quantum Circuits with the ZX-calculus

Quantum Circuit Optimization of Arithmetic circuits using ZX Calculus

Vanishing 2-Qubit Gates with Non-Simplification ZX-Rules

Challenges for Reinforcement Learning in Quantum Circuit Design

Optimization of Reinforcement Learning Using Quantum Computation

From Easy to Hard: Tackling Quantum Problems with Learned Gadgets For Real Hardware

Towards Faster Reinforcement Learning of Quantum Circuit Optimization: Exponential Reward Functions

Graph Neural Network Autoencoders for Efficient Quantum Circuit Optimisation

A Study on Optimization Techniques for Variational Quantum Circuits in Reinforcement Learning

A recursively partitioned approach to architecture-aware ZX Polynomial synthesis and optimization

Optimizing ZX-Diagrams with Deep Reinforcement Learning