Quantum continual learning on a programmable superconducting processor

Chuanyu Zhang, Zhide Lu, Liangtian Zhao, Shibo Xu, Weikang Li, Ke Wang, Jiachen Chen, Yaozu Wu, Feitong Jin, Xuhao Zhu, Yu Gao, Ziqi Tan, Zhengyi Cui, Aosai Zhang, Ning Wang, Yiren Zou, Tingting Li, Fanhao Shen, Jiarun Zhong, Zehang Bao, Zitian Zhu, Zixuan Song, Jinfeng Deng, Hang Dong, Pengfei Zhang, Wenjie Jiang, Zheng-Zhi Sun, Pei-Xin Shen, Hekang Li, Qiujiang Guo, Zhen Wang, Jie Hao, H. Wang, Dong-Ling Deng, Chao Song
2024-09-15
Abstract: Quantum computers may outperform classical computers on machine learning tasks. In recent years, a variety of quantum algorithms have been proposed that promise to enhance, speed up, or innovate machine learning. Yet quantum learning systems, like their classical counterparts, may suffer from catastrophic forgetting, where training a model on new tasks causes a dramatic performance drop on previously learned ones. This problem is widely believed to be a crucial obstacle to continual learning of multiple sequential tasks. Here, we report an experimental demonstration of quantum continual learning on a fully programmable superconducting processor. In particular, we sequentially train a quantum classifier on three tasks, two on identifying real-life images and one on classifying quantum states, and demonstrate catastrophic forgetting through experimentally observed rapid performance drops on prior tasks. To overcome this problem, we exploit the elastic weight consolidation strategy and show that the quantum classifier can incrementally learn and retain knowledge across the three distinct tasks, with an average prediction accuracy exceeding 92.3%. In addition, for sequential tasks involving quantum-engineered data, we demonstrate that the quantum classifier can achieve better continual learning performance than a commonly used classical feedforward network with a comparable number of variational parameters. Our results establish a viable strategy for giving quantum learning systems the adaptability needed for multiple sequential tasks, marking an important first experimental step towards the long-term goal of quantum artificial general intelligence.
Quantum Physics
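The elastic weight consolidation (EWC) strategy named in the abstract is usually written as a quadratic penalty that anchors parameters important to an earlier task. The sketch below shows that standard textbook form only, loss = new-task loss + (lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2, in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the function names, the diagonal Fisher weights `fisher`, and the strength `lam` are hypothetical, and the abstract does not say how these quantities are estimated on the quantum processor.

```python
from typing import List, Sequence


def ewc_penalty(theta: Sequence[float], theta_star: Sequence[float],
                fisher: Sequence[float], lam: float) -> float:
    """Quadratic EWC penalty: (lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2.

    theta      -- current variational parameters (hypothetical name)
    theta_star -- parameters saved after training on the previous task
    fisher     -- assumed diagonal Fisher-information weights, one per parameter
    lam        -- assumed regularization strength
    """
    if not (len(theta) == len(theta_star) == len(fisher)):
        raise ValueError("theta, theta_star and fisher must have equal length")
    return 0.5 * lam * sum(
        f * (t - s) ** 2 for t, s, f in zip(theta, theta_star, fisher)
    )


def ewc_loss(new_task_loss: float, theta: Sequence[float],
             theta_star: Sequence[float], fisher: Sequence[float],
             lam: float) -> float:
    """Total objective: loss on the new task plus the EWC penalty."""
    return new_task_loss + ewc_penalty(theta, theta_star, fisher, lam)


def ewc_penalty_grad(theta: Sequence[float], theta_star: Sequence[float],
                     fisher: Sequence[float], lam: float) -> List[float]:
    """Gradient of the penalty: lam * F_i * (theta_i - theta_star_i)."""
    if not (len(theta) == len(theta_star) == len(fisher)):
        raise ValueError("theta, theta_star and fisher must have equal length")
    return [lam * f * (t - s) for t, s, f in zip(theta, theta_star, fisher)]
```

Parameters with a large Fisher weight are pulled strongly back towards their earlier values, while parameters with a weight near zero remain free to adapt to the new task. This is how a penalty of this form trades off learning a new task against retaining earlier ones.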