VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation

Aaron Mark Thomas,Sharu Theresa Jose

2024-09-16

Abstract:This paper presents a novel hybrid quantum generative model, the VAE-QWGAN, which combines the strengths of a classical Variational AutoEncoder (VAE) with a hybrid Quantum Wasserstein Generative Adversarial Network (QWGAN). The VAE-QWGAN integrates the VAE decoder and QGAN generator into a single quantum model with shared parameters, utilizing the VAE's encoder for latent vector sampling during training. To generate new data from the trained model at inference, input latent vectors are sampled from a Gaussian Mixture Model (GMM), learnt on the training latent vectors. This, in turn, enhances the diversity and quality of generated images. We evaluate the model's performance on MNIST/Fashion-MNIST datasets, and demonstrate improved quality and diversity of generated images compared to existing approaches.

Quantum Physics,Computer Vision and Pattern Recognition,Machine Learning

What problem does this paper attempt to address?

The paper attempts to address the problem of generating high-quality, high-resolution images under the limitations of current quantum technology, particularly NISQ devices. Specifically, the paper focuses on the challenges faced by Quantum Generative Adversarial Networks (QGANs) when dealing with high-dimensional classical datasets, such as generating low-quality samples and mode collapse issues. To tackle these challenges, the authors propose a new hybrid quantum generative model called VAE-QWGAN. This model combines the advantages of classical Variational Autoencoders (VAE) and hybrid Quantum Wasserstein Generative Adversarial Networks (QWGAN) by integrating the VAE decoder and QGAN generator into a single quantum model through shared parameters and using the VAE encoder for latent vector sampling during training. Additionally, during the inference phase, the model uses a Gaussian Mixture Model (GMM) learned from the training latent vectors to generate new data, thereby improving the quality and diversity of the generated images. The main contributions of the paper include: 1. **Model Design**: Proposing the VAE-QWGAN model, which combines the strengths of VAE and QWGAN. 2. **Training Method**: Ensuring stable training of the model by combining the loss functions of VAE and QWGAN. 3. **Performance Evaluation**: Conducting experiments on the MNIST and Fashion-MNIST datasets to validate the model's superiority in generating high-quality and diverse images. In summary, the paper aims to address the quality and mode collapse issues of existing QGANs in generating high-resolution images by introducing the VAE-QWGAN model.

VAE-QWGAN: Improving Quantum GANs for High Resolution Image Generation

Variational Quantum Circuits Enhanced Generative Adversarial Network

Generating Diverse High-Fidelity Images with VQ-VAE-2

LatentQGAN: A Hybrid QGAN with Classical Convolutional Autoencoder

A Survey of Recent Advances in Quantum Generative Adversarial Networks

Quantum Wasserstein Generative Adversarial Networks.

Latent Style-based Quantum GAN for high-quality Image Generation

Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers

Anomaly detection with variational quantum generative adversarial networks

CVAE-GAN: Fine-Grained Image Generation through Asymmetric Training

Wavelets to the Rescue: Improving Sample Quality of Latent Variable Deep Generative Models

Quantum Generative Learning for High-Resolution Medical Image Generation

Lossy Image Compression with Quantized Hierarchical VAEs

$ζ$-QVAE: A Quantum Variational Autoencoder utilizing Regularized Mixed-state Latent Representations

Experimental Quantum Generative Adversarial Networks for Image Generation

Vector Quantized Time Series Generation with a Bidirectional Prior Model

A Bayesian Non-parametric Approach to Generative Models: Integrating Variational Autoencoder and Generative Adversarial Networks using Wasserstein and Maximum Mean Discrepancy

Variational Conditional GAN for Fine-grained Controllable Image Generation

Entangling Quantum Generative Adversarial Networks