Abstract:Diffusion models have been extensively utilized in AI-generated content (AIGC) in recent years, thanks to the superior generation capabilities. Combining with semantic communications, diffusion models are used for tasks such as denoising, data reconstruction, and content generation. However, existing diffusion-based generative models do not consider the stringent bandwidth limitation, which limits its application in wireless communication. This paper introduces a diffusion-driven semantic communication framework with advanced VAE-based compression for bandwidth-constrained generative model. Our designed architecture utilizes the diffusion model, where the signal transmission process through the wireless channel acts as the forward process in diffusion. To reduce bandwidth requirements, we incorporate a downsampling module and a paired upsampling module based on a variational auto-encoder with reparameterization at the receiver to ensure that the recovered features conform to the Gaussian distribution. Furthermore, we derive the loss function for our proposed system and evaluate its performance through comprehensive experiments. Our experimental results demonstrate significant improvements in pixel-level metrics such as peak signal to noise ratio (PSNR) and semantic metrics like learned perceptual image patch similarity (LPIPS). These enhancements are more profound regarding the compression rates and SNR compared to deep joint source-channel coding (DJSCC).

What problem does this paper attempt to address?

The paper aims to address the application of generative models in bandwidth-constrained environments, particularly in wireless communications. Specifically, while existing diffusion models perform well in generative tasks such as image denoising, data reconstruction, and content generation, they do not consider strict bandwidth limitations, which restricts their application in wireless communications. To this end, the authors propose a semantic communication framework based on diffusion models, combined with advanced compression techniques based on variational autoencoders (VAE) to cope with bandwidth constraints. The main contributions of the paper include: 1. **Proposing a communication-efficient generative semantic communication system**: This system combines the forward process of the diffusion model with channel noise, mapping the channel noise to the T-th step of the forward process in the diffusion model to adapt to different signal-to-noise ratio (SNR) conditions. The receiver uses the reverse process of the diffusion model to remove the channel noise. 2. **Designing a bandwidth compression module**: By introducing a downsampling module and a VAE-based upsampling module to reduce bandwidth requirements. To ensure that the recovered features conform to a Gaussian distribution, a VAE upsampling network with reparameterization is proposed. 3. **Integrating a guidance mechanism**: The architecture incorporates a guidance method to improve the feature extraction capability of the downsampling module and the recovery capability of the upsampling module by learning from the distribution of the uncompressed generator. Additionally, a comprehensive loss function combining VAE compression and the guidance mechanism is introduced. 4. **Experimental validation**: Extensive experiments demonstrate that, compared to methods based on deep joint source-channel coding (DJSCC), this method significantly improves pixel-level metrics (such as PSNR) and semantic metrics (such as LPIPS) under different compression rates and SNR conditions. In summary, the paper aims to overcome bandwidth limitations through a novel diffusion-driven semantic communication framework, thereby achieving efficient data transmission and high-quality content generation in wireless communications.

Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

Latency-Aware Generative Semantic Communications with Pre-Trained Diffusion Models

Rate-Adaptive Generative Semantic Communication Using Conditional Diffusion Models

Diff-GO: Diffusion Goal-Oriented Communications to Achieve Ultra-High Spectrum Efficiency

Lightweight Diffusion Models for Resource-Constrained Semantic Communication

Diffusion models for audio semantic communication

Diffusion-based Generative Multicasting with Intent-aware Semantic Decomposition

Rethinking Multi-User Semantic Communications with Deep Generative Models

A novel image semantic communication method via dynamic decision generation network and generative adversarial network

Language-oriented Semantic Communication for Image Transmission with Fine-Tuned Diffusion Model

Semantic Successive Refinement: A Generative AI-aided Semantic Communication Framework

Toward Adaptive Semantic Communications: Efficient Data Transmission Via Online Learned Nonlinear Transform Source-Channel Coding

Latent Diffusion Model-Enabled Low-Latency Semantic Communication in the Presence of Semantic Ambiguities and Wireless Channel Noises

Semantics-Guided Diffusion for Deep Joint Source-Channel Coding in Wireless Image Transmission

A Wireless AI-Generated Content (AIGC) Provisioning Framework Empowered by Semantic Communication

Semantic Change Driven Generative Semantic Communication Framework

Goal-Oriented Semantic Communication for Wireless Image Transmission via Stable Diffusion

Generative Semantic Communication for Text-to-Speech Synthesis

DiffCom: Channel Received Signal is a Natural Condition to Guide Diffusion Posterior Sampling

Semantic-Aware Power Allocation for Generative Semantic Communications with Foundation Models