Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints

Lei Guo, Wei Chen, Yuxuan Sun, Bo Ai, Nikolaos Pappas, Tony Quek
2024-07-26
Abstract: Diffusion models have been extensively utilized in AI-generated content (AIGC) in recent years, thanks to their superior generation capabilities. Combined with semantic communications, diffusion models are used for tasks such as denoising, data reconstruction, and content generation. However, existing diffusion-based generative models do not consider stringent bandwidth limitations, which limits their application in wireless communication. This paper introduces a diffusion-driven semantic communication framework with advanced VAE-based compression for bandwidth-constrained generative models. Our architecture utilizes the diffusion model, where the signal transmission process through the wireless channel acts as the forward process in diffusion. To reduce bandwidth requirements, we incorporate a downsampling module and a paired upsampling module based on a variational auto-encoder with reparameterization at the receiver, which ensures that the recovered features conform to the Gaussian distribution. Furthermore, we derive the loss function for our proposed system and evaluate its performance through comprehensive experiments. Our experimental results demonstrate significant improvements in pixel-level metrics such as peak signal-to-noise ratio (PSNR) and semantic metrics such as learned perceptual image patch similarity (LPIPS). Compared to deep joint source-channel coding (DJSCC), these improvements are more pronounced across compression rates and SNRs.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses the application of generative models in bandwidth-constrained environments, particularly in wireless communications. Existing diffusion models perform well in generative tasks such as image denoising, data reconstruction, and content generation, but they do not consider strict bandwidth limitations, which restricts their use in wireless communications. To this end, the authors propose a semantic communication framework based on diffusion models, combined with compression techniques based on variational autoencoders (VAE) to cope with bandwidth constraints.

The main contributions of the paper include:

1. **Proposing a communication-efficient generative semantic communication system**: This system combines the forward process of the diffusion model with channel noise, mapping the channel noise to the T-th step of the forward process in the diffusion model to adapt to different signal-to-noise ratio (SNR) conditions. The receiver uses the reverse process of the diffusion model to remove the channel noise.
2. **Designing a bandwidth compression module**: A downsampling module and a VAE-based upsampling module are introduced to reduce bandwidth requirements. To ensure that the recovered features conform to a Gaussian distribution, a VAE upsampling network with reparameterization is proposed.
3. **Integrating a guidance mechanism**: The architecture incorporates a guidance method to improve the feature extraction capability of the downsampling module and the recovery capability of the upsampling module by learning from the distribution of the uncompressed generator. Additionally, a comprehensive loss function combining VAE compression and the guidance mechanism is introduced.
4. **Experimental validation**: Extensive experiments demonstrate that, compared to methods based on deep joint source-channel coding (DJSCC), this method significantly improves pixel-level metrics (such as PSNR) and semantic metrics (such as LPIPS) under different compression rates and SNR conditions.

In summary, the paper aims to overcome bandwidth limitations through a novel diffusion-driven semantic communication framework, thereby achieving efficient data transmission and high-quality content generation in wireless communications.
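
The idea of treating channel noise as a step of the diffusion forward process, together with the reparameterization used in the VAE upsampler, can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes a standard DDPM linear noise schedule, a unit-power signal over an AWGN channel, and the common matching rule that picks the step t whose cumulative product ᾱ_t is closest to SNR / (1 + SNR). The function names and schedule parameters are hypothetical, chosen only for illustration.

```python
import math
import random


def linear_alpha_bars(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Cumulative products alpha_bar_t for a linear beta schedule.

    The schedule and its default values are assumptions (standard DDPM
    settings), not values taken from the paper.
    """
    alpha_bars, prod = [], 1.0
    for i in range(num_steps):
        beta = beta_start + (beta_end - beta_start) * i / (num_steps - 1)
        prod *= 1.0 - beta
        alpha_bars.append(prod)
    return alpha_bars


def snr_db_to_step(snr_db, alpha_bars):
    """Return the 1-based step t whose alpha_bar_t best matches the channel SNR.

    For a unit-power signal with noise variance sigma^2 = 1 / SNR, rescaling
    the received signal by sqrt(alpha_bar), with alpha_bar = SNR / (1 + SNR),
    gives sqrt(alpha_bar) * x + sqrt(1 - alpha_bar) * eps, which has the same
    form as a forward-process sample.
    """
    snr = 10.0 ** (snr_db / 10.0)
    target = snr / (1.0 + snr)
    best = min(range(len(alpha_bars)), key=lambda i: abs(alpha_bars[i] - target))
    return best + 1


def reparameterize(mu, log_var, rng=random):
    """VAE reparameterization: z = mu + sigma * eps with eps ~ N(0, 1)."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


if __name__ == "__main__":
    bars = linear_alpha_bars()
    for snr_db in (0, 10, 20):
        print(snr_db, "dB -> step", snr_db_to_step(snr_db, bars))
    print(reparameterize([0.0, 1.0], [0.0, -2.0]))
```

Under these assumptions, a noisier channel (lower SNR) maps to a later step, so the receiver would run more reverse steps to remove the noise. The reparameterized sample stays Gaussian by construction, which is the property the summary attributes to the VAE upsampling network.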