Abstract:Semantic communications (SemCom) have emerged as a new paradigm for supporting sixth-generation applications, where semantic features of data are transmitted using artificial intelligence algorithms to attain high communication efficiencies. Most existing SemCom techniques utilize deep neural networks (DNNs) to implement analog source-channel mappings, which are incompatible with existing digital communication architectures. To address this issue, this paper proposes a novel framework of digital deep joint source-channel coding (D$^2$-JSCC) targeting image transmission in SemCom. The framework features digital source and channel codings that are jointly optimized to reduce the end-to-end (E2E) distortion. First, deep source coding with an adaptive density model is designed to encode semantic features according to their distributions. Second, digital channel coding is employed to protect encoded features against channel distortion. To facilitate their joint design, the E2E distortion is characterized as a function of the source and channel rates via the analysis of the Bayesian model and Lipschitz assumption on the DNNs. Then to minimize the E2E distortion, a two-step algorithm is proposed to control the source-channel rates for a given channel signal-to-noise ratio. Simulation results reveal that the proposed framework outperforms classic deep JSCC and mitigates the cliff and leveling-off effects, which commonly exist for separation-based approaches.

What problem does this paper attempt to address?

### Problems Addressed by the Paper This paper aims to propose a novel framework—Digital Deep Joint Source-Channel Coding (D2-JSCC) to address the issue of image transmission in Semantic Communication (SemCom). Specifically, the paper addresses the following points: 1. **Semantic Feature Extraction and Encoding**: - Proposes a new architecture combining deep source coding and digital channel coding, efficiently extracting and encoding the semantic features of data through an adaptive density model. - Designs an adaptive density model to learn the probability density function (PDF) of features, thereby improving coding efficiency. 2. **End-to-End Distortion Minimization**: - Based on the proposed architecture, defines an end-to-end distortion minimization problem and approximates the intractable end-to-end distortion as a function of DNN parameters and channel rate through Bayesian approximation and the Lipschitz assumption of deep neural networks (DNN). - This allows for the joint optimization of source rate and channel rate to adapt to different channel signal-to-noise ratios (SNR). 3. **Optimal Rate Control Algorithm**: - Proposes an efficient two-step algorithm to balance the trade-off between source rate and channel rate for a given channel SNR, thereby minimizing end-to-end distortion. - In the first step, selects a DNN model with an appropriate source rate; in the second step, retrains the selected DNN model to adapt to the channel SNR, achieving near-optimal end-to-end performance. 4. **Experimental Validation**: - Experimental results show that the proposed D2-JSCC method can mitigate the "cliff effect" and "flat effect" present in traditional digital systems, as it can adaptively optimize source coding and channel coding according to different channel SNRs. - Additionally, the D2-JSCC method outperforms classical deep JSCC schemes, as the latter fixes the number of transmission symbols for all images, while the former dynamically adjusts the number of symbols based on image content and channel SNR. - As the block length increases, the end-to-end performance of D2-JSCC gradually approaches the performance of separate source-channel coding and capacity-achieving codes. In summary, this paper aims to enhance the overall performance of semantic communication systems, especially in image transmission, by combining deep learning techniques and digital communication architectures, achieving better end-to-end distortion performance through adaptive models and joint optimization techniques.

D$^2$-JSCC: Digital Deep Joint Source-channel Coding for Semantic Communications

Joint Task and Data Oriented Semantic Communications: A Deep Separate Source-channel Coding Scheme

Joint Source-Channel Coding for Channel-Adaptive Digital Semantic Communications

Perceptual Learned Source-Channel Coding for High-Fidelity Image Semantic Transmission

Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs

Joint Source-Channel Coding System for 6G Communication: Design, Prototype and Future Directions

Nonlinear Transform Source-Channel Coding for Semantic Communications

Deep Joint Source-Channel Coding for Wireless Image Transmission with Adaptive Models

DeepJSCC-Q: Constellation Constrained Deep Joint Source-Channel Coding

Semantic Communications for Image Recovery and Classification via Deep Joint Source and Channel Coding

From Analog to Digital: Multi-Order Digital Joint Coding-Modulation for Semantic Communication

Variational Source-Channel Coding for Semantic Communication

Deep Joint Source-Channel and Encryption Coding: Secure Semantic Communications

Deep Joint Source-Channel Coding Based on Semantics of Pixels

Universal Joint Source-Channel Coding for Modulation-Agnostic Semantic Communication

NeurJSCC Enabled Semantic Communications: Paradigms, Applications, and Potentials

A Deep Joint Source-Channel Coding Scheme for Hybrid Mobile Multi-hop Networks

Predictive and Adaptive Deep Coding for Wireless Image Transmission in Semantic Communication

A Hybrid Joint Source-Channel Coding Scheme for Mobile Multi-hop Networks