D$^2$-JSCC: Digital Deep Joint Source-channel Coding for Semantic Communications

Jianhao Huang,Kai Yuan,Chuan Huang,Kaibin Huang
2024-03-14
Abstract:Semantic communications (SemCom) have emerged as a new paradigm for supporting sixth-generation applications, where semantic features of data are transmitted using artificial intelligence algorithms to attain high communication efficiencies. Most existing SemCom techniques utilize deep neural networks (DNNs) to implement analog source-channel mappings, which are incompatible with existing digital communication architectures. To address this issue, this paper proposes a novel framework of digital deep joint source-channel coding (D$^2$-JSCC) targeting image transmission in SemCom. The framework features digital source and channel codings that are jointly optimized to reduce the end-to-end (E2E) distortion. First, deep source coding with an adaptive density model is designed to encode semantic features according to their distributions. Second, digital channel coding is employed to protect encoded features against channel distortion. To facilitate their joint design, the E2E distortion is characterized as a function of the source and channel rates via the analysis of the Bayesian model and Lipschitz assumption on the DNNs. Then to minimize the E2E distortion, a two-step algorithm is proposed to control the source-channel rates for a given channel signal-to-noise ratio. Simulation results reveal that the proposed framework outperforms classic deep JSCC and mitigates the cliff and leveling-off effects, which commonly exist for separation-based approaches.
Information Theory,Multimedia,Signal Processing
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to propose a novel framework—Digital Deep Joint Source-Channel Coding (D2-JSCC) to address the issue of image transmission in Semantic Communication (SemCom). Specifically, the paper addresses the following points: 1. **Semantic Feature Extraction and Encoding**: - Proposes a new architecture combining deep source coding and digital channel coding, efficiently extracting and encoding the semantic features of data through an adaptive density model. - Designs an adaptive density model to learn the probability density function (PDF) of features, thereby improving coding efficiency. 2. **End-to-End Distortion Minimization**: - Based on the proposed architecture, defines an end-to-end distortion minimization problem and approximates the intractable end-to-end distortion as a function of DNN parameters and channel rate through Bayesian approximation and the Lipschitz assumption of deep neural networks (DNN). - This allows for the joint optimization of source rate and channel rate to adapt to different channel signal-to-noise ratios (SNR). 3. **Optimal Rate Control Algorithm**: - Proposes an efficient two-step algorithm to balance the trade-off between source rate and channel rate for a given channel SNR, thereby minimizing end-to-end distortion. - In the first step, selects a DNN model with an appropriate source rate; in the second step, retrains the selected DNN model to adapt to the channel SNR, achieving near-optimal end-to-end performance. 4. **Experimental Validation**: - Experimental results show that the proposed D2-JSCC method can mitigate the "cliff effect" and "flat effect" present in traditional digital systems, as it can adaptively optimize source coding and channel coding according to different channel SNRs. - Additionally, the D2-JSCC method outperforms classical deep JSCC schemes, as the latter fixes the number of transmission symbols for all images, while the former dynamically adjusts the number of symbols based on image content and channel SNR. - As the block length increases, the end-to-end performance of D2-JSCC gradually approaches the performance of separate source-channel coding and capacity-achieving codes. In summary, this paper aims to enhance the overall performance of semantic communication systems, especially in image transmission, by combining deep learning techniques and digital communication architectures, achieving better end-to-end distortion performance through adaptive models and joint optimization techniques.