Wireless Transmission of Images With The Assistance of Multi-level Semantic Information

Zhenguo Zhang,Qianqian Yang,Shibo He,Mingyang Sun,Jiming Chen
2023-12-08
Abstract:Semantic-oriented communication has been considered as a promising to boost the bandwidth efficiency by only transmitting the semantics of the data. In this paper, we propose a multi-level semantic aware communication system for wireless image transmission, named MLSC-image, which is based on the deep learning techniques and trained in an end to end manner. In particular, the proposed model includes a multilevel semantic feature extractor, that extracts both the highlevel semantic information, such as the text semantics and the segmentation semantics, and the low-level semantic information, such as local spatial details of the images. We employ a pretrained image caption to capture the text semantics and a pretrained image segmentation model to obtain the segmentation semantics. These high-level and low-level semantic features are then combined and encoded by a joint semantic and channel encoder into symbols to transmit over the physical channel. The numerical results validate the effectiveness and efficiency of the proposed semantic communication system, especially under the limited bandwidth condition, which indicates the advantages of the high-level semantics in the compression of images.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper aims to address the issue of bandwidth efficiency in wireless image transmission by proposing a Multi-Level Semantic-Aware Communication System (MLSC-image) based on deep learning. Specifically, the main objectives of the paper include: 1. **Improving Bandwidth Efficiency**: Enhance the bandwidth utilization of wireless communication systems by transmitting only the semantic information of the image instead of the raw data. 2. **Multi-Level Semantic Feature Extraction**: Utilize a multi-level semantic feature extractor to extract high-level semantic information (such as textual semantics and segmentation semantics) as well as low-level semantic information (such as local spatial details) from the image. 3. **End-to-End Training**: The entire system adopts an end-to-end training approach to ensure robustness and efficiency during the transmission process. 4. **Advantages under Limited Bandwidth Conditions**: Validate the effectiveness of the proposed system in compressing images using high-level semantic information under bandwidth-limited conditions. Through the aforementioned methods, the paper demonstrates that the proposed MLSC-image system shows significant advantages in terms of Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) compared to other traditional methods and deep learning-based methods under different Signal-to-Noise Ratio (SNR) conditions, especially performing better under bandwidth-limited scenarios.