FSSC: Federated Learning of Transformer Neural Networks for Semantic Image Communication

Yuna Yan, Xin Zhang, Lixin Li, Wensheng Lin, Rui Li, Wenchi Cheng, Zhu Han
2024-07-31
Abstract: In this paper, we address the problem of image semantic communication in a multi-user deployment scenario and propose a federated learning (FL) strategy for a Swin Transformer-based semantic communication system (FSSC). First, we demonstrate that adopting a Swin Transformer for joint source-channel coding (JSCC) effectively extracts semantic information in the communication system. Next, the FL framework is introduced to collaboratively learn a global model by aggregating local model parameters rather than directly sharing clients' data. This approach enhances user privacy protection and reduces the workload on the server or mobile edge. Simulation evaluations indicate that our method outperforms the typical JSCC algorithm and traditional separation-based communication algorithms. In particular, after integrating local semantics, the global aggregation model further increases the Peak Signal-to-Noise Ratio (PSNR) by more than 2 dB, demonstrating the effectiveness of our algorithm.
Artificial Intelligence, Machine Learning, Image and Video Processing
What problem does this paper attempt to address?
This paper addresses image semantic communication in multi-user deployment scenarios and proposes FSSC, a Swin Transformer-based semantic communication system trained with Federated Learning (FL). Specifically, the paper aims to:

1. **Improve the efficiency of image semantic information extraction and transmission**: A Swin Transformer is used for Joint Source-Channel Coding (JSCC) to extract semantic information in the communication system.
2. **Protect user privacy**: Under the FL framework, clients train models locally and contribute only model parameters instead of sharing their data directly.
3. **Reduce the workload on the server or mobile edge**: Training runs locally on the clients and only parameters are aggregated, which lightens the burden on the central server.
4. **Improve image transmission quality**: In the reported simulations, the method outperforms the typical JSCC algorithm and traditional separation-based communication algorithms. After local semantics are integrated, the globally aggregated model raises the Peak Signal-to-Noise Ratio (PSNR) by more than 2 dB.

In summary, the paper aims to achieve efficient, privacy-preserving, and high-quality multi-user image semantic communication by combining the Swin Transformer with Federated Learning.
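
The two quantitative ideas in this summary, aggregating local model parameters into a global model and scoring reconstructions by PSNR, can be illustrated with a minimal sketch. The summary does not state which aggregation rule the paper uses, so the sample-count-weighted average below (FedAvg-style) is an assumption. The function names `federated_average` and `psnr`, the flat-list parameter layout, and the 8-bit peak value of 255 are likewise hypothetical choices made only for illustration.

```python
import math


def federated_average(client_params, client_sizes):
    """Weighted average of per-client parameters (FedAvg-style; assumed rule).

    client_params: list of dicts mapping a parameter name to a flat list of floats.
    client_sizes:  number of local training samples per client, used as weights.
    """
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("client_sizes must sum to a positive number")
    global_params = {}
    for name in client_params[0]:
        length = len(client_params[0][name])
        # Each global entry is the sample-weighted mean of the clients' entries.
        global_params[name] = [
            sum(p[name][i] * n for p, n in zip(client_params, client_sizes)) / total
            for i in range(length)
        ]
    return global_params


def psnr(reference, reconstruction, max_value=255.0):
    """PSNR in dB between two equal-length pixel sequences."""
    if len(reference) != len(reconstruction):
        raise ValueError("inputs must have the same length")
    mse = sum((a - b) ** 2 for a, b in zip(reference, reconstruction)) / len(reference)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Only the parameter lists reach `federated_average`, never the clients' images, which is the privacy argument made above. Because PSNR is logarithmic in the mean squared error, a gain of 2 dB corresponds to the error shrinking by a factor of about 10^0.2 ≈ 1.58.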