Latency-minimizing Semantic Communication with Dynamic Model Partitioning

Yuxuan Yan,Yuhao Chen,Qianqian Yang,Zhiguo Shi
DOI: https://doi.org/10.1109/icc51166.2024.10622688
2024-01-01
Abstract:Semantic communication is an emerging communication approach that aims to enhance efficient transmission by conveying the essential semantic meaning of the information while eliminating redundancy. In the current deep learning (DL)-based semantic communication systems, the encoder and decoder at the sender and receiver persist without modification after deployment, irrespective of variations in device computing power and channel bandwidth. This lack of adaptability may result in a decline in performance. To overcome this issue, we introduce an adaptive semantic communication approach aimed at minimizing end-to-end latency by leveraging a dynamic model partitioning mechanism. This mechanism dynamically splits the overall model into the encoder and decoder components, with the partitioning points adapting to changing communication and computing resources. Furthermore, we present a training method referred to as scheduled random partition point training to ensure that changes in the partitioning points do not adversely impact the performance of downstream tasks. Our experimental results affirm the effectiveness of these methods in terms of reducing latency and improving task performance.
What problem does this paper attempt to address?