Task-Oriented Semantic Communications for Speech Transmission

Zhenzi Weng,Zhijin Qin,Xiaoming Tao
DOI: https://doi.org/10.1109/vtc2023-fall60731.2023.10333632
2023-01-01
Abstract:Semantic communications execute intelligent tasks at the receiver by only transmitting necessary information. In this paper, we introduce TOS-ST, a task-oriented semantic communication system for speech transmission, which efficiently serves the semantic tasks at the receiver, including speech-to-text translation and speech-to-speech translation. Particularly, TOS-ST condenses the input speech in the source language and extracts the task-related semantics features prior to transmission. At the receiver, these features are recovered and utilized by the neural network-based semantic preserver and machine translation module to generate the uncorrupted text in the target language. To perform the speech-to-speech translation task, the translated text passes through a sophisticated neural network to obtain speech in the target language. According to the simulation results, the TOS-ST outperforms conventional speech transmission systems and exhibits higher robustness against channel impairment.
What problem does this paper attempt to address?