Improvement of Packet Loss Concealment for EVS Codec Based on Deep Learning

Baoping Cheng,He Wang,Sheng Wang,Min Chen,Xiaoming Tao
DOI: https://doi.org/10.1109/ICBAIE59714.2023.10281294
2023-01-01
Abstract:With the advancement of network technology, real-time communication has become an integral part of daily life. However, packet loss presents a significant challenge to the quality of real-time communication. The Enhanced Voice Services (EVS) codec includes an inherent Linear Prediction (LP)-based packet loss concealment system to mitigate the effects of packet loss, but this system exhibits noticeable limitations concerning voice quality and resilience during severe packet loss scenarios. In this paper, we propose an enhancement to the EVS’s PLC performance using a Generative Adversarial Network (GAN)-based deep learning model. This model employs a generator with an asymmetric causal convolutional network and a multi-resolution discriminator for lost speech reconstruction. Experimental results show that the proposed system outperforms the inherent PLC system of the EVS codec in terms of Perceptual Evaluation of Speech Quality (PESQ), Short Time Object Intelligibility (STOI) and PLCMOS. Additionally, subjective listening tests following the MUSHRA standard further confirmed the improvement in speech quality.
What problem does this paper attempt to address?