DCT_M Model for Excitation Parameter in Low Bit Rate Vocoder

Xiaoyan Dang,Kun Tang
DOI: https://doi.org/10.1007/s11460-008-0043-1
2008-01-01
Frontiers of Electrical and Electronic Engineering in China
Abstract:The description precision of an excitation signal greatly influences the quality of reconstructed speech in low bit rate vocoders. To improve the reconstruction quality, the DCT_M model is proposed to express the excitation spectral parameter, which transforms the variable length vector to fixed dimension vector through DCT transformation. It then quantizes the fixed length vector using multi-stage vector quantization. Tests show that the proposed method can keep the shape of the entire spectral envelope and reduce model error thus greatly improve the description precision. Test results in the sine excitation linear prediction (SELP) vocoder show that the DCT_M model can improve the naturalness of reconstructed speech, with subjective test score of 65%.
What problem does this paper attempt to address?