HMM estimation of energy contours in speech decoders

计哲,高圣翔,唐昆,金鑫
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2013.06.029
2013-01-01
Abstract: Low bit rate speech coding requires efficient quantization of the coder parameters. This article presents an energy contour estimation algorithm that predicts changes in speech energy from the line spectral frequency (LSF) and unvoiced/voiced (U/V) decision parameters. A hidden Markov model (HMM) characterizes the statistical properties of the energy, LSF, and U/V decision parameters and exploits the correlations among them. The algorithm estimates the energy contour properly, which contributes to quantization of the decoder parameters. Tests show that the energy contour estimation algorithm improves the mean opinion score (MOS) of speech synthesized by the mixed excitation linear prediction (MELP) vocoder at a bit rate of 150 b/s by 0.042, indicating that the algorithm improves parameter quantization in ultra-low bit rate vocoders.
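The abstract does not give the model structure, so the following is only a minimal sketch of the general idea, not the authors' algorithm. It assumes a discrete HMM whose hidden states are quantized energy levels and whose observations are codebook indices that stand in for the joint LSF and U/V parameters; Viterbi decoding then recovers the most likely energy contour from the observed indices. The function name `viterbi`, the two-state layout, every probability, and the dB values are hypothetical and chosen purely for illustration.

```python
import math


def viterbi(obs, log_init, log_trans, log_emit):
    """Return the most likely hidden-state path of a discrete HMM (log domain)."""
    n_states = len(log_init)
    # score[s]: best log-probability of any path that ends in state s
    score = [log_init[s] + log_emit[s][obs[0]] for s in range(n_states)]
    back = []  # back[t][s]: best predecessor of state s at frame t + 1
    for o in obs[1:]:
        prev = score
        pointers, score = [], []
        for s in range(n_states):
            best = max(range(n_states), key=lambda p: prev[p] + log_trans[p][s])
            pointers.append(best)
            score.append(prev[best] + log_trans[best][s] + log_emit[s][o])
        back.append(pointers)
    # Trace the best path backwards from the best final state.
    state = max(range(n_states), key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


def to_log(matrix):
    """Element-wise natural log of a list of probability rows."""
    return [[math.log(p) for p in row] for row in matrix]


# Hypothetical toy model: state 0 = low energy, state 1 = high energy.
ENERGY_DB = [40.0, 70.0]                 # assumed energy level per state
log_init = [math.log(0.5), math.log(0.5)]
log_trans = to_log([[0.8, 0.2],          # energy tends to persist across frames
                    [0.2, 0.8]])
# Observation 0 = "unvoiced-like" LSF/U-V codeword, 1 = "voiced-like" codeword.
log_emit = to_log([[0.9, 0.1],
                   [0.1, 0.9]])

observations = [0, 0, 1, 1, 1, 0]        # one codeword index per frame
path = viterbi(observations, log_init, log_trans, log_emit)
contour = [ENERGY_DB[s] for s in path]   # estimated energy contour in dB
print(contour)
```

In a decoder built along these lines, such an estimate could serve as a prediction of the energy parameter, leaving only the deviation from the prediction to be quantized; whether the paper uses the estimate in this way is not stated in the abstract.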