A study on dynamic adjustment of the non-uniform quantizer in advanced audio coding

Wu Sheng,Qiu Xiao-Jun
DOI: https://doi.org/10.3321/j.issn:0469-5097.2009.01.008
2009-01-01
Abstract:Non-uniform quantier is widely used in perceptual audio coders. In existing studies, quantization distortion is measured by mean square error. In this paper, through the study of non-uniform quantier, the cluster energy distortion is introduced. The cluster energy distortion is defined as the mathematical expectation of the energy error between the original signal before quantization and the reconstructed signal after quantization. With the cluster energy distortion principal, a quantizer without cluster energy distortion is proposed. This quantizer keeps the energy of the original signal and the reconstructed signal conservation. By applying the zero cluster energy error distortion rule among quantization levels to constraint the cluster energy distortion between quantization levels, a partition method of the dynamic quantization threshold can be obtained. This partition method correlates with the distribution of the raw quantization spectrum. By counting the appearance frequency in small divided spaces to obtain the approximate distribution of the raw quantization spectrum, an approximate solution of the dynamic quantization threshold could be calculated. With this dynamic quantization threshold by modifying the rounding operation of quantizer, a dynamic adjustment quantizer which follows the distribution of input signal is designed. This dynamic adjustment quantizer is applied in the advanced audio coding. Objective audio quality evaluation based on the perceptual evaluation of audio quality method shows that the encoder with the proposed dynamic adjustment quantizer has a better encoding performance than the encoder with the recommended quantizer that is defined in the advanced audio coding standard. The distortion index and the noise to mask radio, which are key objective audio quality measures are improved. This improvement becomes more significant as the bitrate increases. Subjective audio quality degeneration evaluation based on hearing test also shows that, at 218 kbsp bitrate, the encoder with the dynamic adjustment quantizer has less audio quality degeneration than the encoder with the recommended quantizer. With about 3% reduction of the bitrate, the encoder with the dynamic adjustment quantizer keeps the same auditory perception level as the encoder with the recommended quantizer. The dynamic adjustment quantizer is independent of the structure of the encoder, which only costs a slight increasing of computational complexity and storage space. This proposed method could also be applied to other encoders.
What problem does this paper attempt to address?