Perceptual Video Coding Based on SSIM-Inspired Divisive Normalization

Shiqi Wang, Abdul Rehman, Zhou Wang, Siwei Ma, Wen Gao
DOI: https://doi.org/10.1109/tip.2012.2231090
IF: 10.6
2012-01-01
IEEE Transactions on Image Processing
Abstract: We propose a perceptual video coding framework based on divisive normalization, a scheme that has been found to model the perceptual sensitivity of biological vision effectively but has not been fully exploited in video coding. At the macroblock (MB) level, we derive normalization factors from the structural similarity (SSIM) index in an attempt to transform the discrete cosine transform (DCT) domain frame residuals into a perceptually uniform space. We further develop an MB-level perceptual mode selection scheme and a frame-level global quantization matrix optimization method. Extensive simulations and subjective tests verify that, compared with the H.264/AVC video coding standard, the proposed method achieves a significant gain in rate-SSIM performance and provides better visual quality.
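The idea of dividing DCT-domain residuals by a per-MB factor before quantization can be illustrated with a minimal sketch. It is not the paper's derivation: the abstract does not give the formula for the normalization factors, so the energy measure below (AC energy of each block stabilized by the SSIM constant C2, divided by the frame average) is an assumption chosen for illustration, and all function names are hypothetical.

```python
import numpy as np

# SSIM stabilizing constant for 8-bit content (K2 = 0.03, L = 255).
C2 = (0.03 * 255) ** 2


def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m


def ac_energy(block):
    """Assumed local energy measure: RMS of the AC coefficients plus C2."""
    d = dct_matrix(block.shape[0])
    coeffs = d @ block @ d.T
    ac = np.sum(coeffs ** 2) - coeffs[0, 0] ** 2
    return np.sqrt(max(ac, 0.0) / (coeffs.size - 1) + C2)


def normalization_factors(blocks):
    """Per-block factor: local energy relative to the frame average."""
    energies = np.array([ac_energy(b) for b in blocks])
    return energies / energies.mean()


def quantize_normalized(residual_coeffs, qstep, factor):
    """Divide residual DCT coefficients by the factor, then quantize."""
    return np.round(residual_coeffs / (qstep * factor))


def dequantize_normalized(levels, qstep, factor):
    """Undo the quantization step and the divisive normalization."""
    return levels * qstep * factor
```

Under these assumptions, a textured block receives a factor above 1 and is quantized more coarsely, while a flat block receives a factor below 1 and is quantized more finely, which is the general behaviour one would expect from a perceptually motivated normalization.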