Discretized Gaussian Mixture Hyperprior for Learned Image Compression with Mask Module

Shengkai Wang,Hanli Wang
DOI: https://doi.org/10.1109/icme52920.2022.9859971
2022-01-01
Abstract:Learned image compression approaches have shown great potential with promising results. However, according to the commonly used measurement methods, there still lies a performance gap between learned compression methods and the latest compression standard versatile video coding (VVC), because of the remaining redundancy existing in contemporary algorithms. To obtain a more accurate entropy model for rate estimation, discretized Gaussian mixture hyperprior is proposed in this work to parameterize the distribution of latent codes. In addition, a proposed mask module is exploited to enhance the feature extraction ability of the encoder and adaptively allocate bit rates. In this case, the proposed learned image compression model achieves the state-of-the-art performance among the existing learned compression methods and most compression standards on both Kodak24 and CLIC Mobile Validation datasets. The proposed model also outperforms VVC under several bitrate scenarios.
What problem does this paper attempt to address?