Progressive image compression for Gaussian mixture model quartile intervals

DOI: https://doi.org/10.1007/s10489-024-05577-w
IF: 5.3
2024-06-08
Applied Intelligence
Abstract:In this paper, we proposed a novel deep image coding and decoding model for GMMQI (Gaussian mixture model quartile intervals, GMMQI) and a variable rate bit allocation method to optimize the rate distortion performance and prioritize the transmissions of more significant information. First, to address the problem of bit streams leading to ambiguity during encoding and decoding, we convert the image into a potential tensor, each element of which uses a four-bit parameter dictionary to preserve the parameter bits. Then, the variable rate is calculated using quaternions based on the parameter dictionary, and a hybrid CLM &BAM (Channel latent map & Bit allocation map) approach is designed to assign bits to the potential tensor and encode it, which transforms the problem of finding the optimal encoder-decoder into finding the optimal hyper-parameters in the model and reduces the complexity of the GMMQI model. Finally, a GMMQI approach with variable rate bit allocation is developed in combination with CLM &BAM, to be able to prioritize the transmission of more significant information. The experimental results show that the GMMQI method reaches an advanced level compared to the traditional image compression standards BPG, JPEG2000, and JPEG, and is comparable to the most advanced level compared to the existing deep learning based compression methods.
computer science, artificial intelligence
What problem does this paper attempt to address?