Test-time adaptation for image compression with distribution regularization

Kecheng Chen, Pingping Zhang, Tiexin Qin, Shiqi Wang, Hong Yan, Haoliang Li
2024-10-16
Abstract: Current test- or compression-time adaptation image compression (TTA-IC) approaches, which leverage both latent and decoder refinements as a two-step adaptation scheme, have potentially enhanced the rate-distortion (R-D) performance of learned image compression models on cross-domain compression tasks, e.g., from natural to screen content images. However, compared with the emergence of various decoder refinement variants, the latent refinement, as an inseparable ingredient, is barely tailored to cross-domain scenarios. To this end, we aim to develop an advanced latent refinement method by extending the effective hybrid latent refinement (HLR) method, which is designed for *in-domain* inference improvement but shows noticeable degradation of the rate cost in *cross-domain* tasks. Specifically, we first provide theoretical analyses, in a cue of marginalization approximation from in- to cross-domain scenarios, to uncover that the vanilla HLR suffers from an underlying mismatch between refined Gaussian conditional and hyperprior distributions, leading to deteriorated joint probability approximation of marginal distribution with increased rate consumption. To remedy this issue, we introduce a simple Bayesian approximation-endowed *distribution regularization* to encourage learning a better joint probability approximation in a plug-and-play manner. Extensive experiments on six in- and cross-domain datasets demonstrate that our proposed method not only improves the R-D performance compared with other latent refinement counterparts, but also can be flexibly integrated into existing TTA-IC methods with incremental benefits.
Computer Vision and Pattern Recognition, Multimedia
### What problem does this paper attempt to solve?

This paper addresses the degradation of rate-distortion (R-D) performance in cross-domain image compression tasks. Specifically, the latent refinement used by current test-time adaptation for image compression (TTA-IC) methods works well for in-domain inference, but in cross-domain scenarios (for example, from natural images to screen-content images) a mismatch between the refined Gaussian conditional and hyperprior distributions raises the rate cost and degrades R-D performance.

#### Main problems:

1. **Limitations of existing methods**:
   - Current TTA-IC methods rely on a two-step adaptation scheme that refines both the latents and the decoder. Decoder refinement has many variants, but latent refinement has barely been tailored to cross-domain scenarios.
   - The hybrid latent refinement (HLR) method is designed for in-domain inference. In cross-domain tasks it noticeably increases the rate cost, which hurts R-D performance.
2. **Theoretical analysis**:
   - Using a marginalization approximation from in-domain to cross-domain scenarios, the paper shows why vanilla HLR degrades: the refined Gaussian conditional distribution no longer matches the hyperprior distribution, so the joint probability becomes a poorer approximation of the marginal distribution and rate consumption increases.
3. **Solutions**:
   - The paper proposes a distribution regularization based on a Bayesian approximation. It encourages a better joint probability approximation and thereby improves R-D performance in cross-domain scenarios without modifying model parameters.
   - The regularization is plug-and-play: it improves R-D performance over other latent refinement methods and can be integrated into existing TTA-IC methods.

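
The rate penalty caused by such a mismatch can be illustrated with a toy calculation. The sketch below is not the paper's method or code. It only assumes the common setup in which an integer-quantized latent is coded under a Gaussian conditional whose mean and scale come from the hyperprior branch, so its cost is the negative log2 of the probability mass of its quantization bin. All names and numbers (`refined_latents`, `stale_means`, and so on) are hypothetical.

```python
import math


def gaussian_cdf(x, mean, scale):
    """CDF of a normal distribution N(mean, scale^2) evaluated at x."""
    return 0.5 * (1.0 + math.erf((x - mean) / (scale * math.sqrt(2.0))))


def bits_for_symbol(y_hat, mean, scale, floor=1e-9):
    """Bits needed to code one integer-quantized latent under a Gaussian conditional."""
    # Probability mass of the unit-width quantization bin centered at y_hat.
    mass = gaussian_cdf(y_hat + 0.5, mean, scale) - gaussian_cdf(y_hat - 0.5, mean, scale)
    # The floor avoids log2(0) when the bin lies far in the tail.
    return -math.log2(max(mass, floor))


def total_bits(latents, means, scales):
    """Sum of per-symbol code lengths, assuming independent symbols."""
    return sum(bits_for_symbol(y, m, s) for y, m, s in zip(latents, means, scales))


# Hypothetical latents after refinement on a cross-domain image.
refined_latents = [3.0, -2.0, 5.0, 0.0]
scales = [1.0, 1.0, 1.0, 1.0]

# Case 1: the predicted means track the refined latents.
matched_means = [3.0, -2.0, 5.0, 0.0]
# Case 2: the predicted means do not follow the refinement (a mismatch).
stale_means = [0.0, 0.0, 0.0, 0.0]

matched_rate = total_bits(refined_latents, matched_means, scales)
mismatched_rate = total_bits(refined_latents, stale_means, scales)

print(f"matched: {matched_rate:.2f} bits, mismatched: {mismatched_rate:.2f} bits")
```

In this toy setting the mismatched case costs several times more bits for the same latents, which is the kind of rate increase that a regularizer aligning the two distributions would aim to reduce.
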
### Summary:

The goal of the paper is to develop an advanced latent refinement method that adapts to cross-domain TTA-IC tasks while maintaining consistent R-D gains. By introducing distribution regularization, the paper addresses the R-D degradation that existing latent refinement methods show in cross-domain scenarios, and it offers a plug-and-play component for existing TTA-IC methods.