An Efficient Implicit Neural Representation Image Codec Based on Mixed Autoregressive Model for Low-Complexity Decoding

Xiang Liu,Jiahong Chen,Bin Chen,Zimo Liu,Baoyi An,Shu-Tao Xia,Zhi Wang
2024-06-07
Abstract:Displaying high-quality images on edge devices, such as augmented reality devices, is essential for enhancing the user experience. However, these devices often face power consumption and computing resource limitations, making it challenging to apply many deep learning-based image compression algorithms in this field. Implicit Neural Representation (INR) for image compression is an emerging technology that offers two key benefits compared to cutting-edge autoencoder models: low computational complexity and parameter-free decoding. It also outperforms many traditional and early neural compression methods in terms of quality. In this study, we introduce a new Mixed AutoRegressive Model (MARM) to significantly reduce the decoding time for the current INR codec, along with a new synthesis network to enhance reconstruction quality. MARM includes our proposed AutoRegressive Upsampler (ARU) blocks, which are highly computationally efficient, and ARM from previous work to balance decoding time and reconstruction quality. We also propose enhancing ARU's performance using a checkerboard two-stage decoding strategy. Moreover, the ratio of different modules can be adjusted to maintain a balance between quality and speed. Comprehensive experiments demonstrate that our method significantly improves computational efficiency while preserving image quality. With different parameter settings, our method can achieve over a magnitude acceleration in decoding time without industrial level optimization, or achieve state-of-the-art reconstruction quality compared with other INR codecs. To the best of our knowledge, our method is the first INR-based codec comparable with Hyperprior in both decoding speed and quality while maintaining low complexity.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the balance between decoding speed and reconstruction quality in image compression techniques based on implicit neural representations (INR). Specifically, although existing INR methods perform well in low - complexity decoding, their encoding processes are time - consuming, and there is an irreconcilable contradiction between decoding time and image reconstruction quality. Moreover, although these methods have advantages on edge devices with limited computing resources, their decoding speeds are still slow, which limits their wide adoption in practical applications. To solve these problems, the author introduces a new hybrid autoregressive model (MARM), aiming to significantly reduce the decoding time of current INR codecs while improving the reconstruction quality by introducing a new synthesis network. MARM includes the autoregressive upsampler (ARU) modules proposed by the author, which are computationally very efficient and combine the autoregressive model (ARM) in previous work to balance decoding time and reconstruction quality. In addition, the author also proposes a checkerboard two - stage decoding strategy to enhance the performance of ARU. By adjusting the proportions of different modules, the balance between quality and speed can be maintained. Overall, the goal of this research is to improve the decoding efficiency and image quality of INR - based image codecs while maintaining low computational complexity, making them more suitable for application scenarios of edge devices such as augmented reality devices.