An Efficient Implicit Neural Representation Image Codec Based on Mixed Autoregressive Model for Low-Complexity Decoding

Xiang Liu,Jiahong Chen,Bin Chen,Zimo Liu,Baoyi An,Shu-Tao Xia,Zhi Wang

2024-06-07

Abstract:Displaying high-quality images on edge devices, such as augmented reality devices, is essential for enhancing the user experience. However, these devices often face power consumption and computing resource limitations, making it challenging to apply many deep learning-based image compression algorithms in this field. Implicit Neural Representation (INR) for image compression is an emerging technology that offers two key benefits compared to cutting-edge autoencoder models: low computational complexity and parameter-free decoding. It also outperforms many traditional and early neural compression methods in terms of quality. In this study, we introduce a new Mixed AutoRegressive Model (MARM) to significantly reduce the decoding time for the current INR codec, along with a new synthesis network to enhance reconstruction quality. MARM includes our proposed AutoRegressive Upsampler (ARU) blocks, which are highly computationally efficient, and ARM from previous work to balance decoding time and reconstruction quality. We also propose enhancing ARU's performance using a checkerboard two-stage decoding strategy. Moreover, the ratio of different modules can be adjusted to maintain a balance between quality and speed. Comprehensive experiments demonstrate that our method significantly improves computational efficiency while preserving image quality. With different parameter settings, our method can achieve over a magnitude acceleration in decoding time without industrial level optimization, or achieve state-of-the-art reconstruction quality compared with other INR codecs. To the best of our knowledge, our method is the first INR-based codec comparable with Hyperprior in both decoding speed and quality while maintaining low complexity.

Image and Video Processing,Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The main problem that this paper attempts to solve is to improve the balance between decoding speed and reconstruction quality in image compression techniques based on implicit neural representations (INR). Specifically, although existing INR methods perform well in low - complexity decoding, their encoding processes are time - consuming, and there is an irreconcilable contradiction between decoding time and image reconstruction quality. Moreover, although these methods have advantages on edge devices with limited computing resources, their decoding speeds are still slow, which limits their wide adoption in practical applications. To solve these problems, the author introduces a new hybrid autoregressive model (MARM), aiming to significantly reduce the decoding time of current INR codecs while improving the reconstruction quality by introducing a new synthesis network. MARM includes the autoregressive upsampler (ARU) modules proposed by the author, which are computationally very efficient and combine the autoregressive model (ARM) in previous work to balance decoding time and reconstruction quality. In addition, the author also proposes a checkerboard two - stage decoding strategy to enhance the performance of ARU. By adjusting the proportions of different modules, the balance between quality and speed can be maintained. Overall, the goal of this research is to improve the decoding efficiency and image quality of INR - based image codecs while maintaining low computational complexity, making them more suitable for application scenarios of edge devices such as augmented reality devices.

An Efficient Implicit Neural Representation Image Codec Based on Mixed Autoregressive Model for Low-Complexity Decoding

Hybrid Implicit Neural Image Compression with Subpixel Context Model and Iterative Pruner

RQAT-INR: Improved Implicit Neural Image Compression

Breaking the Barriers of One-to-One Usage of Implicit Neural Representation in Image Compression: A Linear Combination Approach with Performance Guarantees

Exploring the Rate-Distortion-Complexity Optimization in Neural Image Compression

A Slimmable Framework for Practical Neural Video Compression

Releasing the Parameter Latency of Neural Representation for High-Efficiency Video Compression

PNVC: Towards Practical INR-based Video Compression

Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural Representation

Fast Encoding and Decoding for Implicit Video Representation

A New Image Codec Paradigm for Human and Machine Uses

Advancing The Rate-Distortion-Computation Frontier For Neural Image Compression

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

A Unified End-to-End Framework for Efficient Deep Image Compression

HiNeRV: Video Compression with Hierarchical Encoding-based Neural Representation

NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation

A Unified Efficient Deep Image Compression Framework and Its Application on Human-Centric Task

Residual-INR: Communication Efficient On-Device Learning Using Implicit Neural Representation

Dynamic Neural Networks for Adaptive Implicit Image Compression.

Implicit-explicit Integrated Representations for Multi-view Video Compression

I2C: Invertible Continuous Codec for High-Fidelity Variable-Rate Image Compression