A Unified End-to-End Framework for Efficient Deep Image Compression

Jiaheng Liu,Guo Lu,Zhihao Hu,Dong Xu
DOI: https://doi.org/10.48550/arXiv.2002.03370
2020-05-24
Abstract:Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current state-of-the-art learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture model is introduced for the entropy model and improves the accuracy for bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework to further improve the video compression performance. Simultaneously, our EDIC method boosts the coding performance significantly while bringing slightly increased computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms the current state-of-the-art image compression methods and is up to more than 150 times faster in terms of decoding speed when compared with Minnen's method. The proposed framework also successfully improves the performance of the recent deep video compression system DVC. Our code will be released at <a class="link-external link-https" href="https://github.com/liujiaheng/compression" rel="external noopener nofollow">this https URL</a>.
Image and Video Processing,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the efficiency and performance of deep - learning - based image compression methods while reducing their computational costs. Specifically, although the current state - of - the - art learning - based image compression methods have made significant progress in compression performance, their computational costs are very high, which limits the use of these methods in practical applications. To solve this problem, the author proposes a unified framework - Efficient Deep Image Compression (EDIC), which is based on three new technologies: the channel - attention module, the Gaussian mixture model, and the decoder - side enhancement module. 1. **Channel - attention module**: It enhances the corresponding representation ability by using the channel relationships between latent representations, thereby improving the encoding efficiency. 2. **Gaussian mixture model**: It is used for the entropy model to improve the accuracy of bit - rate estimation. Compared with the single - Gaussian model, it can model the distribution of latent representations more accurately. 3. **Decoder - side enhancement module**: It further improves the image compression performance and reduces compression artifacts. In addition, the proposed EDIC method can also be combined with the Deep Video Compression (DVC) framework to further improve the video compression performance. The experimental results show that the proposed method not only outperforms the current state - of - the - art image compression methods in compression performance, but also is more than 150 times faster than Minnen's method in decoding speed, especially when processing images with a resolution of 768×512. These improvements make this framework significantly reduce the computational cost while maintaining high performance, and it is more suitable for practical applications.