Real-Time Adaptive Image Compression

Oren Rippel,Lubomir Bourdev
DOI: https://doi.org/10.48550/arXiv.1705.05823
2017-05-17
Abstract:We present a machine learning-based approach to lossy image compression which outperforms all existing codecs, while running in real-time. Our algorithm typically produces files 2.5 times smaller than JPEG and JPEG 2000, 2 times smaller than WebP, and 1.7 times smaller than BPG on datasets of generic images across all quality levels. At the same time, our codec is designed to be lightweight and deployable: for example, it can encode or decode the Kodak dataset in around 10ms per image on GPU. Our architecture is an autoencoder featuring pyramidal analysis, an adaptive coding module, and regularization of the expected codelength. We also supplement our approach with adversarial training specialized towards use in a compression setting: this enables us to produce visually pleasing reconstructions for very low bitrates.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the deficiencies in performance and real - time performance of existing image compression algorithms. Specifically, the paper proposes an image compression method based on machine learning, aiming to outperform all existing compression codecs (such as JPEG, JPEG 2000, WebP and BPG) while maintaining real - time processing capabilities. This method can not only generate smaller files than existing codecs (usually 2.5 times smaller than JPEG and JPEG 2000, 2 times smaller than WebP, and 1.7 times smaller than BPG), but also can perform encoding and decoding on the GPU with very low latency (about 10 milliseconds per picture). The main contributions of the paper include: 1. **Performance improvement**: By using an auto - encoder architecture, combined with pyramid analysis, an adaptive coding module and regularization of the expected coding length, higher compression efficiency is achieved. 2. **Real - time processing**: A lightweight and efficient model is designed, which can achieve real - time image encoding and decoding on the GPU. 3. **Adversarial training**: Adversarial training specifically for compression scenarios is introduced, enabling visually satisfactory reconstructed images to be produced even at extremely low bit rates. The combination of these techniques enables this method to not only outperform traditional and existing machine - learning - based codecs in compression performance, but also to be efficiently deployed in resource - constrained environments.