Entropy Coding Improvement for Low-complexity Compressive Auto-encoders

Franck Galpin, Muhammet Balcilar, Frédéric Lefebvre, Fabien Racapé, Pierre Hellier
2023-10-04
Abstract: End-to-end image and video compression using auto-encoders (AE) offers appealing new perspectives in terms of rate-distortion gains and applications. While the most complex models are on par with the latest compression standards, such as VVC/H.266, on objective metrics, practical implementation and complexity remain strong issues for real-world applications. In this paper, we propose a practical implementation suitable for realistic applications, leading to a low-complexity model. We demonstrate that some gains can be achieved on top of a state-of-the-art low-complexity AE, even when using a simpler implementation. Improvements include an off-training entropy coding improvement and encoder-side Rate Distortion Optimized Quantization. Results show a 19% improvement in BD-rate over a basic implementation of the fully-factorized model, and a 15.3% improvement compared to the original implementation. The proposed implementation also allows a direct integration of such approaches on a variety of platforms.
Image and Video Processing
What problem does this paper attempt to address?
This paper aims to improve the performance of end-to-end image and video compression based on auto-encoders (AE) while keeping complexity low. Although existing complex models are comparable to the latest compression standards (such as VVC/H.266) on objective metrics, their practical implementation and complexity remain major obstacles in real-world applications. The paper therefore proposes a low-complexity implementation suitable for practical use, and shows that performance can still be improved on top of this simplified implementation. The main contributions of the paper are:

1. **Entropy Coding Improvement**: An entropy coding improvement applied off-training, that is, after the auto-encoder has been trained. A new conditional entropy model based on context switching is learned to code the latent variables more efficiently.
2. **Encoder-Side Rate-Distortion Optimized Quantization (RDOQ)**: A rate-distortion optimization step is introduced at encoding time to reduce the amortization gap. It does not depend on an external framework and requires only the distilled decoder.
3. **Hardware-Friendly Implementation**: The GDN/IGDN activation functions are replaced with ReLU, and the model uses 16-bit integer networks and simplified integer operations, which makes it better suited to hardware implementation.
4. **Performance Verification**: Experimental results show a 19% BD-rate improvement over a basic implementation of the fully-factorized model, and a 15.3% improvement compared with the original implementation. The implementation can also be integrated directly on a variety of platforms.
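The first two contributions can be illustrated with a small Python sketch. It is not the authors' implementation: the probability tables, the context rule (`context_of`), and the use of squared error in the latent domain as the distortion are all assumptions made for illustration, whereas the paper evaluates distortion through the distilled decoder. The sketch only shows the general idea: a context selects which probability table codes the next symbol, and the encoder picks, for each latent, the rounding candidate that minimizes distortion plus `lam` times the estimated rate.

```python
import math

# Hypothetical probability tables, one per context (illustrative values only).
TABLES = {
    0: {-1: 0.05, 0: 0.90, 1: 0.05},  # previous symbol was zero
    1: {-1: 0.30, 0: 0.40, 1: 0.30},  # previous symbol was non-zero
}

def context_of(prev_symbol):
    """Assumed context rule: switch tables on whether the previous symbol is zero."""
    return 0 if prev_symbol == 0 else 1

def bits(symbol, pmf):
    """Ideal code length in bits of `symbol` under the table `pmf`."""
    return -math.log2(pmf.get(symbol, 1e-9))

def rdoq(latents, lam):
    """Greedy RDOQ: for each latent, choose floor or ceil by minimum D + lam * R.

    D is the squared error in the latent domain (a stand-in for the
    decoder-based distortion), and R comes from the context-selected table.
    Returns the quantized symbols and the total estimated bits.
    """
    quantized, total_bits, prev = [], 0.0, 0
    for y in latents:
        pmf = TABLES[context_of(prev)]
        candidates = sorted({math.floor(y), math.ceil(y)})
        q = min(candidates, key=lambda c: (y - c) ** 2 + lam * bits(c, pmf))
        total_bits += bits(q, pmf)
        quantized.append(q)
        prev = q
    return quantized, total_bits

if __name__ == "__main__":
    latents = [0.6, -0.4, 0.9]
    print(rdoq(latents, lam=0.0))  # plain rounding to the nearest integer
    print(rdoq(latents, lam=0.1))  # rate-aware decisions
```

With `lam=0.0` the sketch reduces to nearest-integer rounding. With a positive `lam`, a latent such as 0.6 is pushed to the cheaper symbol 0, trading a small amount of distortion for fewer estimated bits.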
In conclusion, by combining these techniques, the paper improves the rate-distortion performance of auto-encoder-based image and video compression while keeping complexity low, which makes the approach better suited to practical applications.