CAPformer: Compression-Aware Pre-trained Transformer for Low-Light Image Enhancement

Wei Wang, Zhi Jin
2024-07-10
Abstract: Low-Light Image Enhancement (LLIE) has advanced with the surge in phone photography demand, yet many existing methods neglect compression, a crucial concern for resource-constrained phone photography. Most LLIE methods overlook this, hindering their effectiveness. In this study, we investigate the effects of JPEG compression on low-light images and reveal substantial information loss caused by JPEG due to widespread low pixel values in dark areas. Hence, we propose the Compression-Aware Pre-trained Transformer (CAPformer), employing a novel pre-training strategy to learn lossless information from uncompressed low-light images. Additionally, the proposed Brightness-Guided Self-Attention (BGSA) mechanism enhances rational information gathering. Experiments demonstrate the superiority of our approach in mitigating compression effects on LLIE, showcasing its potential for improving LLIE in resource-constrained scenarios.
Computer Vision and Pattern Recognition, Image and Video Processing
What problem does this paper attempt to address?
The paper addresses Low-Light Image Enhancement (LLIE) for JPEG-compressed images. Specifically:

1. **Problems with existing methods**: Most current LLIE methods are optimized for uncompressed images and neglect the impact of JPEG compression on image quality, so they perform poorly on compressed inputs.
2. **Issues caused by JPEG compression**: JPEG compression loses significant information, especially in dark areas, which leads to increased noise, artifacts, and blurred details. These degradations make it difficult for existing LLIE methods to enhance compressed low-light images effectively.
3. **Proposed solutions**:
   - A new network architecture, the **Compression-Aware Pre-trained Transformer (CAPformer)**, uses a Transformer to model long-range dependencies and introduces a Brightness-Guided Self-Attention (BGSA) mechanism that guides the model to ignore low-quality information in extremely dark regions.
   - A novel pre-training strategy learns lossless information from uncompressed low-light images to improve the processing of compressed images.
4. **Experimental results**: The paper reports superior performance for CAPformer on multiple compressed low-light image datasets, supporting its effectiveness in mitigating the impact of JPEG compression.

In summary, the core objective of the paper is to improve the enhancement of JPEG-compressed low-light images in resource-constrained scenarios such as mobile photography.
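The information loss in dark areas described above can be illustrated with a small numerical sketch. This is not the paper's code or experimental setup. It assumes the example luminance quantization table from Annex K of the JPEG standard, a synthetic 8×8 checkerboard texture, and a hypothetical fixed gain of 20 standing in for an enhancement method. It pushes the same texture through one JPEG-style block round trip (DCT, quantization, inverse DCT) at a dark and at a normal exposure.

```python
import numpy as np

# Example luminance quantization table from Annex K of the JPEG standard.
# Assumption: used here only for illustration; real encoders scale it by quality.
Q_LUMA = np.array([
    [16, 11, 10, 16, 24, 40, 51, 61],
    [12, 12, 14, 19, 26, 58, 60, 55],
    [14, 13, 16, 24, 40, 57, 69, 56],
    [14, 17, 22, 29, 51, 87, 80, 62],
    [18, 22, 37, 56, 68, 109, 103, 77],
    [24, 35, 55, 64, 81, 104, 113, 92],
    [49, 64, 78, 87, 103, 121, 120, 101],
    [72, 92, 95, 98, 112, 100, 103, 99],
], dtype=np.float64)


def dct_matrix(n: int = 8) -> np.ndarray:
    """Orthonormal DCT-II basis; rows are frequencies, columns are samples."""
    k = np.arange(n).reshape(-1, 1)
    x = np.arange(n).reshape(1, -1)
    basis = np.sqrt(2.0 / n) * np.cos((2 * x + 1) * k * np.pi / (2 * n))
    basis[0, :] = np.sqrt(1.0 / n)
    return basis


def jpeg_block_roundtrip(block: np.ndarray):
    """Quantize and decode one 8x8 block (entropy coding omitted: it is lossless)."""
    c = dct_matrix(8)
    coeffs = c @ (block.astype(np.float64) - 128.0) @ c.T  # level shift + 2-D DCT
    quantized = np.round(coeffs / Q_LUMA)                   # the lossy step
    decoded = c.T @ (quantized * Q_LUMA) @ c + 128.0        # dequantize + inverse DCT
    return quantized, np.clip(np.round(decoded), 0, 255)


def surviving_ac(quantized: np.ndarray) -> int:
    """Number of non-zero AC coefficients (everything except the DC term)."""
    return int(np.count_nonzero(quantized)) - int(quantized[0, 0] != 0)


# Synthetic fine texture: a checkerboard alternating between two gray levels.
rows, cols = np.indices((8, 8))
dark = np.where((rows + cols) % 2 == 0, 9, 3)  # low-light capture
GAIN = 20                                      # hypothetical enhancement gain
bright = dark * GAIN                           # same texture, well exposed (180 / 60)

dark_q, dark_dec = jpeg_block_roundtrip(dark)
bright_q, bright_dec = jpeg_block_roundtrip(bright)

# Brightening the decoded dark block cannot restore detail that quantization removed.
enhanced_after_jpeg = np.clip(dark_dec * GAIN, 0, 255)

print("AC coefficients kept (dark):  ", surviving_ac(dark_q))
print("AC coefficients kept (bright):", surviving_ac(bright_q))
print("contrast after JPEG then gain:", np.ptp(enhanced_after_jpeg))
print("contrast after gain then JPEG:", np.ptp(bright_dec))
```

In this synthetic case every AC coefficient of the dark block rounds to zero, so the decoded block is flat and no gain applied afterwards can bring the checkerboard back. The same texture at normal exposure keeps part of its high-frequency content. Real images and encoder settings will behave less cleanly, but the mechanism is the same: small pixel differences in dark regions fall below the quantization step.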