AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

Yunlong Lin, Tian Ye, Sixiang Chen, Zhenqi Fu, Yingying Wang, Wenhao Chai, Zhaohu Xing, Lei Zhu, Xinghao Ding
2024-07-23
Abstract: Existing low-light image enhancement (LIE) methods have achieved noteworthy success in solving synthetic distortions, yet they often fall short in practical applications. The limitations arise from two inherent challenges in real-world LIE: 1) the collection of distorted/clean image pairs is often impractical and sometimes even unavailable, and 2) accurately modeling complex degradations presents a non-trivial problem. To overcome them, we propose the Attribute Guidance Diffusion framework (AGLLDiff), a training-free method for effective real-world LIE. Instead of specifically defining the degradation process, AGLLDiff shifts the paradigm and models the desired attributes, such as image exposure, structure and color of normal-light images. These attributes are readily available and impose no assumptions about the degradation process, which guides the diffusion sampling process to a reliable high-quality solution space. Extensive experiments demonstrate that our approach outperforms the current leading unsupervised LIE methods across benchmarks in terms of distortion-based and perceptual-based metrics, and it performs well even in sophisticated wild degradation.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the paper attempts to solve

This paper addresses two major challenges in real-world low-light image enhancement (LIE):

1. **Difficulty in obtaining distorted/clean image pairs**: In practical applications, collecting paired low-light and normal-light images is often impractical and sometimes impossible.
2. **Accurate modeling of complex degradations**: Real-world degradation processes are complex and difficult to model accurately.

Although existing LIE methods have achieved remarkable success on synthetic data, they often fall short in practical applications. This gap stems mainly from the mismatch between synthetic and real-world data and from the complexity of real-world degradations.

To overcome these challenges, the authors propose an unsupervised, training-free method named **AGLLDiff**, which performs real-world low-light image enhancement by guiding a diffusion model. Instead of defining a specific degradation process, AGLLDiff models the desired properties of normal-light images, such as exposure, structure, and color. These properties are easy to obtain and require no assumptions about the degradation process, so they can steer the diffusion sampling process toward a reliable, high-quality solution space.

### Method overview

The core idea of AGLLDiff is to use the properties of normal-light images to guide the diffusion generation process toward a high-quality solution space. The advantages of this method are:

- **No need for prior knowledge of degradation**: It avoids modeling the degradation process, which improves the generality and robustness of the method.
- **High-quality generation**: With property guidance, the generated images perform well in terms of brightness, contrast, and naturalness.

### Property guidance

AGLLDiff guides the generation process through the following three properties:

1. **Exposure guidance**: Uses a spatially varying exposure map to constrain the exposure of the output image. The loss function is:

   \[ L_1 = \| \text{Mean}(\hat{x}_0) - \text{Mean}(E) \|^2_2 \]

   where \( \hat{x}_0 \) is the output image and \( E \) is the spatially varying exposure map.

2. **Structure guidance**: Constrains the structure of the output image by minimizing the Fourier-domain phase error between the degraded image and the output image. The loss function is:

   \[ L_2 = \| P(\hat{x}_0) - P(y_0) \|^2_2 \]

   where \( y_0 \) is the degraded image and \( P(\cdot) \) denotes the phase in the Fourier domain.

3. **Color guidance**: Following Retinex theory, uses the reflectance map to guide the color synthesis process. The loss function is:

   \[ L_3 = \| F(\hat{x}_0) - F(y_0) \|^2_2 \]

   where \( F(\cdot) \) denotes a pre-trained Retinex decomposition network.

The total loss is the weighted sum of the three terms, with weights \( \lambda_1, \lambda_2, \lambda_3 \):

\[ L = \lambda_1 L_1 + \lambda_2 L_2 + \lambda_3 L_3 \]

### Dynamic guidance scheme

To further improve generation quality, AGLLDiff introduces a dynamic guidance scheme consisting of a dynamic guidance scale \( \hat{s} \) and a dynamic number of gradient steps \( \hat{N} \), computed as:

\[ \hat{s} = \frac{\| x_t - x_{t-1} \|_2}{\| \nabla_{\hat{x}_0} L \|_2} \cdot s \]

\[ \hat{N} = \max\left(1, \frac{\| x_t - x_{t-1} \|_2}{\| \nabla_{\hat{x}_0} L \|_2} \cdot N \right) \]

Here \( s \) and \( N \) are the base guidance scale and base number of gradient steps. This scheme lets users adjust the strength of the property guidance as needed, which improves overall controllability.

### Experimental results

The experimental results show that AGLLDiff performs well on multiple benchmark datasets, on both synthetic and real-world data. Specifically, AGLLDiff performs well in terms of PSNR, SSIM
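
The three loss terms and the dynamic guidance formulas given earlier can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: all function names are hypothetical, `toy_reflectance` is a crude stand-in for the pre-trained Retinex decomposition network \( F(\cdot) \), the default weights of 1.0 are placeholders, and the small `eps` terms and the rounding of \( \hat{N} \) to an integer are additions made here for numerical safety. Images are assumed to be float arrays of shape `(H, W, 3)`.

```python
import numpy as np


def exposure_loss(x_hat, exposure_map):
    """L1: squared difference between the mean of the output and the mean of E."""
    return float((x_hat.mean() - exposure_map.mean()) ** 2)


def structure_loss(x_hat, y):
    """L2: squared error between the Fourier phases of output and degraded image."""
    phase_x = np.angle(np.fft.fft2(x_hat, axes=(0, 1)))
    phase_y = np.angle(np.fft.fft2(y, axes=(0, 1)))
    return float(np.sum((phase_x - phase_y) ** 2))


def toy_reflectance(img, eps=1e-6):
    """Hypothetical stand-in for F: divide by a max-channel illumination estimate."""
    illumination = img.max(axis=-1, keepdims=True)
    return img / (illumination + eps)


def color_loss(x_hat, y, reflectance_fn=toy_reflectance):
    """L3: squared error between reflectance maps of output and degraded image."""
    return float(np.sum((reflectance_fn(x_hat) - reflectance_fn(y)) ** 2))


def total_loss(x_hat, y, exposure_map, lambdas=(1.0, 1.0, 1.0)):
    """L = lambda_1 * L1 + lambda_2 * L2 + lambda_3 * L3 (weights are placeholders)."""
    lam1, lam2, lam3 = lambdas
    return (lam1 * exposure_loss(x_hat, exposure_map)
            + lam2 * structure_loss(x_hat, y)
            + lam3 * color_loss(x_hat, y))


def guidance_ratio(x_t, x_prev, grad, eps=1e-12):
    """||x_t - x_{t-1}||_2 / ||grad L||_2; eps (an assumption) avoids division by zero."""
    step_norm = np.linalg.norm((x_t - x_prev).ravel())
    grad_norm = np.linalg.norm(grad.ravel())
    return float(step_norm / (grad_norm + eps))


def dynamic_scale(x_t, x_prev, grad, s):
    """s_hat = ratio * s."""
    return guidance_ratio(x_t, x_prev, grad) * s


def dynamic_steps(x_t, x_prev, grad, n):
    """N_hat = max(1, ratio * N), rounded to an integer here (an assumption)."""
    return max(1, int(round(guidance_ratio(x_t, x_prev, grad) * n)))
```

In a guided sampler, the gradient of `total_loss` with respect to \( \hat{x}_0 \) would be computed with an automatic differentiation framework and applied `dynamic_steps` times with step size `dynamic_scale` at each denoising step; that loop is omitted because the source does not spell it out.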