Abstract: In this paper, we rethink the low-light image enhancement task and propose a physically explainable and generative diffusion model for low-light image enhancement, termed Diff-Retinex. We aim to integrate the advantages of the physical model and the generative network. Furthermore, we hope to supplement and even deduce the information missing in the low-light image through the generative network. Therefore, Diff-Retinex formulates the low-light image enhancement problem as Retinex decomposition and conditional image generation. In the Retinex decomposition, we exploit the strengths of Transformer attention and carefully design a Retinex Transformer decomposition network (TDN) to decompose the image into illumination and reflectance maps. Then, we design multi-path generative diffusion networks to reconstruct the normal-light Retinex probability distribution and address the various degradations in these components respectively, including dark illumination, noise, color deviation, loss of scene contents, etc. Owing to the generative diffusion model, Diff-Retinex puts the restoration of subtle low-light detail into practice. Extensive experiments conducted on real-world low-light datasets qualitatively and quantitatively demonstrate the effectiveness, superiority, and generalization of the proposed method.
What problem does this paper attempt to address?
This paper addresses the degradations found in images captured under low-light conditions: uncertain noise, low contrast, color deviation, and, most difficult of all, the loss of scene structure. These degradations harm visual quality and also reduce the amount of information the image carries. Existing low-light image enhancement methods handle them only partially. Traditional methods usually rely on image priors or simple physical models, so they lack generalization ability and robustness. Deep-learning-based methods can learn complex mappings from low-light to normal-light images, but they do not explicitly define or specifically target individual degradations, which shows up as uneven illumination, poor robustness to noise, and similar problems.
For this reason, the paper proposes a new method, Diff-Retinex, which combines the advantages of physical models and generative networks and rethinks the low-light image enhancement task through generative diffusion models. Specifically, Diff-Retinex splits the low-light image enhancement problem into two parts: Retinex decomposition and conditional image generation. For the Retinex decomposition, the paper designs a Retinex Transformer decomposition network (TDN) that uses the Transformer attention mechanism to decompose the image into an illumination map and a reflectance map. Multi-path generative diffusion networks are then designed to reconstruct the normal-light Retinex probability distribution and to address the degradations in each component separately, including dark illumination, noise, color deviation, and loss of scene content.
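To make the decomposition step concrete, the following is a minimal sketch of the classical Retinex model, in which an image is treated as the element-wise product of a reflectance map and an illumination map. It is not the paper's TDN: the function names `naive_retinex_decompose` and `recompose` are hypothetical, the illumination is estimated with a simple hand-crafted rule (the per-pixel maximum over color channels), and the gamma adjustment at the end is only a crude stand-in for the learned diffusion-based restoration.

```python
import numpy as np


def naive_retinex_decompose(image, eps=1e-4):
    """Split an RGB image with values in [0, 1] into (reflectance, illumination).

    Assumption: illumination is the per-pixel maximum over the color channels.
    This is a hand-crafted placeholder for the learned TDN described above.
    """
    illumination = np.max(image, axis=2, keepdims=True)  # shape H x W x 1
    illumination = np.maximum(illumination, eps)         # avoid division by zero
    reflectance = image / illumination                   # shape H x W x 3
    return reflectance, illumination


def recompose(reflectance, illumination):
    """Retinex model: image = reflectance * illumination (element-wise)."""
    return reflectance * illumination


# Synthetic dark image, used only for illustration.
rng = np.random.default_rng(0)
low_light = rng.uniform(0.0, 0.2, size=(4, 4, 3))

R, L = naive_retinex_decompose(low_light)
reconstructed = recompose(R, L)

# Crude stand-in for the restoration stage: brighten the illumination map
# with a gamma curve and recompose. The paper uses diffusion networks here.
gamma = 0.45
enhanced = recompose(R, L ** gamma)
```

Under this model, each component can be restored separately before recomposition, which is the property the two-stage design relies on.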
In short, Diff-Retinex aims not only to restore the details in low-light images but also to infer the information they are missing, thereby achieving higher-quality image enhancement.