Abstract:Deep generative models have garnered significant attention in low-level vision tasks due to their generative capabilities. Among them, diffusion model-based solutions, characterized by a forward diffusion process and a reverse denoising process, have emerged as widely acclaimed for their ability to produce samples of superior quality and diversity. This ensures the generation of visually compelling results with intricate texture information. Despite their remarkable success, a noticeable gap exists in a comprehensive survey that amalgamates these pioneering diffusion model-based works and organizes the corresponding threads. This paper proposes the comprehensive review of diffusion model-based techniques. We present three generic diffusion modeling frameworks and explore their correlations with other deep generative models, establishing the theoretical foundation. Following this, we introduce a multi-perspective categorization of diffusion models, considering both the underlying framework and the target task. Additionally, we summarize extended diffusion models applied in other tasks, including medical, remote sensing, and video scenarios. Moreover, we provide an overview of commonly used benchmarks and evaluation metrics. We conduct a thorough evaluation, encompassing both performance and efficiency, of diffusion model-based techniques in three prominent tasks. Finally, we elucidate the limitations of current diffusion models and propose seven intriguing directions for future research. This comprehensive examination aims to facilitate a profound understanding of the landscape surrounding denoising diffusion models in the context of low-level vision tasks. A curated list of diffusion model-based techniques in over 20 low-level vision tasks can be found at <a class="link-external link-https" href="https://github.com/ChunmingHe/awesome-diffusion-models-in-low-level-vision" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

### What problem does this paper attempt to solve? This paper primarily addresses the following core issues: 1. **Review of Diffusion Models in Low-Level Vision Tasks**: - Despite the significant achievements of methods based on Diffusion Models (DMs) in low-level vision tasks, there is a lack of a comprehensive and in-depth review to integrate these methods and organize the corresponding research threads. - The paper aims to fill this gap by providing the first comprehensive review of diffusion models for low-level vision tasks, covering both theoretical and practical contributions. 2. **Theoretical Foundation and Framework**: - Three general diffusion modeling frameworks are proposed, and their connections with other commonly used deep generative models are explored, establishing the theoretical foundation for subsequent analysis. 3. **Multi-Perspective Classification**: - Diffusion models applied to low-level vision tasks are classified from multiple angles, considering the underlying frameworks and target tasks, facilitating understanding and enhancement of existing technologies. 4. **Extended Application Scope**: - The application of diffusion models in other low-level vision tasks, including medical image processing, remote sensing data processing, and video processing, is summarized, providing a broad overview of applications. 5. **Benchmarking and Evaluation**: - Commonly used benchmark datasets and evaluation metrics are summarized, and a detailed performance and efficiency evaluation of diffusion model methods on three typical tasks (super-resolution, image deblurring, and low-light enhancement) is provided. 6. **Future Research Directions**: - The current limitations of diffusion models in low-level vision tasks are elucidated, and seven potential future research directions are proposed to promote further development in this field. Through the above work, this paper aims to enhance the understanding of diffusion models in low-level vision tasks within the computer vision community and stimulate interest in related research.

Diffusion Models in Low-Level Vision: A Survey