Diffusion Models: A Comprehensive Survey of Methods and Applications

Ling Yang,Zhilong Zhang,Yang Song,Shenda Hong,Runsheng Xu,Yue Zhao,Wentao Zhang,Bin Cui,Ming-Hsuan Yang

2024-06-24

Abstract:Diffusion models have emerged as a powerful new family of deep generative models with record-breaking performance in many applications, including image synthesis, video generation, and molecule design. In this survey, we provide an overview of the rapidly expanding body of work on diffusion models, categorizing the research into three key areas: efficient sampling, improved likelihood estimation, and handling data with special structures. We also discuss the potential for combining diffusion models with other generative models for enhanced results. We further review the wide-ranging applications of diffusion models in fields spanning from computer vision, natural language generation, temporal data modeling, to interdisciplinary applications in other scientific disciplines. This survey aims to provide a contextualized, in-depth look at the state of diffusion models, identifying the key areas of focus and pointing to potential areas for further exploration. Github: <a class="link-external link-https" href="https://github.com/YangLing0818/Diffusion-Models-Papers-Survey-Taxonomy" rel="external noopener nofollow">this https URL</a>.

Machine Learning,Artificial Intelligence,Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper attempts to address the issue of providing a comprehensive review of Diffusion Models, an emerging and powerful family of deep generative models. Specifically, the paper aims to: 1. **Overview the research progress of diffusion models**: The paper categorizes the research in the field of diffusion models into three key areas: efficient sampling, improved likelihood estimation, and handling data with special structures (such as discrete data, data with invariant structures, and data on manifolds). 2. **Explore the combination with other generative models**: The paper also discusses how to combine diffusion models with other generative models (such as Variational Autoencoders, Generative Adversarial Networks, Normalizing Flows, Autoregressive Models, and Energy-Based Models) to enhance performance. 3. **Summarize application areas**: The paper reviews the wide applications of diffusion models in various fields, including computer vision, natural language processing, time series modeling, multimodal learning, robust learning, and interdisciplinary applications (such as drug design, material design, and medical image reconstruction). 4. **Identify future research directions**: The paper finally looks forward to the future research directions in the field of diffusion models, including revisiting assumptions, theoretical understanding, latent representations, AI-generated content (AIGC), and foundational diffusion models. Through these contents, the paper aims to provide a comprehensive introductory guide for new researchers entering the field, while also offering a broader perspective for experienced researchers.

Diffusion Models: A Comprehensive Survey of Methods and Applications

Diffusion Models: A Comprehensive Survey of Methods and Applications

Efficient Diffusion Models: A Comprehensive Survey from Principles to Practices

A Survey on Generative Diffusion Model

A Survey on Generative Diffusion Models

A Comprehensive Survey on Diffusion Models and Their Applications

An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization

Diffusion Models in NLP: A Survey

A Survey on Graph Diffusion Models: Generative AI in Science for Molecule, Protein and Material

Generative Diffusion Models on Graphs: Methods and Applications

Diffusion Models in Low-Level Vision: A Survey

A Survey of Diffusion Models in Natural Language Processing

A Survey on Video Diffusion Models

A Survey of Multimodal Controllable Diffusion Models

Diffusion Models and Representation Learning: A Survey

Diffusion-based Graph Generative Methods

Diffusion models in text generation: a survey

A Survey on Diffusion Models for Recommender Systems

Video Diffusion Models: A Survey