Abstract:Diffusion models have emerged from various theoretical and methodological perspectives, each offering unique insights into their underlying principles. In this work, we provide an overview of the most prominent approaches, drawing attention to their striking analogies -- namely, how seemingly diverse methodologies converge to a similar mathematical formulation of the core problem. While our ultimate goal is to understand these models in the context of graphs, we begin by conducting experiments in a simpler setting to build foundational insights. Through an empirical investigation of different diffusion and sampling techniques, we explore three critical questions: (1) What role does noise play in these models? (2) How significantly does the choice of the sampling method affect outcomes? (3) What function is the neural network approximating, and is high complexity necessary for optimal performance? Our findings aim to enhance the understanding of diffusion models and in the long run their application in graph machine learning.

What problem does this paper attempt to address?

The problems that this paper attempts to solve mainly focus on understanding the core principles of diffusion models and their applications on graph - structured data. Specifically, the paper experimentally explores the following three key issues: 1. **What is the role of noise in these models?** - The paper investigates the impact of noise on the performance of diffusion models through different experimental setups. The authors find that noise helps cover low - density regions, thereby enhancing the model's representational ability in these regions. However, excessive noise will disrupt the signal, so the amount of noise is a crucial factor. In addition, the authors also explore whether a deterministic diffusion process can replace random noise, and the results show that a deterministic diffusion process can also achieve good performance. 2. **How much does the choice of sampling method affect the results?** - The paper compares three different sampling methods: single - step sampling, full - step sampling, and noise sampling. The experimental results show that the noise sampling method performs best in most cases because it can better utilize the information of the forward diffusion process to more accurately predict the state of data points. While single - step sampling and full - step sampling show limitations in low - density regions and high - density regions respectively. 3. **What function is approximated by the neural network in these models? Is complexity necessary?** - The authors study the target functions approximated by the neural network in different tasks and explore the impact of model complexity on performance. Experiments show that even a simple network structure can reasonably approximate the data distribution, especially in high - density regions. This indicates that the model's complexity is not the only factor determining performance, and the key lies in how to effectively utilize the information of the forward diffusion process. Overall, this paper aims to enhance the understanding of diffusion models through systematic experiments and analysis, and provide theoretical support and practical guidance for their applications in graph machine learning.

Towards understanding Diffusion Models (on Graphs)

A Survey on Graph Diffusion Models: Generative AI in Science for Molecule, Protein and Material

Generative Diffusion Models on Graphs: Methods and Applications

An Overview of Diffusion Models: Applications, Guided Generation, Statistical Rates and Optimization

Diffusion Models: A Comprehensive Survey of Methods and Applications

A Comprehensive Survey on Diffusion Models and Their Applications

Unifying Diffusion Models on Networks and Their Influence Maximisation.

Sifting through the Noise: A Survey of Diffusion Probabilistic Models and Their Applications to Biomolecules

Diffusion Model for Graph Inverse Problems: Towards Effective Source Localization on Complex Networks.

A Geometric Perspective on Diffusion Models

Lecture Notes in Probabilistic Diffusion Models

Directional diffusion models for graph representation learning

Capturing Dynamics of Information Diffusion in SNS: A Survey of Methodology and Techniques

Where to Diffuse, How to Diffuse, and How to Get Back: Automated Learning for Multivariate Diffusions

Demystifying Variational Diffusion Models

Diffusion-based Graph Generative Methods

Diffusion Models for Time Series Applications: A Survey

Discrete-state Continuous-time Diffusion for Graph Generation

Learning Diffusions under Uncertainty