Discrete-state Continuous-time Diffusion for Graph Generation

Zhe Xu,Ruizhong Qiu,Yuzhong Chen,Huiyuan Chen,Xiran Fan,Menghai Pan,Zhichen Zeng,Mahashweta Das,Hanghang Tong
2024-05-19
Abstract:Graph is a prevalent discrete data structure, whose generation has wide applications such as drug discovery and circuit design. Diffusion generative models, as an emerging research focus, have been applied to graph generation tasks. Overall, according to the space of states and time steps, diffusion generative models can be categorized into discrete-/continuous-state discrete-/continuous-time fashions. In this paper, we formulate the graph diffusion generation in a discrete-state continuous-time setting, which has never been studied in previous graph diffusion models. The rationale of such a formulation is to preserve the discrete nature of graph-structured data and meanwhile provide flexible sampling trade-offs between sample quality and efficiency. Analysis shows that our training objective is closely related to generation quality, and our proposed generation framework enjoys ideal invariant/equivariant properties concerning the permutation of node ordering. Our proposed model shows competitive empirical performance against state-of-the-art graph generation solutions on various benchmarks and, at the same time, can flexibly trade off the generation quality and efficiency in the sampling phase.
Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of developing a novel discrete-state continuous-time diffusion model for graph generation tasks. Specifically, existing graph diffusion generation models often use a continuous state space approach when handling graph-structured data, which deviates from the discrete nature of graph-structured data and requires additional hyperparameters to handle discretization during the generation process. Moreover, existing models like DiGress, while maintaining the discrete characteristics of graph data, have a discrete-time setting that makes the generation process less flexible. To address these issues, the paper proposes a new discrete-state continuous-time graph diffusion generation model (DISCO), aiming to: 1. Preserve the discrete nature of graph data. 2. Provide flexible sampling trade-offs to balance between generation quality and efficiency. 3. Demonstrate good invariance and equivariance properties in theoretical analysis. 4. Achieve performance comparable to or even better than current state-of-the-art graph generation methods. With these improvements, DISCO is able to exhibit superior performance in multiple benchmarks and can flexibly adjust between generation quality and efficiency according to actual needs.