Challenges of Generating Structurally Diverse Graphs

Fedor Velikonivtsev,Mikhail Mironov,Liudmila Prokhorenkova
2024-09-27
Abstract:For many graph-related problems, it can be essential to have a set of structurally diverse graphs. For instance, such graphs can be used for testing graph algorithms or their neural approximations. However, to the best of our knowledge, the problem of generating structurally diverse graphs has not been explored in the literature. In this paper, we fill this gap. First, we discuss how to define diversity for a set of graphs, why this task is non-trivial, and how one can choose a proper diversity measure. Then, for a given diversity measure, we propose and compare several algorithms optimizing it: we consider approaches based on standard random graph models, local graph optimization, genetic algorithms, and neural generative models. We show that it is possible to significantly improve diversity over basic random graph generators. Additionally, our analysis of generated graphs allows us to better understand the properties of graph distances: depending on which diversity measure is used for optimization, the obtained graphs may possess very different structural properties which gives insights about the sensitivity of the graph distance underlying the diversity measure.
Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to ensure the structural diversity of these graphs when generating graph datasets. Specifically, the author points out that in many graph - related tasks, having a set of structurally diverse graphs is very important. For example, such graphs can be used to test graph algorithms or their neural network approximations. However, as far as the author knows, the problem of generating structurally diverse graphs has not been fully explored in the literature. Therefore, this paper fills this gap and mainly studies how to define the diversity of graph sets, why this is a non - trivial task, and how to select appropriate diversity measurement methods. Then, for a given diversity measure, the author proposes and compares several optimization algorithms, including methods based on standard random graph models, local graph optimization, genetic algorithms, and methods based on neural generative models. Research shows that compared with basic random graph generators, the diversity of graphs can be significantly improved. In addition, the analysis of the generated graphs also enables the author to better understand the properties of graph distances, that is, according to different diversity measures used for optimization, the obtained graphs may have very different structural properties, which provides insights into the sensitivity of graph distances.