GraphPiece: Efficiently Generating High-Quality Molecular Graph with Substructures

Xiangwei Kong, Zhixing Tan, Yang Liu
2021-01-01
Abstract: Molecule generation, which requires generating valid molecules with desired properties, is a fundamental but challenging task. Recent years have witnessed the rapid development of atom-level auto-regressive models, which usually construct graphs following sequential actions of adding atom-level nodes and edges. However, these atom-level models ignore high-frequency substructures, which not only capture the regularities of atomic combination in molecules but are also often related to desired chemical properties, and therefore may be sub-optimal for generating high-quality molecules. In this paper, we propose a method to automatically discover such common substructures, which we call graph pieces, from given molecular graphs. We also present a graph piece variational autoencoder (GP-VAE) for generating molecular graphs based on graph pieces. Experiments show that our GP-VAE models not only achieve better performance than the state-of-the-art baseline for distribution-learning, property optimization, and constrained property optimization tasks but are also computationally efficient.