DP-GEN: A concurrent learning platform for the generation of reliable deep learning based potential energy models
Yuzhi Zhang,Haidi Wang,Weijie Chen,Jinzhe Zeng,Linfeng Zhang,Han Wang,Weinan E
DOI: https://doi.org/10.1016/j.cpc.2020.107206
IF: 4.717
2020-08-01
Computer Physics Communications
Abstract:<p>In recent years, promising deep learning based interatomic potential energy surface (PES) models have been proposed that can potentially allow us to perform molecular dynamics simulations for large scale systems with quantum accuracy. However, making these models truly reliable and practically useful is still a very non-trivial task. A key component in this task is the generation of datasets used in model training. In this paper, we introduce the Deep Potential GENerator (DP-GEN), an open-source software platform that implements the recently proposed "on-the-fly" learning procedure (Zhang et al. 2019) and is capable of generating uniformly accurate deep learning based PES models in a way that minimizes human intervention and the computational cost for data generation and model training. DP-GEN automatically and iteratively performs three steps: exploration, labeling, and training. It supports various popular packages for these three steps: LAMMPS for exploration, Quantum Espresso, VASP, CP2K, etc. for labeling, and DeePMD-kit for training. It also allows automatic job submission and result collection on different types of machines, such as high performance clusters and cloud machines, and is adaptive to different job management tools, including Slurm, PBS, and LSF. As a concrete example, we illustrate the details of the process for generating a general-purpose PES model for Cu using DP-GEN.</p><h3 class="u-h4 u-margin-m-top u-margin-xs-bottom">Program summary</h3><p><em>Program Title:</em> DP-GEN</p><p><em>Program Files doi:</em> <a href="https://dx.doi.org/10.17632/sxybkgc5xc.1">http://dx.doi.org/10.17632/sxybkgc5xc.1</a></p><p><em>Licensing provisions:</em> LGPL</p><p><em>Programming language:</em> Python</p><p><em>Nature of problem:</em> Generating reliable deep learning based potential energy models with minimal human intervention and computational cost.</p><p><em>Solution method:</em> The concurrent learning scheme is implemented. Supports for sampling configuration space with LAMMPS, generating <em>ab initio</em> data with Quantum Espresso, VASP, CP2K and training potential models with DeePMD-kit are provided. Supports for different machines including workstations, high performance clusters and cloud machines are provided. Supports for job management tools including Slurm, PBS, LSF are provided.</p>
physics, mathematical,computer science, interdisciplinary applications