Abstract:Atomic simulations based on quantum mechanics (QM) calculations have entered into the tool box of chemists over the past few decades, facilitating an understanding of a wide range of chemistry problems, from structure characterization to reactivity determination. Due to the poor scaling and high computational cost intrinsic to QM calculations, one has to either sacrifice accuracy or time when performing large-scale atomic simulations. The battle to find a better compromise between accuracy and speed has been central to the development of new theoretical methods.The recent advances of machine-learning (ML)-based large-scale atomic simulations has shown great promise to the benefit of many branches of chemistry. Instead of solving the Schrödinger equation directly, ML-based simulations rely on a large data set of accurate potential energy surfaces (PESs) and complex numerical models to predict the total energy. These simulations feature both a high speed and a high accuracy for computing large systems. Due to the lack of a physical foundation in numerical models, ML models are often frustrated in their predictivity and robustness, which are key to applications. Focusing on these concerns, here we overview the recent advances in ML methodologies for atomic simulations on three key aspects. Namely, the generation of a representative data set, the extensity of ML models, and the continuity of data representation. While global optimization methods are the natural choice for building a representative data set, the stochastic surface walking method is shown to provide the desired PES sampling for both minima and transition regions on the PES. The current ML models generally utilize local geometrical descriptors as an input and consider the total energy as the sum of atomic energies. There are many flavors of data descriptors and ML models, but the applications for material and reaction predictions are still limited, not least because of the difficulty to train the associated vast global data sets. We show that our recently designed power-type structure descriptors together with a feed-forward neural network (NN) model are compatible with highly complex global PES data, which has led to a large family of global NN (G-NN) potentials.Two recent applications of G-NN potentials in material and reaction simulations are selected to illustrate how ML-based atomic simulations can help the discovery of new materials and reactions.This article has not yet been cited by other publications.

Active learning meets metadynamics: Automated workflow for reactive machine learning potentials

A transferable active-learning strategy for reactive molecular force fields

Modeling Chemical Processes in Explicit Solvents with Machine Learning Potentials

Developing General Reactive Element-Based Machine Learning Potentials as the Main Computational Engine for Heterogeneous Catalysis

An Automated Pynta-based Curriculum for ML-Accelerated Calculation of Transition States

Using metadynamics to build neural network potentials for reactive events: the case of urea decomposition in water

Exploring the frontiers of condensed-phase chemistry with a general reactive machine learning potential

Stable and Accurate Atomistic Simulations of Flexible Molecules using Conformationally Generalisable Machine Learned Potentials

Data-efficient modeling of catalytic reactions via enhanced sampling and on-the-fly learning of machine learning potentials

Using machine learning to go beyond potential energy surface benchmarking for chemical reactivity

Machine Learning of Reactive Potentials

Beyond potential energy surface benchmarking: a complete application of machine learning to chemical reactivity

Active learning of reactive Bayesian force fields: Application to heterogeneous hydrogen-platinum catalysis dynamics

Charting electronic-state manifolds across molecules with multi-state learning and gap-driven dynamics via efficient and robust active learning

Considerations in the use of ML interaction potentials for free energy calculations

ArcaNN: automated enhanced sampling generation of training sets for chemically reactive machine learning interatomic potentials

Learning reduced kinetic Monte Carlo models of complex chemistry from molecular dynamics

Large-Scale Atomic Simulation via Machine Learning Potentials Constructed by Global Potential Energy Surface Exploration

Machine Learning Potentials for Heterogeneous Catalysis

Transformative Applications of Machine Learning for Chemical Reactions

Refining Potential Energy Surface through Dynamical Properties via Differentiable Molecular Simulation