Abstract:The development of data-informed predictive models for dynamical systems is of widespread interest in many disciplines. We present a unifying framework for blending mechanistic and machine-learning approaches to identify dynamical systems from noisily and partially observed data. We compare pure data-driven learning with hybrid models which incorporate imperfect domain knowledge, referring to the discrepancy between an assumed truth model and the imperfect mechanistic model as model error. Our formulation is agnostic to the chosen machine learning model, is presented in both continuous- and discrete-time settings, and is compatible both with model errors that exhibit substantial memory and errors that are memoryless. First, we study memoryless linear (w.r.t. parametric-dependence) model error from a learning theory perspective, defining excess risk and generalization error. For ergodic continuous-time systems, we prove that both excess risk and generalization error are bounded above by terms that diminish with the square-root of T T , the time-interval over which training data is specified. Secondly, we study scenarios that benefit from modeling with memory, proving universal approximation theorems for two classes of continuous-time recurrent neural networks (RNNs): both can learn memory-dependent model error, assuming that it is governed by a finite-dimensional hidden variable and that, together, the observed and hidden variables form a continuous-time Markovian system. In addition, we connect one class of RNNs to reservoir computing, thereby relating learning of memory-dependent error to recent work on supervised learning between Banach spaces using random features. Numerical results are presented (Lorenz ’63, Lorenz ’96 Multiscale systems) to compare purely data-driven and hybrid approaches, finding hybrid methods less datahungry and more parametrically efficient. We also find that, while a continuous-time framing allows for robustness to irregular sampling and desirable domain- interpretability, a discrete-time framing can provide similar or better predictive performance, especially when data are undersampled and the vector field defining the true dynamics cannot be identified. Finally, we demonstrate numerically how data assimilation can be leveraged to learn hidden dynamics from noisy, partially-observed data, and illustrate challenges in representing memory by this approach, and in the training of such models.

Machine Learning Memory Kernels as Closure for Non-Markovian Stochastic Processes

Machine Learning Memory Kernels as Closure for Non-Markovian Stochastic Processes

A deep learning approach to the measurement of long-lived memory kernels from Generalised Langevin Dynamics

Data-driven learning of the generalized Langevin equation with state-dependent memory

Dynamics of micro and nanoscale systems in the weak-memory regime: A mathematical framework beyond the Markov approximation

Machine learning stochastic differential equations for the evolution of order parameters of classical many-body systems in and out of equilibrium

Learning Moment Closure in Reaction-Diffusion Systems with Spatial Dynamic Boltzmann Distributions

On the integration of Physics-Based Machine Learning with hierarchical Bayesian modeling techniques

The generalized Langevin equation with power-law memory in a nonlinear potential well

Accurate Memory Kernel Extraction from Discretized Time Series Data

Memory Corrections to Markovian Langevin Dynamics

Machine learning in and out of equilibrium

Dynamics of supercooled liquids from static averaged quantities using machine learning

Learning Memory Kernels in Generalized Langevin Equations

Anomalous Polymer Dynamics Is Non-Markovian: Memory Effects and The Generalized Langevin Equation Formulation

Nonlinear stochastic modeling with Langevin regression

A framework for machine learning of model error in dynamical systems

Learning about learning by many-body systems

Learning fast, accurate, and stable closures of a kinetic theory of an active fluid