
Integrators

notebook

AI4S

Deep Learning

python

Molecular Dynamics


hmj

Linfeng Zhang

Published 2023-09-24

Recommended image: Third-party software:dmff:0.2.0-notebook

Recommended machine type: c32_m64_cpu


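This document introduces commonly used integrators for ordinary differential equations (ODEs) and stochastic differential equations (SDEs). It borrows and reuses much of the content from last year's demo list, and special thanks go to last year's authors! The document is written mainly in English (because the author is not sure about the Chinese equivalents of many English terms). Feedback is very welcome.

Authors: 花勐健 and the authors of last year's demo list

Reviewed by:

1. Import dependencies

This document uses the image dmff_0.2.0-notebook. First we pull the latest DMFF code and install a few required dependencies.

You can verify that dmff is installed correctly by running import dmff. On CPU nodes you may see some warning messages, which is normal.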

! if [ ! -e DMFF ];then git clone https://gitee.com/deepmodeling/DMFF.git;fi
! git config --global --add safe.directory `pwd`/DMFF
! cd DMFF && git checkout devel
! pip install matplotlib
#! pip install setuptools_scm mdtraj
#! cd DMFF && python3 setup.py install

import sys
import os
import shutil

# Make the cloned DMFF package and the tutorial helpers importable
sys.path.insert(0, os.path.join(os.getcwd(), "DMFF"))
sys.path.append(os.path.join(os.getcwd(), "DMFF", "examples", "tutorial_utils"))

# Refresh the local copy of the tutorial data
if os.path.isdir("data"):
    shutil.rmtree("data")
shutil.copytree(os.path.join(os.getcwd(), "DMFF", "examples", "tutorial_utils", "data"), "data")

import dmff
import numpy as np
import matplotlib.pyplot as plt
import jax
import jax.numpy as jnp
from tqdm import tqdm, trange
import openmm.app as app
import openmm.unit as unit
import tutorial_utils as utils
from tutorial_utils import State, BaseIntegrator, XYZWriter, init_state_from_PDB

dmff.settings.update_jax_precision("float")

2023-09-27 21:26:45.061163: W external/org_tensorflow/tensorflow/stream_executor/platform/default/dso_loader.cc:64] Could not load dynamic library 'libcuda.so.1'; dlerror: /usr/lib/x86_64-linux-gnu/libcuda.so.1: file too short; LD_LIBRARY_PATH: /usr/local/nvidia/lib:/usr/local/nvidia/lib64
2023-09-27 21:26:45.061950: W external/org_tensorflow/tensorflow/stream_executor/cuda/cuda_driver.cc:263] failed call to cuInit: UNKNOWN ERROR (303)
WARNING:absl:No GPU/TPU found, falling back to CPU. (Set TF_CPP_MIN_LOG_LEVEL=0 and rerun for more info.)


2. ODE Integrators

2.1 Euler's method

Many commonly seen deterministic systems are governed by Newton's second law. An object's position and velocity as functions of time are described by the following ordinary differential equations: $\frac{d}{d t} r_{t} = v_{t}$ $\frac{d}{d t} v_{t} = a (t) = F (t) / m$ where $r_{t}$ denotes the position of the object, $v_{t}$ is its time-dependent velocity, and $a (t) = F (t) / m$ is the acceleration given by Newton's second law with a time-dependent force $F (t)$ and a mass $m$ .

Suppose we have a uniform grid in time $t_{0}, t_{1}, \dots, t_{n}$ with mesh width (time step size) $Δ t$ . Euler's method is the simplest way to integrate such ODE systems, and the discretization is as follows:

$r_{k + 1} = r_{k} + v_{k} Δ t$ $v_{k + 1} = v_{k} + a (t_{k}) Δ t$

where $r_{k}, v_{k}$ are the position and velocity at time $t_{k}$ .

With a Taylor expansion, we can show that each step of this discretization introduces a local truncation error of $O (Δ t^{2})$ , which accumulates to a global error of $O (Δ t)$ over a fixed time interval.

Moreover, it is worth noting that Euler's method and, more generally, all explicit ODE integrators are only conditionally stable: the time step must be small enough for the numerical solution to remain bounded. Here is a simple example that demonstrates the issue of numerical stability. Suppose we have the ODE:

$\frac{d}{d t} y_{t} = c y_{t}$

where $c < 0$ is a constant. The analytical solution of this ODE is $y_{t} = \exp (c t) y_{0}$ , where $y_{0}$ is the initial condition. The numerical solution given by Euler's method is $y_{k + 1} = y_{k} + c Δ t y_{k} = (1 + c Δ t) y_{k}$ . It follows that $y_{n} = (1 + c Δ t)^{n} y_{0}$ . Since $c < 0$ , $1 + c Δ t < 1$ . But if $Δ t$ is so large that $1 + c Δ t < - 1$ (i.e. $Δ t > 2 / ∣ c ∣$ ), then $∣ y_{n} ∣$ grows geometrically and the numerical solution blows up, even though the exact solution decays. The implicit Euler method avoids this issue. Let's take the same ODE $\frac{d}{d t} y_{t} = c y_{t}$ as the example. The discretization of the implicit Euler method is also very simple:

$y_{k + 1} = y_{k} + c Δ t y_{k + 1}$

The unknown $y_{k + 1}$ appears on the right-hand side, but for this linear ODE we can solve for it explicitly:

$y_{k + 1} = (1 - c Δ t)^{- 1} y_{k}$

We observe that the numerical stability is no longer a problem for the implicit Euler's method because $(1 - c Δ t)^{- 1} \in (0, 1)$ as long as $Δ t > 0$ .
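The stability argument above can be checked numerically. The following is a minimal sketch in plain Python; the function names and the values of `c`, `dt_large`, and `dt_small` are illustrative assumptions and are not part of the DMFF tutorial utilities.

```python
def explicit_euler(c, dt, y0, n_steps):
    """Explicit Euler for dy/dt = c*y: y_{k+1} = (1 + c*dt) * y_k."""
    y = y0
    for _ in range(n_steps):
        y = (1.0 + c * dt) * y
    return y


def implicit_euler(c, dt, y0, n_steps):
    """Implicit Euler for dy/dt = c*y: y_{k+1} = y_k / (1 - c*dt)."""
    y = y0
    for _ in range(n_steps):
        y = y / (1.0 - c * dt)
    return y


# Illustrative parameters (assumptions, chosen only for this demonstration)
c, y0, n_steps = -10.0, 1.0, 50
dt_large = 0.25   # 1 + c*dt = -1.5, so explicit Euler is unstable
dt_small = 0.05   # 1 + c*dt =  0.5, so explicit Euler is stable

y_exp_large = explicit_euler(c, dt_large, y0, n_steps)
y_imp_large = implicit_euler(c, dt_large, y0, n_steps)
y_exp_small = explicit_euler(c, dt_small, y0, n_steps)

print("explicit, large dt:", y_exp_large)
print("implicit, large dt:", y_imp_large)
print("explicit, small dt:", y_exp_small)
```

With the large step, the explicit iterate grows in magnitude while the implicit iterate decays like the exact solution; with the small step, both methods decay.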

In the block below, we implement the explicit Euler method for general 1D systems governed by Newton's second law.

class EulerIntegrator(utils.BaseIntegrator):
    """Explicit (forward) Euler integrator."""

    def update_state(self, pos_0, vel_0, box_0, mas, ene_0, frc_0, engrad=None):
        acc = frc_0 / mas
        # Both updates use the state at the beginning of the step
        pos_1 = pos_0 + vel_0 * self.dt
        vel_1 = vel_0 + acc * self.dt
        # Re-evaluate the energy and forces at the new position
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        return pos_1, vel_1, box_0, mas, ene_1, frc_1

2.2 Verlet method and velocity-Verlet method

Since Euler's method is only first-order accurate (i.e. the error is $O (Δ t)$ ), we often want a higher-order method so that we can use a larger time step size $Δ t$ without loss of accuracy. There are many such methods (e.g. Runge-Kutta methods, multistep methods). Readers who want to know more about the families of high-order methods may consult this book: https://faculty.washington.edu/rjl/fdmbook/. Here, we introduce the Verlet and the velocity-Verlet methods, which are more relevant to MD problems. We again consider the system of ODEs governed by Newton's second law:

$\frac{d}{d t} r_{t} = v_{t}$ $\frac{d}{d t} v_{t} = a (t) = F (t) / m$

For the time step $n$ , the Taylor expansion of $r_{n + 1}$ is $r_{n + 1} = r_{n} + Δ t \frac{d}{d t} r_{n} + \frac{1}{2} (Δ t)^{2} \frac{d ^{2}}{d t ^{2}} r_{n} + \frac{1}{6} (Δ t)^{3} \frac{d ^{3}}{d t ^{3}} r_{n} + O (Δ t^{4})$

From our ODEs, we know that $\frac{d}{d t} r_{n} = v_{n}$ , $\frac{d ^{2}}{d t ^{2}} r_{n} = \frac{d}{d t} v_{n} = a (t_{n})$ , and $\frac{d ^{3}}{d t ^{3}} r_{n} = a^{'} (t_{n})$ . Hence, $r_{n + 1} = r_{n} + Δ t v_{n} + \frac{1}{2} a (t_{n}) (Δ t)^{2} + \frac{1}{6} (Δ t)^{3} a^{'} (t_{n}) + O (Δ t^{4})$

Similarly, if we expand $r_{n - 1}$ with Taylor series, we would get

$r_{n - 1} = r_{n} - Δ t v_{n} + \frac{1}{2} a (t_{n}) (Δ t)^{2} - \frac{1}{6} (Δ t)^{3} a^{'} (t_{n}) + O (Δ t^{4})$

Therefore, adding these two Taylor expansions yields

$r_{n - 1} + r_{n + 1} = 2 r_{n} + a (t_{n}) (Δ t)^{2} + O (Δ t^{4})$

and this gives us a numerical scheme, which people often call the verlet method,

$r_{n + 1} = 2 r_{n} - r_{n - 1} + a (t_{n}) (Δ t)^{2}$

which has a local truncation error of $O (Δ t^{4})$ in the position. Note that the Verlet method does not require us to update the velocity at every time step, so we do not need to keep track of $v_{1}, v_{2}, \dots, v_{n}$ . We can compute $r_{n + 1}$ directly as long as we have access to $r_{n}, r_{n - 1}$ and we can evaluate $a (t_{n})$ .
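As a small illustration of the two-step recurrence, here is a sketch for the unit harmonic oscillator $a (r) = - r$ , whose exact solution for $r_{0} = 1, v_{0} = 0$ is $\cos (t)$ . The function name `verlet_positions` and the choice to start the recurrence with a second-order Taylor step are assumptions made for this example.

```python
import math


def verlet_positions(acc, r0, v0, dt, n_steps):
    """Positions r_0..r_{n_steps} from r_{n+1} = 2 r_n - r_{n-1} + a(r_n) dt^2."""
    r_prev = r0
    # The recurrence needs two starting points; r_1 comes from a Taylor step
    r_curr = r0 + v0 * dt + 0.5 * acc(r0) * dt ** 2
    traj = [r_prev, r_curr]
    for _ in range(n_steps - 1):
        r_next = 2.0 * r_curr - r_prev + acc(r_curr) * dt ** 2
        r_prev, r_curr = r_curr, r_next
        traj.append(r_curr)
    return traj


dt, n_steps = 0.01, 1000
traj = verlet_positions(lambda r: -r, 1.0, 0.0, dt, n_steps)

# Compare with the exact solution cos(t) on the same time grid
max_err = max(abs(r - math.cos(k * dt)) for k, r in enumerate(traj))
print("max position error up to t = 10:", max_err)
```

No velocity is stored anywhere in the loop, which is the point made above: only the two most recent positions and the current acceleration are needed.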

As opposed to the verlet method, we also have the velocity-verlet method, which requires us to update the velocity at every step. The scheme is as follows:

$r_{n + 1} = r_{n} + v_{n} Δ t + \frac{1}{2} a (t_{n}) (Δ t)^{2}$ $v_{n + 1} = v_{n} + \frac{( a ( t _{n} ) + a ( t _{n + 1} )) Δ t}{2}$

The velocity update averages the accelerations at $t_{n}$ and $t_{n + 1}$ (a trapezoidal rule), with an error of $O (Δ t^{2})$ in the velocity. This error is multiplied by $Δ t$ when the velocity enters the position update, so the position is updated with a local truncation error of $O (Δ t^{3})$ .

Both the Verlet and the velocity-Verlet methods achieve better numerical accuracy than the simple explicit and implicit Euler methods. The time step size for both methods is still restricted by numerical stability, but they are much more stable than the explicit Euler method. In practice, they are therefore the methods that people most often use.
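To make the comparison concrete without the DMFF helpers, the sketch below integrates the unit harmonic oscillator (mass and spring constant both set to 1, an assumption for this example) with forward Euler and with velocity-Verlet, and records the largest deviation of the total energy from its initial value.

```python
def total_energy(r, v):
    """Kinetic plus potential energy for unit mass and unit spring constant."""
    return 0.5 * v * v + 0.5 * r * r


def euler_step(r, v, dt):
    # Both updates use the state at the beginning of the step
    return r + v * dt, v + (-r) * dt


def velocity_verlet_step(r, v, dt):
    a0 = -r
    r_new = r + v * dt + 0.5 * a0 * dt ** 2
    a1 = -r_new
    v_new = v + 0.5 * (a0 + a1) * dt
    return r_new, v_new


def max_energy_drift(step, dt, n_steps, r=1.0, v=0.0):
    """Largest |E_n - E_0| observed along the trajectory."""
    e0 = total_energy(r, v)
    drift = 0.0
    for _ in range(n_steps):
        r, v = step(r, v, dt)
        drift = max(drift, abs(total_energy(r, v) - e0))
    return drift


drift_euler = max_energy_drift(euler_step, dt=0.01, n_steps=1000)
drift_vv = max_energy_drift(velocity_verlet_step, dt=0.01, n_steps=1000)
print("Euler drift:", drift_euler)
print("velocity-Verlet drift:", drift_vv)
```

For this system the Euler energy grows steadily, while the velocity-Verlet energy only oscillates within a small band around its initial value.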

class VerletIntegrator(BaseIntegrator):
    """Two-step (position) Verlet; the first step is bootstrapped with velocity-Verlet."""

    def __init__(self, timestep=1.0e-3):
        self.dt = timestep

    def update_state(self, pos_0, pos_1, box_0, mas, ene_1, frc_1, engrad=None):
        # pos_0 = r_{n-1}, pos_1 = r_n; returns r_{n+1} with its energy and forces
        acc_1 = frc_1 / mas
        pos_2 = 2 * pos_1 - pos_0 + acc_1 * (self.dt ** 2)
        ene_2, grd_2 = engrad(pos_2, box_0)
        frc_2 = - grd_2
        return pos_2, ene_2, frc_2

    def update_state_velocity_verlet(self, pos_0, vel_0, box_0, mas, ene_0, frc_0, engrad=None):
        acc_0 = frc_0 / mas
        pos_1 = pos_0 + vel_0 * self.dt + 0.5 * acc_0 * self.dt ** 2
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        acc_1 = frc_1 / mas
        vel_1 = vel_0 + 0.5 * (acc_0 + acc_1) * self.dt
        return pos_1, vel_1, box_0, mas, ene_1, frc_1

    def step(self, pos_0, state, engrad=None):
        # pos_0 holds the positions of the previous step (r_{n-1})
        pos = state.getPositions()
        vel = state.getVelocities()
        box = state.getCellVector()
        mas = state.getMasses()
        frc = state.getForces()
        ener = state.getPotentialEnergy()
        pos_2, ene_2, frc_2 = self.update_state(pos_0, pos, box, mas, ener, frc, engrad=engrad)
        # Velocities are not updated by the Verlet recurrence; they are passed through unchanged
        state_new = State(positions=pos_2,
                          velocities=vel,
                          cell=box,
                          energy=ene_2,
                          forces=frc_2,
                          masses=mas)
        return state_new

    def first_step(self, state, engrad=None):
        # One velocity-Verlet step provides r_1, the second point the recurrence needs
        pos = state.getPositions()
        vel = state.getVelocities()
        box = state.getCellVector()
        mas = state.getMasses()
        frc = state.getForces()
        ener = state.getPotentialEnergy()
        pos_1, vel_1, box_1, mas_1, ene_1, frc_1 = self.update_state_velocity_verlet(
            pos, vel, box, mas, ener, frc, engrad=engrad)
        state_new = State(positions=pos_1,
                          velocities=vel_1,
                          cell=box_1,
                          energy=ene_1,
                          forces=frc_1,
                          masses=mas)
        return state_new


class VelocityVerletIntegrator(BaseIntegrator):  # Credit to Xinyan Wang

    def update_state(self, pos_0, vel_0, box_0, mas, ene_0, frc_0, engrad=None):
        acc_0 = frc_0 / mas
        pos_1 = pos_0 + vel_0 * self.dt + 0.5 * acc_0 * self.dt ** 2
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        acc_1 = frc_1 / mas
        vel_1 = vel_0 + 0.5 * (acc_0 + acc_1) * self.dt
        return pos_1, vel_1, box_0, mas, ene_1, frc_1

2.3 Numerical Example

In the blocks below, we use a simple 1D harmonic oscillator to illustrate the performance of the methods introduced above. The total energy of this system is conserved by the exact dynamics, but it is not conserved by the numerical solutions because of discretization errors. Here, we do a convergence study. Let the time step sizes be $Δ t_{k} = 2^{- 2 - k}$ and let $E_{k}$ be the potential energy obtained by integrating numerically until $t = 1$ with time step size $Δ t_{k}$ . The potential energy $E_{k}$ depends only on the final position, so we use it instead of the total energy to measure the convergence.

We then measure the difference between successive refinements, $∣ E_{k + 1} - E_{k} ∣$ (labelled "relative error" in the plot), whose decay with $k$ indicates the speed of convergence of the numerical solver. We plot this quantity for the three integrators introduced above, applied to the 1D harmonic oscillator, and we see that the Verlet method and the velocity-Verlet method achieve almost the same accuracy and are both much better than the forward Euler method.

Here, we should emphasize that the actual error in the numerical solution at the final time (i.e. the global error) often differs in order from the local truncation error, which is what we analyzed with Taylor expansions. In practice, both the Verlet and the velocity-Verlet methods have a global error of $O (Δ t^{2})$ . The difference arises because our Taylor expansions assumed that $\frac{d}{d t} r (t_{n}) = v_{n}$ and $\frac{d}{d t} v (t_{n}) = a (x_{n})$ , which only hold exactly when $n = 0$ (i.e. at the initial conditions). The error accumulates in time, so these relations no longer hold at later steps. This also explains why the Verlet method and the velocity-Verlet method achieve almost the same accuracy in the end, even though our analysis gives the Verlet method a local truncation error of $O (Δ t^{4})$ in the position and the velocity-Verlet method a local truncation error of $O (Δ t^{3})$ in the position.

When we performed the analysis, we treated $a (t)$ as a function of $t$ , but in this example the acceleration depends on time only through the position, $a (t) = a (r_{t})$ . With this difference, the third-order term in the Taylor expansion follows from the chain rule:

$\frac{1}{6} (Δ t)^{3} \frac{d}{d t} a (x (t_{n})) = \frac{1}{6} (Δ t)^{3} a^{'} (x (t_{n})) \frac{d}{d t} x (t_{n}) = \frac{1}{6} (Δ t)^{3} a^{'} (x_{n}) v_{n}$
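The global orders quoted above can be estimated empirically by halving the time step and comparing errors against the exact solution. The sketch below does this for the unit harmonic oscillator with $r_{0} = 1, v_{0} = 0$ (exact position $\cos (t)$ ); the helper names and the base step size are assumptions for this example, and the exact solution is used here instead of the successive differences used in the notebook cells.

```python
import math


def euler_step(r, v, dt):
    return r + v * dt, v - r * dt


def velocity_verlet_step(r, v, dt):
    a0 = -r
    r_new = r + v * dt + 0.5 * a0 * dt ** 2
    v_new = v + 0.5 * (a0 - r_new) * dt
    return r_new, v_new


def final_position(step, dt, t_end=1.0, r=1.0, v=0.0):
    """Integrate up to t_end and return the final position."""
    for _ in range(round(t_end / dt)):
        r, v = step(r, v, dt)
    return r


def observed_order(step, dt):
    """Estimate the global order p from err(dt) / err(dt/2) ~ 2**p."""
    exact = math.cos(1.0)
    err_coarse = abs(final_position(step, dt) - exact)
    err_fine = abs(final_position(step, dt / 2) - exact)
    return math.log2(err_coarse / err_fine)


order_euler = observed_order(euler_step, dt=0.01)
order_vv = observed_order(velocity_verlet_step, dt=0.01)
print("observed order, Euler:", order_euler)
print("observed order, velocity-Verlet:", order_vv)
```

The estimates come out close to 1 for forward Euler and close to 2 for velocity-Verlet, in line with the global error orders discussed above.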

time_step_size = 1e-3
steps = int(1 / time_step_size)

# Velocity-Verlet: record the total energy after every step
energies_vv = []
omm_pdb, engrad = utils.create1DHarmonicOscillator()
state = init_state_from_PDB(omm_pdb, engrad=engrad)
state.velocities = jax.numpy.array([[-0.79840655, 1.3889997, -0.25326978]])
total_energy = state.getTotalEnergy()
integ = VelocityVerletIntegrator(time_step_size)
for nstep in trange(steps):
    state = integ.step(state, engrad)
    energies_vv.append([state.getTotalEnergy()])
energies_vv = np.array(energies_vv)
plt.plot(energies_vv - total_energy, label="Velocity-Verlet", linewidth=2)

# Forward Euler: same initial condition and step size
energies_euler = []
omm_pdb, engrad = utils.create1DHarmonicOscillator()
state = init_state_from_PDB(omm_pdb, engrad=engrad)
state.velocities = jax.numpy.array([[-0.79840655, 1.3889997, -0.25326978]])
integ = EulerIntegrator(time_step_size)
for nstep in trange(steps):
    state = integ.step(state, engrad)
    energies_euler.append([state.getTotalEnergy()])
energies_euler = np.array(energies_euler)
plt.plot(energies_euler - total_energy, label="ForwardEuler", linewidth=2)

plt.xlabel("Time Steps", fontsize=15)
plt.ylabel('Total Energy Drift', fontsize=15)
plt.title("The total energy change in the simulation", fontsize=15)
plt.legend()
plt.show()

# Six step sizes 1/4, 1/8, ..., 1/128 give five successive differences,
# which are plotted against the finer step size of each pair.
time_grid = (1/8) / (2 ** (np.arange(5)))

# Velocity-Verlet
energies_vv = []
for step_size in range(6):
    omm_pdb, engrad = utils.create1DHarmonicOscillator()
    state = init_state_from_PDB(omm_pdb, engrad=engrad)
    state.velocities = jax.numpy.array([[-0.79840655, 1.3889997, -0.25326978]])
    time_step_size = 1/4 / (2 ** step_size)
    integ = VelocityVerletIntegrator(time_step_size)
    steps = int(1 / time_step_size)
    for nstep in trange(steps):
        state = integ.step(state, engrad)
    # Potential energy at t = 1 for this step size
    energies_vv.append([state.getPotentialEnergy()])
energies_vv = np.array(energies_vv)
plt.semilogy(time_grid, np.abs(energies_vv[1:] - energies_vv[:-1]), label="Velocity_Verlet", linestyle="--", marker='o', linewidth=2)

# Verlet (two-step recurrence, bootstrapped with one velocity-Verlet step)
energies_v = []
for step_size in range(6):
    omm_pdb, engrad = utils.create1DHarmonicOscillator()
    state = init_state_from_PDB(omm_pdb, engrad=engrad)
    state.velocities = jax.numpy.array([[-0.79840655, 1.3889997, -0.25326978]])
    time_step_size = 1/4 / (2 ** step_size)
    integ = VerletIntegrator(time_step_size)
    steps = int(1 / time_step_size)
    pos_0 = state.getPositions()
    state = integ.first_step(state, engrad)
    for nstep in trange(steps - 1):
        state_0 = state
        state = integ.step(pos_0, state, engrad)
        pos_0 = state_0.getPositions()
    energies_v.append([state.getPotentialEnergy()])
energies_v = np.array(energies_v)
plt.semilogy(time_grid, np.abs(energies_v[1:] - energies_v[:-1]), label="Verlet", linestyle="--", marker='o', linewidth=2)

# Forward Euler
energies_e = []
for step_size in range(6):
    omm_pdb, engrad = utils.create1DHarmonicOscillator()
    state = init_state_from_PDB(omm_pdb, engrad=engrad)
    state.velocities = jax.numpy.array([[-0.79840655, 1.3889997, -0.25326978]])
    time_step_size = 1/4 / (2 ** step_size)
    integ = EulerIntegrator(time_step_size)
    steps = int(1 / time_step_size)
    for nstep in trange(steps):
        state = integ.step(state, engrad)
    energies_e.append([state.getPotentialEnergy()])
energies_e = np.array(energies_e)
plt.semilogy(time_grid, np.abs(energies_e[1:] - energies_e[:-1]), label="Euler", linestyle="--", marker='o', linewidth=2)

plt.gca().invert_xaxis()
plt.xscale('log', base=2)
plt.xlabel("Time step size", fontsize=15)
plt.ylabel(r'$|E_{n} - E_{n-1}|$', fontsize=15)
plt.title("Relative error in the potential energy", fontsize=15)
plt.legend()
plt.show()

2. Simple SDE Integrators

In the following sections we put more emphasis on SDE integrators than on ODE integrators, because in MD simulations we deal with SDEs more often than with ODEs. Suppose we have a one-dimensional SDE as follows:

$d X_{t} = b (t, X_{t}) d t + σ (t, X_{t}) d W_{t}$

where $b (t, X_{t})$ is often called the drift term, $σ (t, X_{t})$ is often called the diffusion term, and $W_{t}$ denotes a standard Brownian motion (Wiener process).

We often write stochastic processes in the form of SDEs, but an SDE is only shorthand for the integral equation

$X_{t} = X_{0} + \int_{0}^{t} b (s, X_{s}) d s + \int_{0}^{t} σ (s, X_{s}) d W_{s}$

The integral $\int_{0}^{t} σ (s, X_{s}) d W_{s}$ is defined as an Ito integral. Unlike ODEs, SDEs cannot be interpreted pointwise in terms of derivatives, because Brownian motion is almost surely nowhere differentiable. Therefore, $d W_{t}$ has no meaning on its own and only makes sense inside the integral.
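To give a feeling for what the Ito integral means, the sketch below approximates $\int_{0}^{T} W_{s} d W_{s}$ on a fine grid. Evaluating the integrand at the left endpoint of each interval gives the Ito value, whose closed form is $(W_{T}^{2} - T) /2$ ; averaging the two endpoints instead gives $W_{T}^{2} /2$ , as ordinary calculus would suggest. The grid size and the random seed are arbitrary choices for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
T, n = 1.0, 100_000
dt = T / n

# Brownian increments and the path W_0 = 0, W_{t_1}, ..., W_T
dW = rng.normal(0.0, np.sqrt(dt), size=n)
W = np.concatenate(([0.0], np.cumsum(dW)))

# Ito sum: integrand evaluated at the left endpoint of each interval
ito_sum = np.sum(W[:-1] * dW)
ito_closed_form = 0.5 * (W[-1] ** 2 - T)

# Averaging both endpoints telescopes to W_T^2 / 2 instead
midpoint_sum = np.sum(0.5 * (W[:-1] + W[1:]) * dW)

print("Ito sum:", ito_sum, " closed form:", ito_closed_form)
print("endpoint-averaged sum:", midpoint_sum, " W_T^2 / 2:", 0.5 * W[-1] ** 2)
```

The gap between the two sums is about $T /2$ , which shows that the choice of evaluation point matters for stochastic integrals in a way it does not for ordinary ones.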

Now, we are going to introduce several SDE integrators, including the Euler-Maruyama scheme, which is the counterpart of the Euler method we introduced as an ODE integrator. We will also introduce the Milstein scheme, which is less frequently used than the Euler-Maruyama scheme.

2.1 Euler-Maruyama Scheme

The Euler-Maruyama scheme is very simple. Suppose we know $X_{t}$ at step n (i.e. time $t_{n}$ ), which we denote by $X_{n}$ , then

$X_{n + 1} = X_{n} + b (t_{n}, X_{n}) Δ t + σ (t_{n}, X_{n}) ξ$

where $Δ t$ denotes the time step size and $ξ \sim N (0, Δ t)$ is a sample from the Gaussian distribution with mean $0$ and variance $Δ t$ . This can be derived from the stochastic Ito-Taylor expansion.
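A minimal NumPy sketch of the Euler-Maruyama update is given below, applied to many independent paths of the Ornstein-Uhlenbeck process $d X_{t} = - θ X_{t} d t + σ d W_{t}$ , whose stationary variance is $σ^{2} / (2 θ)$ . The function name `euler_maruyama`, the test process, and all parameter values are assumptions for this example.

```python
import numpy as np


def euler_maruyama(b, sigma, x0, dt, n_steps, rng):
    """Apply X_{n+1} = X_n + b(t_n, X_n) dt + sigma(t_n, X_n) xi, xi ~ N(0, dt)."""
    x = np.array(x0, dtype=float)
    t = 0.0
    for _ in range(n_steps):
        xi = rng.normal(0.0, np.sqrt(dt), size=x.shape)
        x = x + b(t, x) * dt + sigma(t, x) * xi
        t += dt
    return x


theta, sig = 1.0, 1.0             # illustrative OU parameters (assumptions)
rng = np.random.default_rng(0)
x_final = euler_maruyama(
    b=lambda t, x: -theta * x,    # drift term
    sigma=lambda t, x: sig,       # constant diffusion term
    x0=np.zeros(20_000),          # 20000 independent paths started at 0
    dt=0.01,
    n_steps=500,
    rng=rng,
)

sample_var = x_final.var()
print("sample variance:", sample_var, " expected about:", sig ** 2 / (2 * theta))
```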

2.2 Milstein Scheme

Using the notation of the previous section, we state the Milstein scheme, which is less stable but more accurate than the Euler-Maruyama scheme, for time-homogeneous SDEs (i.e. $b, σ$ depend only on $X_{t}$ ) as follows:

$X_{n + 1} = X_{n} + b (X_{n}) Δ t + σ (X_{n}) ξ + \frac{1}{2} σ (X_{n}) σ^{'} (X_{n}) (ξ^{2} - Δ t)$

where $Δ t$ denotes the time step size and $ξ \sim N (0, Δ t)$ is a sample from the Gaussian distribution with mean $0$ and variance $Δ t$ . Notice that all terms on the right-hand side except the last one coincide with the Euler-Maruyama scheme. The last term is a correction term, which can be derived from the stochastic Ito-Taylor expansion; it vanishes when $σ$ is constant.
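The effect of the correction term can be seen on geometric Brownian motion, $d X_{t} = μ X_{t} d t + σ X_{t} d W_{t}$ , which has the exact solution $X_{T} = X_{0} \exp ((μ - σ^{2} /2) T + σ W_{T})$ . The sketch below drives both schemes with the same Brownian increments and compares the mean absolute error at the final time. The test equation and all parameter values are assumptions chosen for this example.

```python
import numpy as np

rng = np.random.default_rng(0)
mu, sigma, x0, T = 0.1, 0.5, 1.0, 1.0     # illustrative parameters
n_steps, n_paths = 100, 5000
dt = T / n_steps

x_em = np.full(n_paths, x0)    # Euler-Maruyama paths
x_mil = np.full(n_paths, x0)   # Milstein paths
w = np.zeros(n_paths)          # accumulated Brownian motion W_T per path

for _ in range(n_steps):
    xi = rng.normal(0.0, np.sqrt(dt), size=n_paths)
    x_em = x_em + mu * x_em * dt + sigma * x_em * xi
    # Here sigma(x) = sigma * x, so sigma(x) * sigma'(x) = sigma**2 * x
    x_mil = (x_mil + mu * x_mil * dt + sigma * x_mil * xi
             + 0.5 * sigma ** 2 * x_mil * (xi ** 2 - dt))
    w += xi

x_exact = x0 * np.exp((mu - 0.5 * sigma ** 2) * T + sigma * w)
err_em = np.mean(np.abs(x_em - x_exact))
err_mil = np.mean(np.abs(x_mil - x_exact))
print("mean |error|, Euler-Maruyama:", err_em)
print("mean |error|, Milstein:", err_mil)
```

Because both schemes see identical noise, the difference in the pathwise error is due to the correction term alone.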


3. Langevin Dynamics & Its numerical Integrators

We consider the Hamiltonian system governed by the following equations:

$d q = M^{- 1} p \, d t$

$d p = (- \nabla U (q) - γ p) \, d t + \sqrt{2 k_{B} T γ} \, M^{1/2} \, d W_{t}$

where $q$ and $p$ are vector-valued. In the above equations, $q, p, U (q)$ denote the position, momentum, and the potential energy of a particle. $T$ denotes the temperature, $k_{B}$ denotes the Boltzmann constant, and $γ$ denotes the friction coefficient.

Question: by definition, what is the Hamiltonian of the system?

Answer: $H (q, p) = U (q) + \frac{1}{2} p^{T} M^{- 1} p$

We can rewrite this system of equations as

$d \begin{bmatrix} q \\ p \end{bmatrix} = \begin{bmatrix} M^{- 1} p \\ 0 \end{bmatrix} d t + \begin{bmatrix} 0 \\ - \nabla U (q) \end{bmatrix} d t + \begin{bmatrix} 0 \\ - γ p \, d t + σ M^{1/2} d W_{t} \end{bmatrix}$

where we let $σ = \sqrt{2 k_{B} T γ}$ to simplify the notation.

We label the three terms on the right-hand side as $A = \begin{bmatrix} M^{- 1} p \\ 0 \end{bmatrix} d t$ , $B = \begin{bmatrix} 0 \\ - \nabla U (q) \end{bmatrix} d t$ , and $O = \begin{bmatrix} 0 \\ - γ p \, d t + σ M^{1/2} d W_{t} \end{bmatrix}$ . Suppose we have the following three equations:

$d \begin{bmatrix} q \\ p \end{bmatrix} = A, \quad d \begin{bmatrix} q \\ p \end{bmatrix} = B, \quad d \begin{bmatrix} q \\ p \end{bmatrix} = O$

From time step $n$ to $n + 1$ , each of these three equations can be solved over one step of size $Δ t$ (A and B exactly, O exactly in distribution) using the following updates, respectively:

$\begin{bmatrix} q_{n + 1} \\ p_{n + 1} \end{bmatrix} = \begin{bmatrix} q_{n} + M^{- 1} p_{n} Δ t \\ p_{n} \end{bmatrix}, \quad \begin{bmatrix} q_{n + 1} \\ p_{n + 1} \end{bmatrix} = \begin{bmatrix} q_{n} \\ p_{n} - \nabla U (q_{n}) Δ t \end{bmatrix}, \quad \begin{bmatrix} q_{n + 1} \\ p_{n + 1} \end{bmatrix} = \begin{bmatrix} q_{n} \\ e^{- γ Δ t} p_{n} + \sqrt{k_{B} T (1 - e^{- 2 γ Δ t})} \, M^{1/2} R_{n} \end{bmatrix}$

where $R_{n}$ is a Gaussian vector with mean $0$ and covariance $I$ . Now we have split the Langevin equations into three pieces and we know how to solve each one of them.

Since the Langevin equations can be written as

$d [q p] = A + B + O$

we can solve the Langevin equations approximately by composing the solutions of the three split equations, and many numerical schemes can be built this way.

For example, we can first solve A for half a time step, then solve B for a full step using the $q_{n + 1/2}$ obtained from A, then solve A for another half step using the updated momentum obtained from B, and finally solve O for a full step. This method is abbreviated as ABAO. Similarly, one can construct many other schemes, such as BABO, ABOBA, and BAOAB. Writing out these schemes (e.g. BABO, ABOBA, BAOAB) is left as an exercise to the reader.

As a side note, ABOBA and BAOAB are often referred to as the Position-Verlet method and the Velocity-Verlet method, and they preserve the variance of position.
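To show how the A, B, and O updates compose into one step, here is a standalone NumPy sketch of BAOAB for a one-dimensional harmonic oscillator $U (q) = k q^{2} /2$ in reduced units. It does not use the DMFF tutorial classes; the function name `baoab_harmonic` and all parameter values are assumptions for this example. At equilibrium the position variance should be close to $k_{B} T / k$ .

```python
import numpy as np


def baoab_harmonic(k=1.0, m=1.0, kT=1.0, gamma=1.0, dt=0.1,
                   n_steps=200_000, seed=0):
    """Sample positions of a 1D harmonic oscillator with the BAOAB splitting."""
    rng = np.random.default_rng(seed)
    noise = rng.normal(size=n_steps)           # one Gaussian number per O step
    c1 = np.exp(-gamma * dt)                   # momentum damping factor in O
    c2 = np.sqrt(kT * (1.0 - c1 * c1) * m)     # noise amplitude in O
    q, p = 0.0, 0.0
    force = -k * q
    q_samples = np.empty(n_steps)
    for n in range(n_steps):
        p += 0.5 * dt * force                  # B: half kick
        q += 0.5 * dt * p / m                  # A: half drift
        p = c1 * p + c2 * noise[n]             # O: exact Ornstein-Uhlenbeck step
        q += 0.5 * dt * p / m                  # A: half drift
        force = -k * q                         # one force evaluation per step
        p += 0.5 * dt * force                  # B: half kick
        q_samples[n] = q
    return q_samples


q_samples = baoab_harmonic()
var_q = q_samples.var()
print("sampled position variance:", var_q, " target kT/k:", 1.0)
```

Reordering the five sub-steps inside the loop gives the other splittings mentioned above; only the order of the letters changes, not the individual updates.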

class BAOABLangevinIntegrator(utils.BaseIntegrator):  # credit to Xinyan Wang

    def __init__(self, temperature=298.15, gamma=5.0, timestep=1.0e-3, removeCMMotion=False):
        self.dt = timestep
        self.gamma = gamma
        self.temperature = temperature
        self.removeCMMotion = removeCMMotion
        # Damping factor exp(-gamma * dt) of the O step
        self.vscale = jnp.exp(- self.gamma * self.dt)
        # k_B * N_A * 1e-3 * T, i.e. k_B T in kJ/mol
        kbT = 1.380649 * 6.02214076 * 1e-3 * self.temperature
        self.noisescale = jnp.sqrt(kbT * (1. - self.vscale * self.vscale))

    def update_state(self, pos_0, vel_0, box_0, mas, ene_0, frc_0, engrad=None):
        # B
        vel_1 = vel_0 + frc_0 / mas * 0.5 * self.dt
        # A
        pos_1 = pos_0 + vel_1 * 0.5 * self.dt
        # O
        vel_1 = self.vscale * vel_1 + self.noisescale / jnp.sqrt(mas) * jnp.array(np.random.normal(size=vel_0.shape))
        # A
        pos_1 = pos_1 + vel_1 * 0.5 * self.dt
        # B
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        vel_1 = vel_1 + frc_1 / mas * 0.5 * self.dt
        if self.removeCMMotion:
            vel_1 -= vel_1.mean(axis=0)
        return pos_1, vel_1, box_0, mas, ene_1, frc_1


class ABOBALangevinIntegrator(utils.BaseIntegrator):

    def __init__(self, temperature=298.15, gamma=5.0, timestep=1.0e-3, removeCMMotion=False):
        self.dt = timestep
        self.gamma = gamma
        self.temperature = temperature
        # update_state reads this flag, so it must be set here
        self.removeCMMotion = removeCMMotion
        self.vscale = jnp.exp(- self.gamma * self.dt)
        kbT = 1.380649 * 6.02214076 * 1e-3 * self.temperature
        self.noisescale = jnp.sqrt(kbT * (1. - self.vscale * self.vscale))

    def update_state(self, pos_0, vel_0, box_0, mas, ene_0, frc_0, engrad=None):
        # A
        pos_1 = pos_0 + vel_0 * 0.5 * self.dt
        # B (the force is evaluated once, at the half-step position)
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        vel_1 = vel_0 + frc_1 / mas * 0.5 * self.dt
        # O
        vel_1 = self.vscale * vel_1 + self.noisescale / jnp.sqrt(mas) * jnp.array(np.random.normal(size=vel_0.shape))
        # B
        vel_1 = vel_1 + frc_1 / mas * 0.5 * self.dt
        # A
        pos_1 = pos_1 + vel_1 * 0.5 * self.dt
        if self.removeCMMotion:
            vel_1 -= vel_1.mean(axis=0)
        # Note: ene_1 and frc_1 correspond to the half-step position, not the final pos_1
        return pos_1, vel_1, box_0, mas, ene_1, frc_1

Exercise: Can you modify the code above to write the OABAO integrator? Then, use the code provided below for simulating the one-dimensional harmonic oscillator to test your integrator: find an invariant (preserved) quantity and monitor whether it stays preserved during the simulation.


4. Overdamped Langevin Dynamics

An interesting special case of Langevin dynamics is obtained by considering the large $γ$ limit, which is often referred to as the overdamped limit. The resulting equation is called the overdamped Langevin dynamics. In the limit of large $γ$ , the change in $p$ damps very quickly and we can assume that the acceleration is negligible, which yields the following system of equations:

$d q = M^{- 1} p d t$

$0 = (- \nabla U (q) - γ p) \, d t + \sqrt{2 k_{B} T γ} \, M^{1/2} \, d W_{t}$

From the second equation, we obtain

$p \, d t = - γ^{- 1} \nabla U (q) \, d t + \sqrt{2 k_{B} T γ^{- 1}} \, M^{1/2} \, d W_{t}$

Combining it with the first equation gives

$d q = - γ^{- 1} M^{- 1} \nabla U (q) \, d t + \sqrt{2 k_{B} T γ^{- 1}} \, M^{- 1/2} \, d W_{t}$

Eliminating $γ$ and $M$ by rescaling time yields

$d q = - \nabla U (q) \, d t + \sqrt{2 k_{B} T} \, d W_{t}$

This is also sometimes referred to as Brownian dynamics. The overdamped Langevin dynamics has the equilibrium Gibbs distribution $ρ (q) \propto \exp (- U (q) / (k_{B} T))$ , which is also referred to as the canonical ensemble, and people often integrate the above SDE to sample from $ρ (q)$ . In the machine learning community this sampling technique is simply called Langevin dynamics, and it is widely used to sample from energy-based models and diffusion models. Its Euler-Maruyama discretization is a gradient descent step on $U$ with added Gaussian noise, which is why it is sometimes loosely associated with stochastic gradient descent.
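As a quick check of the sampling claim, the sketch below integrates the rescaled overdamped equation with Euler-Maruyama for many independent walkers in the quadratic potential $U (q) = k q^{2} /2$ , for which the Gibbs distribution is a Gaussian with variance $k_{B} T / k$ . The function name `overdamped_langevin`, the potential, and the parameter values are assumptions for this example, in reduced units.

```python
import numpy as np


def overdamped_langevin(grad_U, q0, kT, dt, n_steps, rng):
    """Euler-Maruyama for dq = -grad U(q) dt + sqrt(2 kT) dW."""
    q = np.array(q0, dtype=float)
    noise_scale = np.sqrt(2.0 * kT * dt)
    for _ in range(n_steps):
        q = q - grad_U(q) * dt + noise_scale * rng.normal(size=q.shape)
    return q


k, kT = 2.0, 1.0                      # illustrative values (assumptions)
rng = np.random.default_rng(0)
q_final = overdamped_langevin(
    grad_U=lambda q: k * q,           # gradient of U(q) = k q^2 / 2
    q0=np.zeros(20_000),              # 20000 independent walkers
    kT=kT,
    dt=0.01,
    n_steps=1000,
    rng=rng,
)

sample_var = q_final.var()
print("sample variance:", sample_var, " target kT/k:", kT / k)
```

The sampled variance carries a small bias of order $Δ t$ from the discretization, which is one reason the time step has to be kept small when this scheme is used as a sampler.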

class OverdampedLangevinIntegrator(utils.BaseIntegrator):

    def __init__(self, temperature=298.15, timestep=1.0e-3, removeCMMotion=True):
        # removeCMMotion is accepted for interface compatibility but is not used
        self.dt = timestep
        self.temperature = temperature
        kbT = 1.380649 * 6.02214076 * 1e-3 * self.temperature
        # Standard deviation of the noise term sqrt(2 k_B T) dW over one step
        self.noisescale = jnp.sqrt(kbT * 2 * self.dt)

    def update_state(self, pos_0, vel_0, box_0, mas, ene_0, frc_0, engrad=None):
        # Velocities play no role in the overdamped dynamics
        vel_0 = jnp.zeros_like(vel_0)
        # Euler-Maruyama
        pos_1 = pos_0 + frc_0 * self.dt + self.noisescale * jnp.array(np.random.normal(size=pos_0.shape))
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        return pos_1, vel_0, box_0, mas, ene_1, frc_1

5. Nose-Hoover Dynamics

Nose-Hoover dynamics provide an alternative to the Langevin dynamics for sampling from the canonical measure $ρ (p, q) \propto exp (- H (p, q) / (k_{B} T))$ , where $H (p, q)$ denotes the system Hamiltonian, with deterministic paths. For any test function $ϕ (p, q)$ , we want to compute the integral

$\int_{D} ϕ (p, q) ρ (p, q) \, d ω_{q} \, d ω_{p}$

where $D$ denotes the domain and $ω_{q}, ω_{p}$ are the coordinates of $q$ and $p$ . One way to compute this integral is to work in an extended space, introducing new variables $ξ$ in addition to $(q, p)$ together with additional equations of motion to drive them. The combined system is carefully designed to preserve an extended invariant distribution $ρ (q, p, ξ) \propto ρ (q, p) \times ρ (ξ)$

where the auxiliary density $ρ (ξ)$ has a simple form (e.g. a multivariate Gaussian). Then, it follows that

$\int \int_{D} ϕ (p, q) ρ (q, p, ξ) \, d ω_{q} \, d ω_{p} \, d ξ \propto \int_{D} ϕ (p, q) ρ (p, q) \, d ω_{q} \, d ω_{p}$

With the ergodicity assumption of the coupled system, we can then calculate the integral with respect to the Gibbs distribution by direct averaging along trajectories of the extended system. However, it is likely that the assumption of ergodicity fails and there are some known cases where this deterministic way of computing the integral leads to incorrect results. Still, this class of deterministic methods has been often used in molecular simulation.

Nose-Hoover dynamics is perhaps the simplest of the class of deterministic methods summarized above. Let $N_{d}$ denote the number of degrees of freedom of the system. The extended system of Nose-Hoover dynamics takes the form

$\frac{d q}{d t} = M^{- 1} p$ $\frac{d p}{d t} = - \nabla U (q) - ξ p$ $μ \frac{d ξ}{d t} = p^{T} M^{- 1} p - N_{d} k_{B} T$

When the instantaneous kinetic energy term $p^{T} M^{- 1} p$ exceeds $N_{d} k_{B} T$ (i.e. the system is hotter than the prescribed temperature), the equation for $ξ$ has a positive right-hand side, so $ξ$ increases and eventually becomes positive. This damps (cools) the physical variables through the equation for $p$ ; eventually the kinetic energy falls below the target value, and the right-hand side of the last equation becomes negative. By fluctuating in this way, $ξ$ typically keeps the time-averaged kinetic energy of the system very close to $N_{d} k_{B} T /2$ , which is what the kinetic energy should be at equilibrium. $μ$ is a free parameter that can be tuned to improve the performance of the dynamics.

This Nose-Hoover Dynamics preserves an extended measure with density of the form

$ρ (q, p) \times \exp (- μ ξ^{2} / (2 k_{B} T))$

which we will not prove here. To monitor the performance of numerical methods, we also observe that the Nose-Hoover system preserves the quantity

$I (q, p, ξ) = H (q, p) + \frac{μ ξ ^{2}}{2} + N_{d} k_{B} T \log s$

where $s$ solves the ODE $\frac{d}{d t} \log s = ξ$ ; differentiating $I$ along the equations of motion shows that the three contributions cancel. In the spirit of the splitting methods presented for the Langevin equation, we provide here a splitting method for the Nose-Hoover dynamics. Suppose we want to integrate from time step $n$ to $n + 1$ with time step size $Δ t$ ; then the scheme is as follows

$q_{n + 1/2} = q_{n} + \frac{Δ t}{2} M^{- 1} p_{n}$

$p_{n + 1/2} = e^{- Δ t ξ_{n} /2} p_{n} - \frac{1 - e ^{- Δ t ξ_{n} /2}}{ξ _{n}} \nabla U (q_{n + 1/2})$

$ξ_{n + 1} = ξ_{n} + Δ t μ^{- 1} (p_{n + 1/2}^{T} M^{- 1} p_{n + 1/2} - N_{d} k_{B} T)$

$p_{n + 1} = e^{- Δ t ξ_{n + 1} /2} p_{n + 1/2} - \frac{1 - e ^{- Δ t ξ_{n + 1} /2}}{ξ _{n + 1}} \nabla U (q_{n + 1/2})$

$q_{n + 1} = q_{n + 1/2} + \frac{Δ t}{2} M^{- 1} p_{n + 1}$
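The five updates above can be sketched for a one-dimensional harmonic oscillator in reduced units, together with a monitor for the preserved quantity $I$ . This is an illustrative sketch, not the tutorial's implementation: the function names, the parameter values, and the way $\log s$ is advanced (half a step with $ξ_{n}$ and half a step with $ξ_{n + 1}$ , mirroring the two momentum updates) are assumptions made for this example.

```python
import math


def damped_kick(p, xi, force, h):
    """Exact solution of dp/dt = force - xi * p over time h (force, xi frozen)."""
    if abs(xi) < 1e-12:
        return p + force * h              # limit xi -> 0 avoids dividing by zero
    decay = math.exp(-xi * h)
    return decay * p + (1.0 - decay) / xi * force


def nose_hoover_step(q, p, xi, log_s, dt, m, k, mu, n_d, kT):
    q = q + 0.5 * dt * p / m                          # q_{n+1/2}
    force = -k * q                                    # -grad U(q_{n+1/2})
    p = damped_kick(p, xi, force, 0.5 * dt)           # p_{n+1/2}
    log_s = log_s + 0.5 * dt * xi
    xi = xi + dt / mu * (p * p / m - n_d * kT)        # xi_{n+1}
    p = damped_kick(p, xi, force, 0.5 * dt)           # p_{n+1}
    log_s = log_s + 0.5 * dt * xi
    q = q + 0.5 * dt * p / m                          # q_{n+1}
    return q, p, xi, log_s


def invariant(q, p, xi, log_s, m, k, mu, n_d, kT):
    """I = H(q, p) + mu * xi^2 / 2 + N_d * kT * log(s)."""
    return 0.5 * p * p / m + 0.5 * k * q * q + 0.5 * mu * xi * xi + n_d * kT * log_s


params = dict(m=1.0, k=1.0, mu=1.0, n_d=1, kT=1.0)    # illustrative values
q, p, xi, log_s = 1.0, 1.0, 0.0, 0.0
i0 = invariant(q, p, xi, log_s, **params)
max_dev = 0.0
for _ in range(10_000):
    q, p, xi, log_s = nose_hoover_step(q, p, xi, log_s, dt=1e-3, **params)
    max_dev = max(max_dev, abs(invariant(q, p, xi, log_s, **params) - i0))
print("largest deviation of I from its initial value:", max_dev)
```

A deviation that stays small and shrinks when $Δ t$ is reduced is a useful sanity check for an implementation of the scheme.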

class NoseHooverIntegrator(utils.BaseIntegrator):

    def __init__(self, temperature=298.15, mu=5.0, Nd=1, timestep=1.0e-3, removeCMMotion=False):
        self.dt = timestep
        self.mu = mu
        self.temperature = temperature
        self.removeCMMotion = removeCMMotion
        # Number of degrees of freedom (the original code ignored this argument)
        self.Nd = Nd
        self.kbT = 1.380649 * 6.02214076 * 1e-3 * self.temperature
        # Initial value of the thermostat variable xi
        self.xi = -1

    def step(self, state, engrad=None):
        pos = state.getPositions()
        vel = state.getVelocities()
        box = state.getCellVector()
        mas = state.getMasses()
        frc = state.getForces()
        ener = state.getPotentialEnergy()
        # xi is carried between steps on the integrator itself
        pos_1, vel_1, self.xi, box_1, mas_1, ene_1, frc_1 = self.update_state(
            pos, vel, self.xi, box, mas, ener, frc, engrad=engrad)
        state_new = State(positions=pos_1,
                          velocities=vel_1,
                          cell=box_1,
                          energy=ene_1,
                          forces=frc_1,
                          masses=mas)
        return state_new

    def update_state(self, pos_0, vel_0, xi_0, box_0, mas, ene_0, frc_0, engrad=None):
        # Half drift to q_{n+1/2}
        pos_1 = pos_0 + vel_0 * 0.5 * self.dt
        ene_1, grd_1 = engrad(pos_1, box_0)
        frc_1 = - grd_1
        # Half step of the damped kick with xi_n (divides by xi, so xi must be nonzero)
        vel_1 = vel_0 * jnp.exp(-self.dt * xi_0 / 2) + frc_1 / mas / xi_0 * (1 - jnp.exp(-self.dt * xi_0 / 2))
        # Full step for xi; p^T M^{-1} p equals sum(m * v^2)
        xi_1 = xi_0 + self.dt / self.mu * (jnp.sum(mas * vel_1 ** 2) - self.Nd * self.kbT)
        # Half step of the damped kick with xi_{n+1}
        vel_1 = vel_1 * jnp.exp(-self.dt * xi_1 / 2) + frc_1 / mas / xi_1 * (1 - jnp.exp(-self.dt * xi_1 / 2))
        # Half drift to q_{n+1}
        pos_1 = pos_1 + vel_1 * 0.5 * self.dt
        if self.removeCMMotion:
            vel_1 -= vel_1.mean(axis=0)
        # Note: ene_1 and frc_1 correspond to the half-step position q_{n+1/2}
        return pos_1, vel_1, xi_1, box_0, mas, ene_1, frc_1

6. Numerical Example (1D Harmonic Oscillator)

def HarmonicOscillatorExample(integrator):
    # Select the integrator by name; unknown names fall back to BAOAB
    if integrator == "ABOBA":
        integ = ABOBALangevinIntegrator()
        # update_state reads this flag; set it explicitly so the call is safe
        integ.removeCMMotion = False
    elif integrator == "Overdamped":
        integ = OverdampedLangevinIntegrator()
    elif integrator == "NoseHoover":
        integ = NoseHooverIntegrator()
    else:
        integ = BAOABLangevinIntegrator()

    omm_pdb, engrad = utils.create1DHarmonicOscillator()
    state = init_state_from_PDB(omm_pdb, engrad=engrad)
    # JAX arrays are immutable, so the result of .at[...].set() must be assigned back.
    # Assumption: velocities have shape (1, 3) and the intent is to zero the y and z components.
    state.velocities = state.velocities.at[:, 1:].set(0)

    pos_list = []
    pe_list, ke_list, temp_list = [], [], []
    for nstep in trange(200000):
        state = integ.step(state, engrad)
        if nstep % 10 == 0:
            pos = state.getPositions()[0, 0]
            pos_list.append(pos)
            pe_list.append(state.getPotentialEnergy())
            ke_list.append(state.getKineticEnergy())
            temp_list.append(state.getTemperature())
    pos_list = np.array(pos_list)
    pe_list = np.array(pe_list)
    ke_list = np.array(ke_list)
    temp_list = np.array(temp_list)

    # Histogram of sampled x coordinates versus the Boltzmann distribution
    yy, xx, axis = plt.hist(pos_list, bins=31, density=True, label="sample")
    xx = (xx[1:] + xx[:-1]) / 2
    ee = 50. * xx * xx
    pp = np.exp(- ee / 1.380649 / 6.02214076e-3 / 298.15)
    pp = pp / pp.sum() / (xx[1] - xx[0])
    plt.plot(xx, pp, label="prob.")
    plt.xlabel("x coord")
    plt.ylabel("probability density")
    plt.legend()
    return state

HarmonicOscillatorExample("NoseHoover")

HarmonicOscillatorExample("ABOBA")

HarmonicOscillatorExample("Overdamped")

7. Numerical Example (2D Lennard-Jones Fluid)

def LennardJonesFluid(integrator):
    # Select the integrator by name; unknown names fall back to BAOAB
    if integrator == "ABOBA":
        integ = ABOBALangevinIntegrator()
        # update_state reads this flag; set it explicitly so the call is safe
        integ.removeCMMotion = True
    elif integrator == "Overdamped":
        integ = OverdampedLangevinIntegrator()
    elif integrator == "NoseHoover":
        integ = NoseHooverIntegrator(removeCMMotion=True)
    else:
        integ = BAOABLangevinIntegrator(removeCMMotion=True)

    omm_pdb, engrad = utils.createLJFluid()
    state = init_state_from_PDB(omm_pdb, engrad=engrad)
    trj_writer = XYZWriter("traj_langevin_lj.xyz", omm_pdb.topology)

    pe_list, ke_list, temp_list = [], [], []
    for nstep in trange(500 * 200):
        state = integ.step(state, engrad)
        # Record observables and write a trajectory frame every 500 steps
        if nstep % 500 == 0:
            pe_list.append(state.getPotentialEnergy())
            ke_list.append(state.getKineticEnergy())
            temp_list.append(state.getTemperature())
            trj_writer.write(state)
    pe_list = np.array(pe_list)
    ke_list = np.array(ke_list)
    temp_list = np.array(temp_list)
    trj_writer.close()

    plt.plot(temp_list)
    plt.show()
    plt.hist(temp_list, bins=31)
    plt.show()

LennardJonesFluid("NoseHoover")

LennardJonesFluid("Overdamped")

LennardJonesFluid("ABOBA")
