Voronoi CVs for enhanced sampling autoionization and tautomerism
©️ Copyright 2023 @ Authors
Author:Pengchao Zhang 📨
Date:2024-04
License: This work is licensed under a Creative Commons Attribution-NonCommercial Use-ShareAlike 4.0 International License.
🎉 We will explain the Voronoi collective variables (CVs) based on the references
- Atomistic Simulations of Acid-Base Equilibria
- Microscopic Description of Acid-Base Equilibrium
- Tautomeric Equilibrium in Condensed Phases
- Intramolecular and water mediated tautomerism of solvated glycine
- Propensity of water self-ions at air(oil)-water interfaces revealed by deep potential molecular dynamics with enhanced sampling
If you are not familiar with enhanced sampling, here are some references:
- Metadynamics
- Enhancing Important Fluctuations: Rare Events and Metadynamics from a Conceptual Viewpoint
- Using metadynamics to explore complex free-energy landscapes
- Enhanced Sampling Methods
- PLUMED Tutorials
- OPES (on-the-fly probability enhanced sampling)
- RiD
If you can already largely manage the above, let's get started! This Notebook is cataloged below:
For acid-base equilibria in water, a crucial step is the identification of charge defects (HO and OH) within the hydrogen bond network of the system. The most intuitive and naive approach to detect such charge carriers is to partition the space with spheres around every water oxygen atom and to count how many hydrogen atoms fall within each sphere. Therefore, the oxygen atoms whose spheres count an excess or a defect of hydrogen atoms are identified as the charge carriers. This measurement can be easily defined mathematically by means of sum of smooth step-like functions, also known as a switching function reports an example of such measurement by means of a rational switching function counting how many atoms in position are within the sphere of the atom . Here and are two coefficients that control the smoothness of the function, runs over the hydrogen atoms and is the cutoff radius of the sphere.
Compared to standard chemical reactions, here the elusive nature of water ions makes their description through well-defined structures not possible and their identification only via hydronium and hydroxide ions leads to errors in the way these species are counted. In order to overcome these limitations and to build a generally applicable set of CVs we have made two conceptual steps: indistinguishable acid-base sites are considered as participating as a whole to the reaction and, once tessellated the entire space in Voronoi-like polyhedra centered on these sites, different protonation states are distinguished by the number of hydrogen atoms that contain.
The first point circumvents the problem of determining well defined geometries of the charged species. Reactants and products are in this way defined according to local anomalies in the density of hydrogen atoms without fixing beforehand any reference structures. The second one ensures a very precise and unambiguous way of evaluating this density. Figure below clearly shows how this way of partitioning the space does not present any of the previous limitations.
According to Voronoi principle, the total number of hydrogen atoms assigned to a given site is taken as the fraction of protons that are much closer to that site than from all the other ones. Then, the whole space is tessellated in polyhedra centered on any atom able to break or form bonds with an acid proton and every hydrogen atom is assigned unequivocally to one and only one of these polyhedra. This provides a very robust definition for assigning hydrogen atoms to the respective acid-base site without arbitrarily defining parameters and avoiding ambiguities when tiling the space.
However, a function to be used in enhanced sampling method must be continuous and differentiable everywhere. Unfortunately, the standard Voronoi partition is described by a set of step functions,
so that if the site is the closest to the hydrogen and 0 otherwise. In proximity of the Voronoi boundaries the function derivatives, and therefore any force imposed by an external potential, are ill defined.
In order to reproduce the same behaviour of above equation without discontinuity or singularity, index functions are replaced by soft-max functions .
In the case of glycine tautomerism, several charge defects are possible: the -NH and -COO groups in glycine, and the two water self-ions (hydronium HO and hydroxide OH). We then sum the charge contained in each polyhedron. If the charge in a polyhedron is different from zero, we attribute the charge defect to the atom N or O that is at the polyhedron center, and we distinguish the O defects according to whether they are centered on water or glycine oxygen ( and ).
To count the number of H atoms centered on O or N atom , we use the formula where the first sum is over all H atoms with index , and is the index of Voronoi centers. The parameter regulates the smoothness of the function.
The charge defect number of the Voronoi center is calculated by subtracting the reference number (the original proton number connecting to the atom at the polyhedron center) from H number . where we take for the water oxygen () and the glycine nitrogen (N), while for the carboxylic oxygens () we set on account of the symmetry between two oxygen atoms.
Since HO and OH can be present in water, we use the CV to promote the self-ionization processes, and its value varies from 0 to 2.
The CV is supplemented by a CV that represents the distance between HO and OH,
While can distinguish between the pure water state () and autoionization state (), it is hard to identify the initial proton transfer which corresponds to a value nearly concentrated around zero. Therefore, we use a piecewise logarithmic function where is a regularization parameter.
As the system is neutral overall, the [C] and [A] forms of glycine are compensated by OH and HO, respectively. Then, to distinguish between the [C]OH pair, [Z][N] states and [A]HO pair, we define a CV that can identify these protonation states, by combining the charge defects of both glycine and water. where for [C]OH, for [A]HO, and in the other two cases.
Since cannot distinguish the [Z] and [N] forms, we introduce another CV to estimate the charge-charge distance that is capable of separating the two cases. The is measured in terms of the distances between and N and between and , as well as the average distance between N and , as follows where is the modulus distance between the two Voronoi centers. The CV will approximately be 0 and 3 Å in the [N] and [Z] forms, respectively, and will become larger than \textasciitilde 3 Å in the other two cases.
Based on OPES with well-defined Voronoi CVs, we can obtain accurate free energy surfaces that distinguish all possible glycine configurations and their transition processes.
When adapting Voronoi CVs () to distinguish glycine states, it is important to note that these CVs can be derived with respect to the atomic coordinates (), as indicated in the following equation. where the implementation of the first term has been completed within the PLUMED plugin, we will proceed to describe the second term in detail and make corresponding changes to the code. For clarity and ease of understanding, we will focus on the relatively complicated part of the Voronoi CVs, as represented by the following equation. If there is a need to obtain derivatives for other CVs (, , , and ), such calculations can be easily achieved by applying the derivative chain rule. Next, we will illustrate the process using the example of distinguishing autoionization in pure water without any prior information. In this scenario, all O atoms act as Voronoi centers and form a defined group. To identify the protonation states of all O atoms, all nearby H atoms around the target O atom (labeled as OH) and all nearby O atoms around the target H atom (labeled as OHO) are searched.
In OH stage, if H (selected in H) is a neighbor of a given target O, the derivative for H coordinate is governed by the following equation. For simplicity, some small terms are ignored. If H (selected in H) is a neighbor of a non-target Voronoi center O (), the derivative for H coordinate is given by the following equation. For simplicity, some small terms are ignored.
In OHO stage, if O (selected in O) is a neighbor of H, the derivative for O coordinate is governed by the following equation If O (selected in O, ) is a neighbor of H, the derivative for O coordinate is given by the following equation At this point, all derivatives with respect to the coordinates have been determined. In addition, we incorporate the neighbor list function, which efficiently searches for nearby atoms, reducing the computational complexity from to . In particular, a neighbor list with a cutoff Å is set, updated at each step, to search for atoms in groups.
🎉 If you can't get enough of above, then you may want to read it:
Available code of Voronoi CVs:
- https://archive.materialscloud.org/record/2021.20
- https://github.com/Zhang-pchao/GlycineTautomerism/tree/main/Voronoi_collective_variables
Application of the Voronoi CVs:
- Intramolecular and water mediated tautomerism of solvated glycine
- Propensity of water self-ions at air(oil)-water interfaces revealed by deep potential molecular dynamics with enhanced sampling
- Toturial: Intramolecular and water mediated tautomerism of solvated glycine
Notebooks: