Crystal Polymorph Search in the NPT Ensemble via a Deposition/Sublimation Alchemical Path

Aaron J. Nessler, Okimasa Okada, Yuya Kinoshita, Koki Nishimura, Hiroomi Nagata, Kaori Fukuzawa, Etsuo Yonemochi, Michael J. Schnieders
DOI: https://doi.org/10.1021/acs.cgd.3c01358
IF: 4.01
2024-03-09
Crystal Growth & Design
Abstract: The formulation of active pharmaceutical ingredients involves discovering stable crystal packing arrangements or polymorphs, each of which has distinct pharmaceutically relevant properties. Traditional experimental screening techniques utilizing various conditions are commonly supplemented with in silico crystal structure prediction (CSP) to inform the crystallization process and mitigate risk. Predictions are often based on advanced classical force fields or quantum mechanical calculations that model the crystal potential energy landscape but do not fully incorporate temperature, pressure, or solution conditions during the search procedure. This study proposes an innovative alchemical path that utilizes an advanced polarizable atomic multipole force field to predict crystal structures based on direct sampling of the NPT ensemble. The use of alchemical (i.e., nonphysical) intermediates, a novel Monte Carlo barostat, and an orthogonal space tempering bias combine to enhance the sampling efficiency of the deposition/sublimation phase transition. The proposed algorithm was applied to 2-((4-(2-(3,4-dichlorophenyl)ethyl)phenyl)amino)benzoic acid (Cambridge Crystallography Database Centre ID: XAFPAY) as a case study to showcase the algorithm. Each experimentally determined polymorph with one molecule in the asymmetric unit was successfully reproduced via approximately 1000 short 1 ns simulations per space group where each simulation was initiated from random rigid body coordinates and unit cell parameters. Utilizing two threads of a recent Intel CPU (a Xeon Gold 6330 CPU at 2.00 GHz), 1 ns of sampling using the polarizable AMOEBA force field can be acquired in 4 h (equating to more than 300 ns/day using all 112 threads/56 cores of a dual CPU node) within the Force Field X software (https://ffx.biochem.uiowa.edu). These results demonstrate a step forward in the rigorous use of the NPT ensemble during the CSP search process and open the door to future algorithms that incorporate solution conditions using continuum solvation methods.
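
The throughput figure quoted in the abstract can be checked with a few lines of arithmetic. The sketch below assumes that independent simulations run side by side, each on two threads, and that per-simulation speed does not degrade when the node is fully loaded; the function and parameter names are illustrative and are not part of Force Field X.

```python
HOURS_PER_DAY = 24.0


def node_throughput_ns_per_day(ns_per_sim, hours_per_sim,
                               threads_per_sim, total_threads):
    """Aggregate sampling per day when independent simulations fill a node."""
    # Number of simulations that fit on the node at the same time.
    concurrent_sims = total_threads // threads_per_sim
    # Sampling rate of a single simulation, in ns/day.
    ns_per_day_per_sim = ns_per_sim * HOURS_PER_DAY / hours_per_sim
    return concurrent_sims * ns_per_day_per_sim


# Values from the abstract: 1 ns in 4 h on 2 threads, 112 threads per node.
throughput = node_throughput_ns_per_day(1.0, 4.0, 2, 112)
print(throughput)  # 336.0, i.e. "more than 300 ns/day"
```

Under the same assumptions, the roughly 1000 simulations of 1 ns used per space group amount to about three days of wall-clock time on one such node.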
chemistry, multidisciplinary; materials science; crystallography
What problem does this paper attempt to address?
### Problems the paper attempts to solve

The paper addresses the challenge of predicting the crystal polymorphs of active pharmaceutical ingredients (APIs). Specifically, it proposes predicting crystal structures by direct sampling of the NPT ensemble, so that temperature and pressure enter the search itself. Traditional methods usually rely on advanced classical force fields or quantum mechanical calculations to model the crystal potential energy landscape, but they do not fully incorporate temperature, pressure, or solution conditions during the search. The paper therefore proposes a new alchemical path that uses an advanced polarizable atomic multipole force field to predict crystal structures, combining nonphysical intermediates, a new Monte Carlo barostat, and an orthogonal space tempering bias to enhance the sampling efficiency of the deposition/sublimation phase transition.

### Main contributions

1. **Innovative alchemical path**: By introducing nonphysical intermediates, the paper proposes a new alchemical path that connects the vacuum state and the crystal state, simplifying the search of the free energy surface.
2. **Efficient sampling method**: Combining the orthogonal space tempering bias with the new Monte Carlo barostat improves sampling efficiency during the phase transition, making it possible to efficiently generate candidate polymorphs under different temperature and pressure conditions.
3. **Practical application case**: The paper demonstrates the algorithm on 2-((4-(2-(3,4-dichlorophenyl)ethyl)phenyl)amino)benzoic acid (XAFPAY). Every experimentally determined polymorph with one molecule in the asymmetric unit was successfully reproduced.

### Method overview

1. **Polarizable AMOEBA force field**: The paper uses the AMOEBA force field, which combines bonded and nonbonded interaction terms and supports molecular dynamics simulations and the calculation of free energy differences.
2. **Initial coordinates and force field parameters**: The three-dimensional coordinates of the molecule and the AMOEBA force field parameters are generated with the Poltype tool.
3. **Alchemical path**: Sampling proceeds from the vacuum state to the crystal state along the state parameter λ using the Generalized Ensemble (GE) free energy simulation method.
4. **Orthogonal space tempering**: An orthogonal space tempering bias is applied to overcome hidden free energy barriers and ensure a comprehensive exploration of the crystal free energy surface.
5. **Monte Carlo barostat**: A custom Monte Carlo barostat lets the lattice parameters fluctuate at constant pressure while respecting the lattice system constraints.
6. **Screening and clustering**: NPT snapshots are screened by energy and density and then clustered with the PAC algorithm, which removes similar structures and reduces the burden of the subsequent DFT calculations.
7. **DFT optimization and ranking**: Finally, DFT is used to further optimize and rank the screened polymorphs.

### Results

The paper shows that all experimentally determined polymorphs of XAFPAY with one molecule in the asymmetric unit were successfully predicted by this method and that, after DFT optimization, the predicted structures agree closely with the experimental data. This indicates that the method is accurate and reliable for polymorph prediction, providing a new tool for drug formulation and materials science research.
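
The Monte Carlo barostat step in the method overview can be illustrated with the standard textbook Metropolis rule for volume moves in the NPT ensemble. This is a minimal sketch and not the paper's custom barostat: it assumes an orthorhombic cell with all angles fixed at 90°, uses isotropic scaling (which trivially preserves any lattice system constraint), and replaces the AMOEBA energy with a hypothetical `toy_energy` function. All names, units, and step sizes are illustrative.

```python
import math
import random

KB = 0.0019872041  # Boltzmann constant in kcal/(mol*K)
# Energy of 1 atm * 1 cubic angstrom, expressed in kcal/mol.
ATM_A3_TO_KCAL = 101325.0 * 1.0e-30 * 6.02214076e23 / 4184.0


def acceptance_probability(d_energy, v_old, v_new, n_molecules,
                           temperature, pressure_atm):
    """Metropolis acceptance for a trial volume change at constant N, P, T.

    Assumes uniform trial steps in volume and molecular centers that are
    rescaled together with the cell (rigid molecules).
    """
    beta = 1.0 / (KB * temperature)
    pv_work = pressure_atm * ATM_A3_TO_KCAL * (v_new - v_old)
    log_acc = (-beta * (d_energy + pv_work)
               + n_molecules * math.log(v_new / v_old))
    return 1.0 if log_acc >= 0.0 else math.exp(log_acc)


def barostat_step(cell, energy_fn, n_molecules, temperature,
                  pressure_atm, max_dv, rng):
    """Attempt one isotropic volume move; return (cell, accepted)."""
    a, b, c = cell
    v_old = a * b * c  # valid only because all cell angles are 90 degrees
    v_new = v_old + rng.uniform(-max_dv, max_dv)
    if v_new <= 0.0:
        return cell, False
    # Scale every axis by the same factor so axis ratios never change.
    scale = (v_new / v_old) ** (1.0 / 3.0)
    trial = (a * scale, b * scale, c * scale)
    d_energy = energy_fn(trial) - energy_fn(cell)
    p_acc = acceptance_probability(d_energy, v_old, v_new, n_molecules,
                                   temperature, pressure_atm)
    if rng.random() < p_acc:
        return trial, True
    return cell, False


def toy_energy(cell):
    """Hypothetical stand-in for a force field: harmonic well in volume."""
    a, b, c = cell
    return 0.001 * (a * b * c - 1000.0) ** 2


if __name__ == "__main__":
    rng = random.Random(0)
    cell = (8.0, 10.0, 14.0)  # angstroms
    accepted = 0
    for _ in range(2000):
        cell, ok = barostat_step(cell, toy_energy, 4, 298.15, 1.0, 20.0, rng)
        accepted += ok
    print("final volume:", cell[0] * cell[1] * cell[2])
    print("acceptance rate:", accepted / 2000)
```

A barostat that also varies individual cell lengths or angles would need moves specific to each lattice system (for example, keeping a = b in a tetragonal cell) and a matching acceptance rule; the isotropic move above sidesteps that at the cost of never changing the cell shape.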