MDFit: Automated molecular simulations workflow enables high throughput assessment of ligands-protein dynamics

Sirish Kaushik Lakkaraju,Alexander Brueckner,Benjamin Shields,Palani Kirubakaran,Alexander Suponya,Manoranjan Panda,Shana Posy,Stephen Johnson
DOI: https://doi.org/10.26434/chemrxiv-2024-gfcqx
2024-01-24
Abstract:Molecular dynamics (MD) simulation is a powerful tool for characterizing ligand-protein conformational dynamics and offers significant advantages over docking and other rigid structure-based computational methods. However, setting up, running, and analyzing MD simulations continues to be a multi-step process making it cumbersome to assess a library of ligands using MD. We present an automated workflow that streamlines setting up, running, and analyzing Desmond MD simulations. The workflow takes a library of pre-docked ligands and a protein structure as input, sets up and runs MD with each protein-ligand complex, and generates simulation fingerprints for each ligand. Simulation fingerprints (SimFP) capture protein-ligand compatibility, including stability of different ligand-pocket interactions and other useful metrics that enable easy rank-ordering of the ligand library for pocket optimization. SimFP from a ligand library can also be used to build machine learning (ML) models that can predict binding assay outcomes and automatically infer important interactions. Unlike relative free-energy methods that are constrained to assess ligands with high chemical similarity, ML models based on SimFPs can accommodate diverse ligand sets. We present a case study on how SimFP helps delineate structure-activity relationship (SAR) trends and explain potency differences across matched-molecular pairs of cyclic peptides targeting the PD-L1 protein.
Chemistry
What problem does this paper attempt to address?
The problem this paper attempts to address is the complexity and inefficiency of molecular dynamics (MD) simulations in the drug discovery process. Specifically, the traditional MD simulation setup, execution, and analysis process is very complex and requires multiple steps, making it difficult to routinely use MD simulations when optimizing compounds. Additionally, existing relative free energy methods are limited by the chemical similarity of compounds and cannot handle diverse ligand sets. To address these issues, the authors propose an automated molecular simulation workflow (MDFit) that simplifies the setup, execution, and analysis of MD simulations. MDFit can take pre-docked ligand libraries and protein structures as input, automatically set up and run MD simulations for each protein-ligand complex, and generate simulation fingerprints (SimFP) for each ligand. These SimFPs capture the compatibility of the protein-ligand interactions, including the stability of different ligand-pocket interactions and other useful metrics, allowing for easy ranking of the ligand library to optimize the pocket. SimFPs can also be used to build machine learning (ML) models to predict binding experimental results and automatically infer important interactions. Through this approach, MDFit not only improves the efficiency of MD simulations but also enables accurate prediction and optimization across a broader set of ligands, not just chemically similar compounds. The paper demonstrates how MDFit helps elucidate structure-activity relationship (SAR) trends and explains potency differences between matched molecular pairs (MMPs), particularly for cyclic peptides targeting the PD-L1 protein.