Active Learning Guided Hit Optimization for the Leucine-Rich Repeat Kinase 2 WDR Domain Based on In Silico Ligand Binding Affinities

Olexandr Isayev,Filipp Gusev,Evgeny Gutkin,Francesco Gentile,Fuqiang Ban,S. Benjamin Koby,Fengling Li,Irene Chau,Suzanne Ackloo,Cheryl H. Arrowsmith,Albina Bolotokova,Pegah Ghiabi,Elisa Gibson,Levon Halabelian,Scott Houliston,Rachel J. Harding,Ashley Hutchinson,Peter Loppnau,Sumera Perveen,Almagul Seitova,Hong Zeng,Matthieu Schapira,Artem Cherkasov,Maria G. Kurnikova
DOI: https://doi.org/10.26434/chemrxiv-2024-jv0rx
2024-10-04
Abstract:The leucine-rich repeat kinase 2 (LRRK2) is the most mutated gene in familial Parkinson’s disease, whose mutations lead to pathogenic hallmarks of the disease. The LRRK2 WDR domain is an understudied drug target for Parkinson’s disease with no known inhibitors prior to the first phase of the Critical Assessment of Computational Hit-Finding Experiments (CACHE) Challenge. CACHE challenges are designed to attract state-of-the-art computational methods for both hit-finding and lead optimization of small molecule inhibitors to challenging protein targets. A unique advantage of the CACHE challenge is that the predicted molecules are experimentally validated in-house. Here we report on our winning submission of experimentally confirmed LRRK2 WDR inhibitor molecules, predicted from thermodynamics integration (TI) calculations performed on only 672 compounds within a chemical space of 25,171 molecules. We used a free energy molecular dynamics (MD) -based active learning (AL) workflow to optimize our two previously confirmed hit molecules. We identified 8 experimentally verified novel inhibitors out of 35 tested (23% hit rate) with a maximum affinity increase of almost 35-fold. These results demonstrate the efficacy of free energy-based active learning workflow to quickly and efficiently explore large chemical spaces while minimizing the number and length of computational simulations. This workflow is widely applicable to the screening of any chemical space for small molecule analogs with increased affinity, subject to the general constraints of RBFE calculations. The mean absolute error of TI MD calculations was 1.30 kcal/mol with respect to measured KD of hit compounds.
Chemistry
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to discover small - molecule inhibitors for the WDR domain of Leucine - rich repeat kinase 2 (LRRK2). The LRRK2 gene is the most frequently mutated gene in familial Parkinson's disease, and its mutations can lead to the pathological features of the disease. Although there have been research reports on inhibitors and PROTACs of the LRRK2 kinase domain, prior to this study, there were no small - molecule inhibitors for the LRRK2 WDR domain. Therefore, this study aims to screen out small - molecule inhibitors that can bind to the LRRK2 WDR domain from a large number of commercially available compounds through computational methods and machine - learning techniques, and optimize the binding affinities of these inhibitors to increase their potential as drug candidates. Specifically, the researchers utilized a workflow based on Active Learning (AL), combined with Relative Binding Free Energy (RBFE) calculations, to optimize two previously identified LRRK2 WDR domain - binding molecules (Hit 1 and Hit 2). Through this method, they screened out approximately 25,000 analogs from a library containing approximately 550 million commercially available compounds, and finally performed RBFE calculations on 672 analogs. Based on the calculation results, 75 molecules were selected for experimental verification, among which 35 molecules were further tested, and finally 8 new inhibitors were identified, with the binding affinity of one molecule being nearly 35 times higher than that of the initial Hit. This study not only demonstrates how to efficiently explore large - scale chemical spaces using computational methods, but also provides an important foundation for drug development with the LRRK2 WDR domain as a therapeutic target for Parkinson's disease.