Inactive-enriched machine-learning models exploiting patent data improve structure-based virtual screening for PDL1 dimerizers

Pablo Gómez-Sacristán, Saw Simeon, Viet-Khoa Tran-Nguyen, Sachin Patil, Pedro J. Ballester
DOI: https://doi.org/10.1016/j.jare.2024.01.024
IF: 12.822
2024-01-27
Journal of Advanced Research
Abstract:

Highlights
• New machine-learning scoring functions were developed to predict PD1/PDL1 inhibitors.
• PDL1-specific regressors exploiting large volumes of inactive-enriched data and PLEC fingerprints as features were the most predictive for this important target.
• These new scoring functions are made available free of charge for prospective use.

Introduction: Small-molecule Programmed Cell Death Protein 1/Programmed Death-Ligand 1 (PD1/PDL1) inhibition via PDL1 dimerization has the potential to lead to inexpensive drugs with better cancer patient outcomes and milder side effects. However, this therapeutic approach has proven challenging, with only one PDL1 dimerizer reaching early clinical trials so far. There is hence a need for fast and accurate methods to develop alternative PDL1 dimerizers.

Objectives: We aim to show that structure-based virtual screening (SBVS) based on PDL1-specific machine-learning (ML) scoring functions (SFs) is a powerful drug design tool for detecting PD1/PDL1 inhibitors via PDL1 dimerization.

Methods: By incorporating the latest MLSF advances, we generated and evaluated PDL1-specific MLSFs (classifiers and inactive-enriched regressors) on two demanding test sets.

Results: 60 PDL1-specific MLSFs (30 classifiers and 30 regressors) were generated. Our large-scale analysis provides highly predictive PDL1-specific MLSFs that benefitted from training with large volumes of docked inactives, which enabled inactive-enriched regression.

Conclusion: PDL1-specific MLSFs strongly outperformed generic SFs of various types on this target and are released here without restrictions.
multidisciplinary sciences
What problem does this paper attempt to address?