Learning Binding Affinities via Fine-tuning of Protein and Ligand Language Models

Rohan Gorantla, Aryo Pradipta Gema, Ian Xi Yang, Álvaro Serrano-Morrás, Benjamin Suutari, Jordi Juárez Jiménez, Antonia S. J. S. Mey
DOI: https://doi.org/10.1101/2024.11.01.621495
2024-11-01
Abstract: Accurate in-silico prediction of protein-ligand binding affinity is essential for efficient hit identification in large molecular libraries. Commonly used structure-based methods such as giga-docking often fail to rank compounds effectively, and free energy-based approaches, while accurate, are too computationally intensive for large-scale screening. Existing deep learning models struggle to generalize to new targets or drugs, and current evaluation methods do not reflect real-world performance accurately. We introduce BALM, a deep learning framework that predicts binding affinity using pretrained protein and ligand language models. BALM learns experimental binding affinities by optimizing cosine similarity in a shared embedding space. We also propose improved evaluation strategies with diverse data splits and metrics to better assess model performance. Using the BindingDB dataset, BALM shows strong generalization to unseen drugs, scaffolds, and targets. It excels in few-shot scenarios for targets such as USP7 and Mpro, outperforming traditional machine learning and docking methods, including AutoDock Vina. Adoption of the target-based evaluation methods proposed will allow for more stringent evaluation of machine learning-based scoring tools. Frameworks such as BALM show good performance, are computationally efficient, and are highly adaptable within this evaluation setting, making them practical tools for early-stage drug discovery screening.
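The abstract's idea of learning affinities through cosine similarity in a shared embedding space can be illustrated with a minimal sketch. The abstract does not state the loss function or how affinities are scaled, so the following are assumptions made only for illustration: affinities (for example pKd values) are mapped linearly onto the cosine range [-1, 1], and the training signal is the mean squared error between that target and the cosine similarity of the two embeddings. The function names, the bounds `lo` and `hi`, and the toy vectors are hypothetical; in practice the vectors would come from projection layers on top of the protein and ligand language models.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def scale_affinity(value, lo, hi):
    """Map an affinity in [lo, hi] linearly onto [-1, 1] (assumed scaling)."""
    return 2.0 * (value - lo) / (hi - lo) - 1.0

def unscale_affinity(cos, lo, hi):
    """Inverse of scale_affinity: turn a cosine score back into an affinity."""
    return (cos + 1.0) / 2.0 * (hi - lo) + lo

def cosine_mse_loss(protein_embs, ligand_embs, affinities, lo, hi):
    """Mean squared error between cosine similarity and the scaled affinity."""
    errors = []
    for p, l, y in zip(protein_embs, ligand_embs, affinities):
        pred = cosine_similarity(p, l)
        target = scale_affinity(y, lo, hi)
        errors.append((pred - target) ** 2)
    return sum(errors) / len(errors)

# Toy embeddings standing in for projected language-model outputs.
proteins = [[1.0, 0.0], [0.0, 1.0]]
ligands = [[1.0, 0.0], [1.0, 0.0]]
affinities = [8.0, 5.0]  # hypothetical pKd values, with assumed bounds 2 and 8
loss = cosine_mse_loss(proteins, ligands, affinities, lo=2.0, hi=8.0)
```

In this toy case the first pair is perfectly aligned (cosine 1, matching the top of the assumed affinity range) and the second pair is orthogonal (cosine 0, matching the midpoint), so the loss is zero. At inference time, `unscale_affinity` would convert a predicted cosine score back into affinity units.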
Bioinformatics
What problem does this paper attempt to address?
The paper addresses the challenge of accurately predicting protein-ligand binding affinities so that hit compounds can be identified efficiently in large molecular libraries. Specifically, structure-based methods such as giga-docking often rank compounds poorly, and free-energy-based methods, while accurate, are too computationally expensive for large-scale screening. Existing deep-learning models generalize poorly to new targets or drugs, and current evaluation methods do not accurately reflect real-world performance.

To address these issues, the authors introduce BALM (Binding Affinity via Language Models), a deep-learning framework that uses pre-trained protein and ligand language models to predict binding affinities. BALM learns experimental binding affinities by optimizing cosine similarity in a shared embedding space, and the authors propose an improved evaluation strategy to better assess model performance. On the BindingDB dataset, BALM generalizes well to unseen drugs, scaffolds, and targets, and in few-shot scenarios it outperforms traditional machine-learning and docking methods for targets such as USP7 and Mpro.

### Main contributions:

1. **Model level**: Introduced BALM, a sequence-based deep-learning method that uses pre-trained protein and ligand language models to predict binding affinities.
2. **Data level**: Systematically studied the performance of BALM on the BindingDB dataset across zero-shot (cold-drug and cold-target), scaffold, and random splits.
3. **Evaluation level**: Proposed an improved evaluation strategy, including diverse data splits and metrics, to assess model performance more accurately.

### Experimental results:

- **Random split**: BALM outperforms the baseline models on all evaluation metrics, with the largest gains in Pearson correlation coefficient and root-mean-square error (RMSE).
- **Cold target split**: BALM performs significantly better than the baseline models, showing stronger generalization to unseen targets.
- **Cold drug split**: BALM's Pearson and Spearman correlation coefficients increase by 79.8% and 82%, respectively.
- **Scaffold split**: BALM also outperforms the baseline models, especially in Pearson and Spearman correlation coefficients.

### Parameter-efficient fine-tuning (PEFT):

- Parameter-efficient fine-tuning methods (such as LoHa and LoKr) further improve the performance of BALM. With the best fine-tuning method, BALM+PEFT shows significant advantages on all data splits and evaluation metrics.

### Zero-shot performance:

- Zero-shot performance varies significantly among targets, and commonly used cumulative metrics (such as overall Pearson or Spearman correlation coefficients) may overestimate the reliability of the model. Fisher-transformed correlations reflect the model's performance on individual targets more realistically.

Overall, this paper addresses the limitations of existing methods in binding affinity prediction by introducing the BALM framework and an improved evaluation strategy, providing a practical tool for early drug discovery screening.
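The point about cumulative metrics can be made concrete with a small sketch. This summary does not spell out the exact aggregation procedure, so the code below assumes one common approach: compute a Pearson correlation per target, apply the Fisher z-transform (`atanh`), average in z-space, and transform back with `tanh`. The function names, the clipping constant `eps`, the minimum group size, and the toy records are all hypothetical.

```python
import math
from collections import defaultdict

def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (non-zero variance assumed)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def fisher_mean_correlation(records, eps=1e-6, min_points=3):
    """Average per-target Pearson correlations in Fisher z-space.

    records: iterable of (target_id, measured_affinity, predicted_affinity).
    Returns (back-transformed mean correlation, per-target correlations).
    """
    by_target = defaultdict(list)
    for target, y_true, y_pred in records:
        by_target[target].append((y_true, y_pred))

    per_target = {}
    z_values = []
    for target, pairs in by_target.items():
        if len(pairs) < min_points:
            continue  # too few points for a meaningful correlation
        y_true, y_pred = zip(*pairs)
        r = pearson(y_true, y_pred)
        r = max(-1.0 + eps, min(1.0 - eps, r))  # keep atanh finite at r = +/-1
        per_target[target] = r
        z_values.append(math.atanh(r))
    return math.tanh(sum(z_values) / len(z_values)), per_target

# Toy data: predictions rank target A perfectly and target B in reverse.
records = [
    ("A", 5.0, 5.0), ("A", 6.0, 6.0), ("A", 7.0, 7.0),
    ("B", 8.0, 10.0), ("B", 9.0, 9.0), ("B", 10.0, 8.0),
]
pooled_r = pearson([r[1] for r in records], [r[2] for r in records])
mean_r, per_target_r = fisher_mean_correlation(records)
```

In this toy example the pooled correlation is about 0.77, because the two targets occupy different affinity ranges, while the Fisher-averaged per-target correlation is 0: the model ranks ligands correctly for one target and in reverse for the other. The pooled number therefore overstates how useful the predictions would be for ranking compounds against any single target.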