Abstract:Accurate in-silico prediction of protein-ligand binding affinity is essential for efficient hit identification in large molecular libraries. Commonly used structure-based methods such as giga-docking often fail to rank compounds effectively, and free energy-based approaches, while accurate, are too computationally intensive for large-scale screening. Existing deep learning models struggle to generalize to new targets or drugs, and current evaluation methods do not reflect real-world performance accurately. We introduce BALM, a deep learning framework that predicts binding affinity using pretrained protein and ligand language models. BALM learns experimental binding affinities by optimizing cosine similarity in a shared embedding space. We also propose improved evaluation strategies with diverse data splits and metrics to better assess model performance. Using the BindingDB dataset, BALM shows strong generalization to unseen drugs, scaffolds, and targets. It excels in few-shot scenarios for targets such as USP7 and Mpro, outperforming traditional machine learning and docking methods, including AutoDock Vina. Adoption of the target-based evaluation methods proposed will allow for more stringent evaluation of machine learning-based scoring tools. Frameworks such as BALM show good performance, are computationally efficient, and are highly adaptable within this evaluation setting, making them practical tools for early-stage drug discovery screening.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the challenge of accurately predicting protein - ligand binding affinities when efficiently identifying hit compounds in large - scale molecular libraries. Specifically, existing structure - based methods such as giga - docking are not effective in compound ranking, and while free - energy - based methods are accurate, they are too computationally expensive for large - scale screening. Existing deep - learning models are difficult to generalize to new targets or drugs, and current evaluation methods do not accurately reflect actual performance. To address these issues, the authors introduced BALM (Binding Affinity via Language Models), a deep - learning framework that uses pre - trained protein and ligand language models to predict binding affinities. BALM learns experimental binding affinities by optimizing cosine similarity in a shared embedding space and proposes an improved evaluation strategy to better assess model performance. Using the BindingDB dataset, BALM has demonstrated strong generalization capabilities for unseen drugs, scaffolds, and targets, and outperforms traditional machine - learning and docking methods for targets such as USP7 and Mpro in few - shot scenarios. ### Main contributions: 1. **Model level**: Introduced BALM, a sequence - based deep - learning method that uses pre - trained protein and ligand language models to predict binding affinities. 2. **Data level**: Systematically studied the performance of BALM on the BindingDB dataset, including zero - shot (cold drugs and cold targets), scaffolds, and random splits. 3. **Evaluation level**: Proposed an improved evaluation strategy, including diverse data splits and metrics, to more accurately assess model performance. ### Experimental results: - **Random split**: BALM outperforms the baseline models on all evaluation metrics, especially showing significant advantages in Pearson correlation coefficient and root - mean - square error (RMSE). - **Cold target split**: BALM performs significantly better than the baseline models on cold target splits, showing stronger generalization ability. - **Cold drug split**: The Pearson correlation coefficient and Spearman correlation coefficient of BALM on cold drug splits are increased by 79.8% and 82% respectively. - **Scaffold split**: BALM also outperforms the baseline models on scaffold splits, especially in terms of Pearson correlation coefficient and Spearman correlation coefficient. ### Parameter - efficient fine - tuning (PEFT): - Through parameter - efficient fine - tuning methods (such as LoHa and LoKr), the performance of BALM is further improved. Combined with the best fine - tuning method, BALM+PEFT shows significant advantages on all data splits and evaluation metrics. ### Zero - shot performance: - Zero - shot performance varies significantly among different targets, and commonly used cumulative metrics (such as overall Pearson or Spearman correlation coefficients) may overestimate the reliability of the model. Through Fisher - transformed correlations, the performance of the model on different targets can be more realistically reflected. Overall, this paper addresses the limitations of existing methods in binding affinity prediction by introducing the BALM framework and an improved evaluation strategy, providing a practical tool for early drug discovery screening.

Learning Binding Affinities via Fine-tuning of Protein and Ligand Language Models

BAPULM: Binding Affinity Prediction using Language Models

Binding Affinity Prediction with 3D Machine Learning: Training Data and Challenging External Testing

On Machine Learning Approaches for Protein-Ligand Binding Affinity Prediction

DualBind: A Dual-Loss Framework for Protein-Ligand Binding Affinity Prediction

QuickBind: A Light-Weight And Interpretable Molecular Docking Model

Improved prediction of ligand-protein binding affinities by meta-modeling

Improving the generalizability of protein-ligand binding predictions with AI-Bind

Machine learning accelerates MD-based binding pose prediction between ligands and proteins

AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands

Binding Affinity Prediction: From Conventional to Machine Learning-Based Approaches

BigBind: Learning from Nonstructural Data for Structure-Based Virtual Screening

A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking

Synergistic Application of Molecular Docking and Machine Learning for Improved Binding Pose

DOCKSTRING: Easy Molecular Docking Yields Better Benchmarks for Ligand Design

GAABind: a Geometry-Aware Attention-Based Network for Accurate Protein-Ligand Binding Pose and Binding Affinity Prediction

Development and evaluation of a deep learning model for protein-ligand binding affinity prediction

Improved Protein–Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference

Binding affinity prediction for protein–ligand complex using deep attention mechanism based on intermolecular interactions

Harnessing Pre-trained Models for Accurate Prediction of Protein-Ligand Binding Affinity

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction