Abstract:Protein-ligand interactions are increasingly profiled at high-throughput, playing a vital role in lead compound discovery and drug optimization. Accurate prediction of binding pose and binding affinity constitutes a pivotal challenge in advancing our computational understanding of protein-ligand interactions. However, inherent limitations still exist, including high computational cost for conformational search sampling in traditional molecular docking tools, and the unsatisfactory molecular representation learning and intermolecular interaction modeling in deep learning-based methods. Here we propose a geometry-aware attention-based deep learning model, GAABind, which effectively predicts the pocket-ligand binding pose and binding affinity within a multi-task learning framework. Specifically, GAABind comprehensively captures the geometric and topological properties of both binding pockets and ligands, and employs expressive molecular representation learning to model intramolecular interactions. Moreover, GAABind proficiently learns the intermolecular many-body interactions and simulates the dynamic conformational adaptations of the ligand during its interaction with the protein through meticulously designed networks. We trained GAABind on the PDBbindv2020 and evaluated it on the CASF2016 dataset; the results indicate that GAABind achieves state-of-the-art performance in binding pose prediction and shows comparable binding affinity prediction performance. Notably, GAABind achieves a success rate of 82.8% in binding pose prediction, and the Pearson correlation between predicted and experimental binding affinities reaches up to 0.803. Additionally, we assessed GAABind's performance on the severe acute respiratory syndrome coronavirus 2 main protease cross-docking dataset. In this evaluation, GAABind demonstrates a notable success rate of 76.5% in binding pose prediction and achieves the highest Pearson correlation coefficient in binding affinity prediction compared with all baseline methods.

LigBind: Identifying Binding Residues for Over 1000 Ligands with Relation-A-ware Graph Neural Networks

GraphBind: protein structural context embedded rules learned by hierarchical graph neural networks for recognizing nucleic-acid-binding residues

AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands

Improving the generalizability of protein-ligand binding predictions with AI-Bind

Harnessing Pre-trained Models for Accurate Prediction of Protein-Ligand Binding Affinity

GAABind: a Geometry-Aware Attention-Based Network for Accurate Protein-Ligand Binding Pose and Binding Affinity Prediction

Leak Proof PDBBind: A Reorganized Dataset of Protein-Ligand Complexes for More Generalizable Binding Affinity Prediction

BindWeb: A web server for ligand binding residue and pocket prediction from protein structures

DeepBindGCN: Integrating Molecular Vector Representation with Graph Convolutional Neural Networks for Protein–Ligand Interaction Prediction

Protein Ligand-Specific Binding Residue Predictions by an Ensemble Classifier

PDBBind Optimization to Create a High-Quality Protein-Ligand Binding Dataset for Binding Affinity Prediction

PGBind: pocket-guided explicit attention learning for protein–ligand docking

On Machine Learning Approaches for Protein-Ligand Binding Affinity Prediction

PLANET: A Multi-Objective Graph Neural Network Model for Protein–Ligand Binding Affinity Prediction

DEELIG: A Deep Learning Approach to Predict Protein-Ligand Binding Affinity

Binding Affinity Prediction: From Conventional to Machine Learning-Based Approaches

DynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model

A Spatial-Temporal Graph Attention Network for Protein-Ligand Binding Affinity Prediction Based on Molecular Geometry

Learning Binding Affinities via Fine-tuning of Protein and Ligand Language Models

CrossBind: Collaborative Cross-Modal Identification of Protein Nucleic-Acid-Binding Residues