BAPULM: Binding Affinity Prediction using Language Models

Radheesh Sharma Meda, Amir Barati Farimani
2024-11-06
Abstract: Identifying drug-target interactions is essential for developing effective therapeutics. Binding affinity quantifies these interactions, and traditional approaches rely on computationally intensive 3D structural data. In contrast, language models can efficiently process sequential data, offering an alternative approach to molecular representation. In the current study, we introduce BAPULM, an innovative sequence-based framework that leverages the chemical latent representations of proteins via ProtT5-XL-U50 and ligands through MolFormer, eliminating reliance on complex 3D configurations. Our approach was validated extensively on benchmark datasets, achieving scoring power (R) values of 0.925 $\pm$ 0.043, 0.914 $\pm$ 0.004, and 0.8132 $\pm$ 0.001 on benchmark1k2101, Test2016_290, and CSAR-HiQ_36, respectively. These findings indicate the robustness and accuracy of BAPULM across diverse datasets and underscore the potential of sequence-based models in in-silico drug discovery, offering a scalable alternative to 3D-centric methods for screening potential ligands.
Quantitative Methods, Machine Learning
What problem does this paper attempt to address?
This paper addresses how to predict protein-ligand binding affinity efficiently and accurately during drug development. Traditional prediction methods rely on computationally intensive 3D structural data, which limits their use in large-scale screening of candidate drug molecules. Language models, by contrast, process sequence data efficiently and offer an alternative form of molecular representation. The paper therefore proposes BAPULM (Binding Affinity Prediction using Language Models), a sequence-based framework that uses latent representations of proteins obtained from ProtT5-XL-U50 and of ligands obtained from MolFormer, removing the dependence on complex 3D configurations. BAPULM aims to improve both the accuracy and the efficiency of binding affinity prediction and to provide a scalable alternative for virtual drug screening.
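
The scoring power (R) quoted in the abstract is conventionally the Pearson correlation coefficient between predicted and experimentally measured affinities. The sketch below shows how such a value could be computed; it is a minimal illustration and not the authors' evaluation code. The function name `pearson_r` and the affinity values are made up for this example.

```python
import math


def pearson_r(predicted, experimental):
    """Pearson correlation between predicted and experimental affinities."""
    if len(predicted) != len(experimental) or len(predicted) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    n = len(predicted)
    mean_p = sum(predicted) / n
    mean_e = sum(experimental) / n
    # Covariance and variances (unnormalised; the 1/n factors cancel in the ratio).
    cov = sum((p - mean_p) * (e - mean_e) for p, e in zip(predicted, experimental))
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    var_e = sum((e - mean_e) ** 2 for e in experimental)
    if var_p == 0 or var_e == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_p * var_e)


# Hypothetical affinity values, for illustration only (not from the paper).
experimental = [4.2, 6.8, 5.1, 7.9, 3.3]
predicted = [4.5, 6.4, 5.6, 7.2, 3.9]

r = pearson_r(predicted, experimental)
print(f"scoring power R = {r:.3f}")
```

A value close to 1 means the predicted affinities track the experimental ones almost linearly. The $\pm$ figures in the abstract would then correspond to the spread of R over repeated runs, although the exact protocol is not stated here.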