DeepLPI: a novel deep learning-based model for protein–ligand interaction prediction for drug repurposing

Bomin Wei,Yue Zhang,Xiang Gong
DOI: https://doi.org/10.1038/s41598-022-23014-1
IF: 4.6
2022-10-29
Scientific Reports
Abstract:The substantial cost of new drug research and development has consistently posed a huge burden for both pharmaceutical companies and patients. In order to lower the expenditure and development failure rate, repurposing existing and approved drugs by identifying interactions between drug molecules and target proteins based on computational methods have gained growing attention. Here, we propose the DeepLPI, a novel deep learning-based model that mainly consists of ResNet-based 1-dimensional convolutional neural network (1D CNN) and bi-directional long short term memory network (biLSTM), to establish an end-to-end framework for protein–ligand interaction prediction. We first encode the raw drug molecular sequences and target protein sequences into dense vector representations, which go through two ResNet-based 1D CNN modules to derive features, respectively. The extracted feature vectors are concatenated and further fed into the biLSTM network, followed by the MLP module to finally predict protein–ligand interaction. We downloaded the well-known BindingDB and Davis dataset for training and testing our DeepLPI model. We also applied DeepLPI on a COVID-19 dataset for externally evaluating the prediction ability of DeepLPI. To benchmark our model, we compared our DeepLPI with the baseline methods of DeepCDA and DeepDTA, and observed that our DeepLPI outperformed these methods, suggesting the high accuracy of the DeepLPI towards protein–ligand interaction prediction. The high prediction performance of DeepLPI on the different datasets displayed its high capability of protein–ligand interaction in generalization, demonstrating that the DeepLPI has the potential to pinpoint new drug-target interactions and to find better destinations for proven drugs.
multidisciplinary sciences
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve The paper aims to address the issues of high costs, long cycles, and low success rates in new drug development. To reduce development costs and failure rates, researchers are increasingly focusing on identifying interactions between drug molecules and target proteins through computational methods, thereby repurposing existing and approved drugs. This paper proposes a new deep learning-based model—DeepLPI, for predicting protein-ligand interactions (PLI) to facilitate drug repurposing. Specifically, the DeepLPI model mainly consists of the following parts: 1. **ResNet-based 1D Convolutional Neural Network (1D CNN)**: Used to extract features from raw drug molecule sequences and target protein sequences. 2. **Bidirectional Long Short-Term Memory Network (biLSTM)**: Used to process the extracted feature vectors, further enhancing the model's predictive capability. 3. **Multilayer Perceptron (MLP)**: Finally used for binary classification prediction to determine whether protein-ligand interactions occur. The model encodes the raw drug molecule sequences and target protein sequences, converting them into dense vector representations, and then extracts features through two ResNet-based 1D CNN modules respectively. The extracted feature vectors are concatenated and input into the biLSTM network, and finally, predictions are made through the MLP module. To validate the effectiveness of the model, researchers used the well-known BindingDB and Davis datasets for training and testing, and conducted external evaluations on the COVID-19 dataset. Experimental results show that DeepLPI performs excellently in predicting protein-ligand interactions, outperforming existing baseline methods DeepCDA and DeepDTA. This indicates that DeepLPI has high predictive performance and generalization ability, and is expected to discover new drug-target interactions in new drug development.