PepCA: Unveiling protein-peptide interaction sites with a multi-input neural network model

Junxiong Huang, Weikang Li, Bin Xiao, Chunqing Zhao, Hancheng Zheng, Yingrui Li, Jun Wang
DOI: https://doi.org/10.1016/j.isci.2024.110850
IF: 5.8
2024-08-30
iScience
Abstract: Protein-peptide interactions play a pivotal role in fields such as drug development, yet they remain underexplored experimentally and challenging to model computationally. Herein, we introduce PepCA, a sequence-based approach for predicting peptide-binding sites on proteins. A primary obstacle in predicting protein-peptide interactions is the difficulty of acquiring precise protein structures, coupled with the uncertainty of peptide conformations. To address this, we first encode protein sequences using the Evolutionary Scale Modeling 2 (ESM-2) pre-trained model to extract latent structural information. Additionally, we have developed a multi-input co-attention mechanism that concurrently updates the encodings of both peptide and protein residues. PepCA integrates this module within an encoder-decoder structure. The model identifies binding sites with high precision, offering insights for peptide drug development and protein science.
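To make the co-attention idea in the abstract concrete, here is a minimal sketch in plain Python. It is not the PepCA implementation: the function names (`attend`, `co_attention`), the scaled dot-product scoring, and the residual update are all assumptions chosen for illustration, and random vectors stand in for the ESM-2 protein embeddings and the peptide encodings. The only property taken from the abstract is that both sets of residue encodings are updated concurrently, each attending to the other.

```python
import math
import random


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attend(queries, keys):
    """For each query vector, return a softmax-weighted average of the key vectors."""
    scale = math.sqrt(len(queries[0]))
    contexts = []
    for q in queries:
        weights = softmax([dot(q, k) / scale for k in keys])
        contexts.append(
            [sum(w * k[j] for w, k in zip(weights, keys)) for j in range(len(q))]
        )
    return contexts


def co_attention(protein, peptide):
    """Update both encodings at once; each side attends to the ORIGINAL other side.

    Assumption: a residual (additive) update with no learned projections.
    A trained model would add learned query/key/value weights.
    """
    protein_ctx = attend(protein, peptide)  # one context vector per protein residue
    peptide_ctx = attend(peptide, protein)  # one context vector per peptide residue
    new_protein = [[a + b for a, b in zip(r, c)] for r, c in zip(protein, protein_ctx)]
    new_peptide = [[a + b for a, b in zip(r, c)] for r, c in zip(peptide, peptide_ctx)]
    return new_protein, new_peptide


if __name__ == "__main__":
    random.seed(0)
    dim = 8  # hypothetical embedding size; real ESM-2 embeddings are much larger
    protein = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(20)]
    peptide = [[random.gauss(0, 1) for _ in range(dim)] for _ in range(5)]
    new_protein, new_peptide = co_attention(protein, peptide)
    print(len(new_protein), len(new_protein[0]), len(new_peptide), len(new_peptide[0]))
```

In a binding-site predictor of this kind, the updated protein residue vectors would then feed a per-residue classifier that scores each position as binding or non-binding; that decoder step is omitted here.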