Prediction of Paratope–Epitope Pairs Using Convolutional Neural Networks

Dong Li, Fabrizio Pucci, Marianne Rooman
DOI: https://doi.org/10.3390/ijms25105434
IF: 5.6
2024-05-17
International Journal of Molecular Sciences
Abstract: Antibodies play a central role in the adaptive immune response of vertebrates through the specific recognition of exogenous or endogenous antigens. The rational design of antibodies has a wide range of biotechnological and medical applications, such as in disease diagnosis and treatment. However, there are currently no reliable methods for predicting the antibodies that recognize a specific antigen region (or epitope) and, conversely, epitopes that recognize the binding region of a given antibody (or paratope). To fill this gap, we developed ImaPEp, a machine learning-based tool for predicting the binding probability of paratope–epitope pairs, where the epitope and paratope patches were simplified into interacting two-dimensional patches, which were colored according to the values of selected features, and pixelated. The specific recognition of an epitope image by a paratope image was achieved by using a convolutional neural network-based model, which was trained on a set of two-dimensional paratope–epitope images derived from experimental structures of antibody–antigen complexes. Our method achieves good performance in cross-validation, with a balanced accuracy of 0.8. Finally, we showcase examples of application of ImaPEp, including extensive screening of large libraries to identify paratope candidates that bind to a selected epitope, and rescoring and refining antibody–antigen docking poses.
biochemistry & molecular biology,chemistry, multidisciplinary
What problem does this paper attempt to address?
The paper addresses the lack of reliable methods for predicting which antibodies recognize a specific antigen region (epitope) and, conversely, which epitopes are recognized by the binding region (paratope) of a given antibody. To this end, the authors developed ImaPEp, a tool based on a convolutional neural network (CNN) that predicts the binding probability of paratope–epitope pairs. A more detailed summary follows.

### Research Background

Antibodies play a crucial role in the adaptive immune response by specifically recognizing exogenous or endogenous antigens. Rational antibody design has a wide range of applications in biotechnology and medicine, such as disease diagnosis and treatment. However, no reliable method currently exists for predicting the antibodies that recognize a given epitope, or the epitopes recognized by a given paratope.

### Solution

To address this problem, the authors developed ImaPEp, a machine-learning-based tool for predicting the binding probability of paratope–epitope pairs. The method works as follows:

1. **Data Representation**: Epitopes and paratopes are simplified into interacting two-dimensional patches, which are colored according to the values of selected features and then pixelated.
2. **Model Training**: A CNN model is trained on two-dimensional paratope–epitope images derived from experimental structures of antibody–antigen complexes.
3. **Performance Evaluation**: Model performance is evaluated by cross-validation, with a balanced accuracy (BAC) of 0.8.

### Main Contributions

- **Improved Prediction Method**: Compared with existing methods, ImaPEp predicts entire paratope–epitope pairs rather than individual epitope and paratope residues, which improves the prediction accuracy.
- **Introduction of Structural Information**: Unlike sequence-only methods, ImaPEp takes structural information into account, which is crucial for antibody–antigen recognition.
- **Simplified Representation and Shallow Network**: ImaPEp adopts a simplified representation and a shallow neural network architecture, making the method faster, less prone to overfitting, and suitable for large-scale screening of antibody–antigen complexes.

### Application Examples

- **Large-scale Screening**: ImaPEp is applied to the screening of large libraries to identify paratope candidates that bind to a selected epitope.
- **Rescoring and Refinement**: ImaPEp is used to rescore and refine antibody–antigen docking poses.

In conclusion, this research addresses a gap in antibody–antigen binding prediction by developing ImaPEp, providing a new tool for antibody design.
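
To make the balanced accuracy (BAC) figure reported above concrete, the following minimal sketch computes BAC as the mean of sensitivity and specificity for a binary binder/non-binder classifier. It is not taken from ImaPEp: the function name `balanced_accuracy`, the toy scores and labels, and the 0.5 decision threshold are all assumptions chosen purely for illustration.

```python
def balanced_accuracy(y_true, y_pred):
    """Return the mean of sensitivity (TPR) and specificity (TNR).

    Labels are binary: 1 = binding pair, 0 = non-binding pair.
    """
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    sensitivity = tp / (tp + fn)  # fraction of true binders recovered
    specificity = tn / (tn + fp)  # fraction of non-binders rejected
    return (sensitivity + specificity) / 2


# Hypothetical predicted binding probabilities and true labels (toy data).
scores = [0.9, 0.8, 0.7, 0.4, 0.2, 0.1, 0.3, 0.45, 0.05, 0.6]
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]

# Assumed decision threshold of 0.5; the paper's threshold is not stated here.
y_pred = [int(s >= 0.5) for s in scores]

bac = balanced_accuracy(y_true, y_pred)
print(f"balanced accuracy = {bac:.3f}")
```

Because BAC averages the per-class recall, it is not inflated when non-binding pairs greatly outnumber binding pairs, which is one common reason to report it instead of plain accuracy.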