MATE-Pred: Multimodal Attention-based TCR-Epitope interaction Predictor

Etienne Goffinet,Raghvendra Mall,Ankita Singh,Rahul Kaushik,Filippo Castiglione
2023-12-05
Abstract:An accurate binding affinity prediction between T-cell receptors and epitopes contributes decisively to develop successful immunotherapy strategies. Some state-of-the-art computational methods implement deep learning techniques by integrating evolutionary features to convert the amino acid residues of cell receptors and epitope sequences into numerical values, while some other methods employ pre-trained language models to summarize the embedding vectors at the amino acid residue level to obtain sequence-wise representations.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
### What problem does this paper attempt to solve? This paper aims to solve the problem of accurately predicting the binding affinity between T - cell receptors (TCRs) and epitopes. Specifically, the researchers developed a new multimodal attention - based mechanism model - MATE - Pred (Multimodal Attention - based TCR - Epitope interaction Predictor) to improve the prediction accuracy of TCR - epitope interactions. #### Background and Significance 1. **Importance of Immunotherapy** - The specific binding between T - cell receptors (TCRs) and antigen epitopes is a crucial step in activating T - cells and initiating an immune response. - Accurately predicting this binding affinity is essential for the development of successful immunotherapy strategies, especially in personalized medicine and vaccine design. 2. **Limitations of Existing Methods** - Existing computational methods mainly rely on deep - learning techniques, converting amino acid residues into numerical representations by integrating evolutionary features or using pre - trained language models. - Although these methods have certain effects, their performance is limited when dealing with new, unseen epitopes, especially in cross - species prediction. #### Innovations of MATE - Pred 1. **Multimodal Fusion** - MATE - Pred introduces a multimodal representation method, combining three different types of features: - **Text Representation**: Embed the text representation of proteins using a pre - trained bidirectional encoder model. - **Physicochemical Properties**: Include a series of selected physicochemical attributes. - **Contact Map**: The predicted contact map estimates the 3D distance of amino acid residues in the sequence. 2. **Attention Mechanism** - By introducing an attention mechanism, MATE - Pred can effectively capture contextual information, physicochemical information, and structural information, thus predicting binding affinity more accurately. #### Experimental Results 1. **Performance Improvement** - Compared with the baseline model, MATE - Pred shows significant improvements in multiple evaluation metrics, such as MCC (+8.4%) and AUC (+5.5%). 2. **Validation in an Independent Test Set** - In an independent test set containing rare epitopes and MHC class II epitopes, MATE - Pred shows better generalization ability, further demonstrating its potential in practical applications. In conclusion, this paper addresses the deficiencies of existing methods in predicting TCR - epitope binding affinity by proposing the MATE - Pred model, providing new tools and technical support for the development of immunotherapy.