TSpred: a robust prediction framework for TCR-epitope interactions using paired chain TCR sequence data

Ha Young Kim,Sungsik Kim,Woong-Yang Park,Dongsup Kim
DOI: https://doi.org/10.1093/bioinformatics/btae472
IF: 5.8
2024-08-02
Bioinformatics
Abstract:Motivation: Prediction of T-cell receptor (TCR)-epitope interactions is important for many applications in biomedical research, such as cancer immunotherapy and vaccine design. The prediction of TCR-epitope interactions remains challenging especially for novel epitopes, due to the scarcity of available data. Results: We propose TSpred, a new deep learning approach for the pan-specific prediction of TCR binding specificity based on paired chain TCR data. We develop a robust model that generalizes well to unseen epitopes by combining the predictive power of CNN and the attention mechanism. In particular, we design a reciprocal attention mechanism which focuses on extracting the patterns underlying TCR-epitope interactions. Upon a comprehensive evaluation of our model, we find that TSpred achieves state-of-the-art performances in both seen and unseen epitope specificity prediction tasks. Also, compared to other predictors, TSpred is more robust to bias related to peptide imbalance in the dataset. In addition, the reciprocal attention component of our model allows for model interpretability by capturing structurally important binding regions. Results indicate that TSpred is a robust and reliable method for the task of TCR-epitope binding prediction. Availability and implementation: Source code is available at https://github.com/ha01994/TSpred.
What problem does this paper attempt to address?