Abstract:Background As the major histocompatibility complex (MHC), human leukocyte antigens (HLAs) are one of the most polymorphic genes in humans. Patients carrying certain HLA alleles may develop adverse drug reactions (ADRs) after taking specific drugs. Peptides play an important role in HLA related ADRs as they are the necessary co-binders of HLAs with drugs. Many experimental data have been generated for understanding HLA-peptide binding. However, efficiently utilizing the data for understanding and accurately predicting HLA-peptide binding is challenging. Therefore, we developed a network analysis based method to understand and predict HLA-peptide binding. Methods Qualitative Class I HLA-peptide binding data were harvested and prepared from four major databases. An HLA-peptide binding network was constructed from this dataset and modules were identified by the fast greedy modularity optimization algorithm. To examine the significance of signals in the yielded models, the modularity was compared with the modularity values generated from 1,000 random networks. The peptides and HLAs in the modules were characterized by similarity analysis. The neighbor-edges based and unbiased leverage algorithm (Nebula) was developed for predicting HLA-peptide binding. Leave-one-out (LOO) validations and two-fold cross-validations were conducted to evaluate the performance of Nebula using the constructed HLA-peptide binding network. Results Nine modules were identified from analyzing the HLA-peptide binding network with a highest modularity compared to all the random networks. Peptide length and functional side chains of amino acids at certain positions of the peptides were different among the modules. HLA sequences were module dependent to some extent. Nebula archived an overall prediction accuracy of 0.816 in the LOO validations and average accuracy of 0.795 in the two-fold cross-validations and outperformed the method reported in the literature. Conclusions Network analysis is a useful approach for analyzing large and sparse datasets such as the HLA-peptide binding dataset. The modules identified from the network analysis clustered peptides and HLAs with similar sequences and properties of amino acids. Nebula performed well in the predictions of HLA-peptide binding. We demonstrated that network analysis coupled with Nebula is an efficient approach to understand and predict HLA-peptide binding interactions and thus, could further our understanding of ADRs.

Predictive Bayesian neural network models of MHC class II peptide binding

Ranking-based Convolutional Neural Network Models for Peptide-MHC Binding Prediction

Machine learning application to predict binding affinity between peptide containing non-canonical amino acids and HLA0201

Improving Prediction of MHC Class I Binding Peptides with Additional Binding Data

Peptide-binding specificity prediction using fine-tuned protein structure prediction networks

HLA class I binding prediction via convolutional neural networks

Improving MHC Binding Peptide Prediction by Incorporating Binding Data of Auxiliary MHC Molecules

Classifying antimicrobial and multifunctional peptides with Bayesian network models

Trans-Allelic Model for Prediction of Peptide:MHC-II Interactions

Peptide binding predictions for HLA DR, DP and DQ molecules

IMGT/RobustpMHC: robust training for class-I MHC peptide binding prediction

MHC2MIL: a Novel Multiple Instance Learning Based Method for MHC-II Peptide Binding Prediction by Considering Peptide Flanking Region and Residue Positions

DeepMHCII: a novel binding core-aware deep interaction model for accurate MHC-II peptide binding affinity prediction

NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data

NetMHCpan-4.0: Improved Peptide–MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data

Understanding and predicting binding between human leukocyte antigens (HLAs) and peptides by network analysis

MetaMHC: a meta approach to predict peptides binding to MHC molecules.

DeepMHCI: an anchor position-aware deep interaction model for accurate MHC-I peptide binding affinity prediction

Evaluating NetMHCpan performance on non-European HLA alleles not present in training data

Prediction of MHC-binding Peptides of Flexible Lengths from Sequence-Derived Structural and Physicochemical Properties

Prediction of human major histocompatibility complex class II binding peptides by continuous kernel discrimination method.