EPMLR: Sequence-Based Linear B-cell Epitope Prediction Method Using Multiple Linear Regression

Yao Lian,Meng Ge,Xian-Ming Pan
DOI: https://doi.org/10.1186/s12859-014-0414-y
IF: 3.307
2014-01-01
BMC Bioinformatics
Abstract:BACKGROUND:B-cell epitopes have been studied extensively due to their immunological applications, such as peptide-based vaccine development, antibody production, and disease diagnosis and therapy. Despite several decades of research, the accurate prediction of linear B-cell epitopes has remained a challenging task.RESULTS:In this work, based on the antigen's primary sequence information, a novel linear B-cell epitope prediction model was developed using the multiple linear regression (MLR). A 10-fold cross-validation test on a large non-redundant dataset was performed to evaluate the performance of our model. To alleviate the problem caused by the noise of negative dataset, 300 experiments utilizing 300 sub-datasets were performed. We achieved overall sensitivity of 81.8%, precision of 64.1% and area under the receiver operating characteristic curve (AUC) of 0.728.CONCLUSIONS:We have presented a reliable method for the identification of linear B cell epitope using antigen's primary sequence information. Moreover, a web server EPMLR has been developed for linear B-cell epitope prediction: http://www.bioinfo.tsinghua.edu.cn/epitope/EPMLR/ .
What problem does this paper attempt to address?