The Effectiveness of Ensemble Learning Based on Four Different Classifiers for Predicting Membrane Protein Types

Mingyuan Li,Shunfang Wang,Lei Guo
DOI: https://doi.org/10.1109/ccdc.2018.8408150
2018-01-01
Abstract:It is an important task to predict the types of membrane protein in the field of bioinformatics. With the development of technology, machine learning techniques were widely utilized in bioinformatics. Individual classifier shows its inflexibility and lack of robustness which was used for many existing methods, so this is the reason why ensemble learning comes into view. What ensemble members we proposed in this study comprises of Logistic Regression (LR), Support Vector Machine (SVM) with RBF kernel, Support Vector Machine with linear kernel and Neural Network(NN). We chose these ensemble members for their different pros and cons of performance on membrane protein types prediction. Then we used logarithmic weighting to combine them and got the final prediction model. Experiments were carried out on benchmark dataset that contains a training set, a hold-out cross-validation set, and a test set. The accuracy and average f1-score we got on test set are 0.9201 and 0.9251 respectively. It can be seen that we got better performance than every individual classifier and also surpassed the performance of existing method.
What problem does this paper attempt to address?