LogitBoost classifier for discriminating thermophilic and mesophilic proteins.

Guangya Zhang, Baishan Fang
DOI: https://doi.org/10.1016/j.jbiotec.2006.07.020
IF: 3.595
2007-01-01
Journal of Biotechnology
Abstract: A novel classifier, LogitBoost, was introduced to discriminate thermophilic from mesophilic proteins according to their primary structures. With the 20-amino-acid composition as the feature vector, the overall accuracies of the self-consistency check and of five-fold cross-validation were 97.0% and 86.6%, respectively. To test whether the method applies to a wider range of biological targets, an independent test dataset was also used, on which the LogitBoost-based method achieved an overall classification accuracy of 88.9%. Across the three validation approaches, LogitBoost outperformed AdaBoost and performed comparably with an RBF neural network and a support vector machine. The influence of protein size on discrimination was also addressed.
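The 20-amino-acid composition feature vector mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' code: the function name `aa_composition`, the alphabetical residue ordering, and the choice to skip non-standard symbols are all assumptions made for illustration, and the example sequence is invented.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (ordering is an assumption).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def aa_composition(sequence: str) -> list[float]:
    """Return the fraction of each standard amino acid in a protein sequence.

    Hypothetical helper: symbols outside the 20 standard residues
    (for example X or gaps) are ignored rather than counted.
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


# Invented 10-residue example, not taken from the paper's datasets.
features = aa_composition("MKVLAAGIEK")
```

Each protein is thereby reduced to a fixed-length vector of 20 fractions that sum to 1, which any classifier (LogitBoost, AdaBoost, an SVM) could take as input regardless of sequence length.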