A study on predicting the cofactors of oxidoreductases based on different sequence features

Zhang Guangya, Ge Huihua, Fang Baishan
DOI: https://doi.org/10.3969/j.issn.1001-4160.2008.05.008
2008-01-01
Abstract: Extracting features from protein sequences is an important task in bioinformatics. In this paper, eight feature extraction methods were tested with different classifiers to predict the cofactors of oxidoreductases, with interesting results. The average prediction accuracy of amino acid composition was only 64.96%, the lowest among the eight feature extraction methods. However, when the support vector machine was used as the classifier and the features of amphiphilic pseudo amino acid composition and new amino acid composition distribution were fused, the prediction accuracy was the highest, reaching 92.93%. In addition, our results showed that there is a matching problem between the feature extraction method and the classifier: the highest accuracy can be reached only when the best match between them is found.
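As a rough illustration of the simplest feature named in the abstract, the sketch below computes an amino acid composition vector (the frequency of each of the 20 standard residues) and fuses it with a second feature vector. The abstract does not say how the features were fused, so plain concatenation is an assumption here. The function names, the toy sequence, and the placeholder second vector are hypothetical and are not taken from the paper.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return a 20-dimensional list of residue frequencies."""
    # Count only standard residues; other characters (e.g. 'X') are ignored.
    counts = Counter(ch for ch in sequence.upper() if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def fuse_features(*feature_vectors):
    """Fuse feature vectors by concatenation (an assumption, not the paper's stated method)."""
    fused = []
    for vector in feature_vectors:
        fused.extend(vector)
    return fused


toy_sequence = "MKTAYIAKQR"       # hypothetical toy sequence, not real data
aac = amino_acid_composition(toy_sequence)
other_features = [0.1, 0.3]       # placeholder for a second feature set
fused = fuse_features(aac, other_features)
```

The fused vector could then be passed to a classifier such as a support vector machine. The sketch stops before that step because the abstract gives no kernel or parameter settings.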