Classification of membrane protein using a novel method of feature extraction and support vector machine

Zhang Shaowu, Pan Quan, Cheng Yongmei, Shi Jianyu
DOI: https://doi.org/10.3969/j.issn.1001-4160.2006.04.002
2006-01-01
Abstract: A weighting scheme is introduced to form a novel feature extraction method, the weighted auto-correlation function method, for representing protein sequences. The support vector machine (SVM) algorithm is combined with this feature extraction method, and two classification strategies ('one-versus-rest' and 'one-versus-one') are used to classify membrane proteins. The results are significantly improved. With the same SVM and the 'one-versus-rest' strategy, the results based on the weighted auto-correlation function method are better than those based on the amino acid composition method. In the jackknife test, the total accuracy and the accuracy for lipid chain-anchored membrane proteins are 87.98% and 65.85%, which are 3.38 and 9.75 percentage points higher, respectively, than those of the amino acid composition method. The total accuracy of the 'one-versus-one' strategy reaches up to 94.88% in the jackknife test, which is 6.9 percentage points higher than that of the 'one-versus-rest' strategy. The classification performance of the SVM is superior to that of the Bayes covariant discriminant algorithm: the total accuracy of the SVM is higher by as much as 15.6 percentage points.
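The abstract does not give the formula for the weighted auto-correlation function, so the following is only an illustrative sketch of one common way such features are built, not the authors' method. It assumes the Kyte-Doolittle hydropathy scale as the physicochemical index, a plain lagged product average as the auto-correlation, and a single hypothetical `weight` factor that scales the correlation terms relative to the amino acid composition. The names `autocorrelation`, `weighted_features`, `max_lag` and `weight` are all introduced here for illustration.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Kyte-Doolittle hydropathy values (assumed index; the paper's choice is not stated).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def autocorrelation(seq, max_lag):
    """Average product of index values at residues `lag` positions apart."""
    if len(seq) <= max_lag:
        raise ValueError("sequence must be longer than max_lag")
    if any(residue not in HYDROPATHY for residue in seq):
        raise ValueError("sequence contains a non-standard residue")
    h = [HYDROPATHY[residue] for residue in seq]
    n = len(h)
    return [
        sum(h[i] * h[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def weighted_features(seq, max_lag=5, weight=0.1):
    """20 composition fractions followed by `max_lag` weighted correlation terms."""
    correlation = [weight * r for r in autocorrelation(seq, max_lag)]
    counts = Counter(seq)
    composition = [counts[a] / len(seq) for a in AMINO_ACIDS]
    return composition + correlation
```

Each sequence then maps to a fixed-length vector of 20 + `max_lag` values that could be passed to any SVM implementation. Setting `weight=0` makes the correlation terms vanish, leaving only the amino acid composition baseline that the abstract compares against.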