PREDICTION OF RESIDUE SOLVENT ACCESSIBILITY IN PROTEIN WITH SUPPORT VECTOR MACHINE

WANG Xian, LI Ao, WANG Ming-hui, FENG Huan-qing
DOI: https://doi.org/10.3321/j.issn:1000-6737.2005.01.008
2005-01-01
ACTA BIOPHYSICA SINICA
Abstract: Residues in protein sequences can be divided into two classes (exposed/buried) or three classes (exposed/intermediate/buried) according to their relative solvent accessibility. Several window lengths and parameter settings were explored to achieve the best performance. The prediction accuracies of the support vector machine (SVM) at different cut-off thresholds were analyzed and compared with those of other methods; on the same dataset, the SVM outperformed both the neural network and information theory approaches. The best accuracy reached 79.0% for the two-class problem and 67.5% for the three-class problem. These results show that the support vector machine is an effective method for predicting protein solvent accessibility.
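The abstract does not state the cut-off values or the input encoding, so the following is only a minimal sketch of how the two steps it mentions (assigning accessibility classes by threshold, and building a sliding-window input around each residue) might look. The cut-offs (25% for two classes, 9% and 36% for three classes), the 15-residue window, the one-hot encoding with a padding flag, and all function names are illustrative assumptions, not the authors' settings.

```python
from typing import List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def label_two_class(rsa: float, cutoff: float = 25.0) -> str:
    """Label a residue by relative solvent accessibility (percent).

    The 25% cut-off is an assumed example value, not taken from the paper.
    """
    return "exposed" if rsa >= cutoff else "buried"


def label_three_class(rsa: float, low: float = 9.0, high: float = 36.0) -> str:
    """Three-class labelling; both cut-offs are assumed example values."""
    if rsa < low:
        return "buried"
    if rsa < high:
        return "intermediate"
    return "exposed"


def encode_window(sequence: str, center: int, window: int = 15) -> List[int]:
    """One-hot encode an odd-length window centred on position `center`.

    Each window position contributes 21 values: 20 amino-acid indicators
    plus one flag for padding (positions past either end of the chain)
    or non-standard residues.
    """
    if window % 2 == 0:
        raise ValueError("window length must be odd")
    half = window // 2
    features: List[int] = []
    for pos in range(center - half, center + half + 1):
        bits = [0] * 21
        if 0 <= pos < len(sequence) and sequence[pos] in AMINO_ACIDS:
            bits[AMINO_ACIDS.index(sequence[pos])] = 1
        else:
            bits[20] = 1  # padding or unknown residue
        features.extend(bits)
    return features
```

Each residue then yields one feature vector of length 21 × window and one class label, which could be passed to any SVM trainer. Varying `window` and the cut-offs corresponds to the kind of exploration the abstract describes.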