Protein Domains Prediction Method Based on Support Vector Machines

邹淑雪,黄艳新,李艳文,周春光
DOI: https://doi.org/10.3321/j.issn:1671-5489.2008.05.024
2008-01-01
Abstract:Guessing the boundaries of structural domains has been an important and challenging problem in experiment and computational structural biology.A promising method for detecting the domain structure of a protein from sequence information alone was presented.The method is based on analyzing multiple sequence alignments that are derived from a database search.Multiple measures were defined to quantify the domain information content of each position along the sequence and were combined into a single predictor using support vector machines.The overall accuracy of the method for a single protein chains dataset is about 85%.The result demonstrates that the utility of the method can help not only predict the complete 3D structure of a protein but also study proteins' building blocks of functional analysis.
What problem does this paper attempt to address?