Protein domain boundary prediction by combining support vector machine and domain guess by size algorithm

Dong Qiwen, Wang Xiaolong, Lin Lei
2007-01-01
Abstract: Successful prediction of protein domain boundaries provides valuable information not only for the computational structure prediction of multi-domain proteins but also for experimental structure determination. A novel method for domain boundary prediction is presented, which combines a support vector machine with the domain guess by size algorithm. Since the evolutionary information of multiple domains can be detected by the position-specific score matrix, the support vector machine is trained and tested using the values of the position-specific score matrix generated by PSI-BLAST. Candidate domain boundaries are selected from the output of the support vector machine and are then passed to the domain guess by size algorithm, which gives the final domain boundary prediction. Experimental results show that the combined method outperforms both the support vector machine and the domain guess by size algorithm used individually.
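The two data-handling steps around the classifier can be illustrated with a minimal sketch: building a per-residue feature vector from a window of position-specific score matrix rows, and picking candidate boundaries from per-residue classifier scores. The abstract does not give these details, so everything here is an assumption made for illustration: the function names, the sliding-window encoding, the zero padding, the score threshold, and the minimum separation between candidates. The support vector machine itself and the domain guess by size step are not implemented.

```python
from typing import List, Sequence


def window_features(pssm: Sequence[Sequence[float]],
                    center: int,
                    half_width: int) -> List[float]:
    """Flatten the PSSM rows in a window around `center`.

    Positions that fall outside the sequence are zero-padded
    (an assumption; the abstract does not describe the encoding).
    """
    n_cols = len(pssm[0])
    features: List[float] = []
    for pos in range(center - half_width, center + half_width + 1):
        if 0 <= pos < len(pssm):
            features.extend(pssm[pos])
        else:
            features.extend([0.0] * n_cols)
    return features


def candidate_boundaries(scores: Sequence[float],
                         threshold: float,
                         min_separation: int) -> List[int]:
    """Select candidate boundary residues from per-residue scores.

    Residues scoring above `threshold` are visited from highest to
    lowest score; one is kept only if it lies at least
    `min_separation` residues from every candidate already kept.
    Both parameters are hypothetical, not taken from the paper.
    """
    ranked = sorted(
        (i for i, s in enumerate(scores) if s > threshold),
        key=lambda i: scores[i],
        reverse=True,
    )
    chosen: List[int] = []
    for i in ranked:
        if all(abs(i - j) >= min_separation for j in chosen):
            chosen.append(i)
    return sorted(chosen)
```

In a full pipeline, `window_features` would supply the classifier input for each residue, and the indices returned by `candidate_boundaries` would be the positions handed to the second-stage algorithm for the final decision.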