Improving PSI-BLAST's Fold Recognition Performance through Combining Consensus Sequences and Support Vector Machine

Ren-Xiang Yan, Jing Liu, Yi-Min Tao
DOI: https://doi.org/10.4018/978-1-60960-064-8.ch005
2011-01-01
Abstract: Profile-profile alignment may be the most sensitive and useful computational approach for identifying remote homologies and recognizing protein folds. However, it is usually much more complex and slower than sequence-sequence or profile-sequence alignment. A profile, or PSSM (position-specific scoring matrix), represents the mutational variability at each sequence position of a protein as a vector of amino acid substitution frequencies, and is therefore a much richer encoding than the protein sequence alone. A consensus sequence, which can be regarded as a simplified profile, was used in early work to improve sequence alignment accuracy. More recently, several studies have used consensus sequence information to improve PSI-BLAST's fold recognition performance. Because there are several ways to compute a consensus sequence, in this chapter we propose a method that combines the information from different types of consensus sequences with the assistance of support vector machine learning. Benchmark results suggest that our method can further improve PSI-BLAST's fold recognition performance.
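As an illustration of one simple way a consensus sequence can be computed, the sketch below takes the most frequent non-gap residue in each column of a multiple sequence alignment. This is a minimal majority-vote example and not necessarily any of the consensus definitions used in the chapter; the function name `majority_consensus`, the gap symbol, and the alphabetical tie-breaking rule are assumptions made for illustration.

```python
from collections import Counter


def majority_consensus(aligned_seqs, gap="-"):
    """Return the most frequent non-gap residue in each alignment column.

    Hypothetical helper: a plain majority vote, with ties broken
    alphabetically so the result is deterministic.
    """
    if not aligned_seqs:
        return ""
    length = len(aligned_seqs[0])
    if any(len(seq) != length for seq in aligned_seqs):
        raise ValueError("all aligned sequences must have the same length")

    consensus = []
    for column in zip(*aligned_seqs):
        # Count residues in this column, ignoring gap characters.
        counts = Counter(res for res in column if res != gap)
        if not counts:
            # Column contains only gaps: keep a gap in the consensus.
            consensus.append(gap)
            continue
        # Highest count wins; alphabetical order breaks ties.
        best = min(counts, key=lambda res: (-counts[res], res))
        consensus.append(best)
    return "".join(consensus)


if __name__ == "__main__":
    alignment = ["ACDE", "ACDF", "GC-E"]
    print(majority_consensus(alignment))
```

Other definitions weight the sequences or use profile scores rather than raw counts, which is why different consensus sequences for the same protein can carry complementary information.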