Predicting Lipase Types by Improved Chous Pseudo-Amino Acid Composition

Guang-Ya Zhang,Hong-Chun Li,Jia-Qiang Gao,Bai-Shan Fang
DOI: https://doi.org/10.2174/092986608786071184
2008-01-01
Abstract:By proposing a improved Chou's pseudo amino acid composition approach to extract the features of the sequences, a powerful predictor based on k-nearest neighbor was introduced to identify the types of lipases according to their sequences. To avoid redundancy and bias, demonstrations were performed on a dataset where none of the proteins has > or =25% sequence identity to any other. The overall success rate thus obtained by the 10-fold cross-validation test was over 90%, indicating that the improved Chou's pseudo amino acid composition might be a useful tool for extracting the features of protein sequences, or at lease can play a complementary role to many of the other existing approaches.
What problem does this paper attempt to address?