Prediction of ketoacyl synthase family using reduced amino acid alphabets

Wei Chen, Pengmian Feng, Hao Lin
DOI: https://doi.org/10.1007/s10295-011-1047-z
2011-01-01
Journal of Industrial Microbiology & Biotechnology
Abstract: Ketoacyl synthases are enzymes involved in fatty acid synthesis and can be classified into five families based on primary sequence similarity. Different families have different catalytic mechanisms. Developing cost-effective computational models to identify the family of a ketoacyl synthase will be helpful for enzyme engineering and for understanding individual enzymes’ catalytic mechanisms. In this work, a support vector machine-based method was developed to predict ketoacyl synthase family using the n-peptide composition of reduced amino acid alphabets. In jackknife cross-validation, the model based on the 2-peptide composition of a reduced amino acid alphabet of size 13 yielded the best overall accuracy of 96.44% with an average accuracy of 93.36%, which is superior to other state-of-the-art methods. This result suggests that the n-peptide composition of reduced amino acid alphabets offers an efficient means for enzyme family classification and that the proposed model can be used efficiently for ketoacyl synthase family annotation.
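To make the feature described in the abstract concrete, the following is a minimal sketch of an n-peptide composition computed over a reduced amino acid alphabet. The abstract does not list the 13 clusters, so the grouping `HYPOTHETICAL_GROUPS` below is an illustrative assumption and not the alphabet used by the authors; the function names are likewise hypothetical. With 13 groups and n = 2 the feature vector has 13 × 13 = 169 entries.

```python
from itertools import product

# ASSUMPTION: an illustrative 13-group reduction of the 20 standard amino acids.
# This is NOT the alphabet from the paper, which the abstract does not specify.
HYPOTHETICAL_GROUPS = ["LVIM", "C", "A", "G", "ST", "P", "FY",
                       "W", "EQ", "D", "N", "KR", "H"]


def build_mapping(groups):
    """Map each amino acid to the first letter of its group."""
    return {aa: group[0] for group in groups for aa in group}


def reduce_sequence(seq, mapping):
    """Rewrite a sequence in the reduced alphabet.

    Residues outside the mapping (e.g. 'X') are dropped, which is a
    simplification chosen for this sketch.
    """
    return "".join(mapping[aa] for aa in seq.upper() if aa in mapping)


def n_peptide_composition(seq, groups, n=2):
    """Return the frequencies of all overlapping n-peptides in the reduced sequence."""
    mapping = build_mapping(groups)
    reduced = reduce_sequence(seq, mapping)
    letters = [group[0] for group in groups]
    keys = ["".join(p) for p in product(letters, repeat=n)]
    counts = dict.fromkeys(keys, 0)
    total = max(len(reduced) - n + 1, 0)
    for i in range(total):
        counts[reduced[i:i + n]] += 1
    # Normalise counts to frequencies; an all-zero vector if the sequence is too short.
    return [counts[k] / total if total else 0.0 for k in keys]


features = n_peptide_composition("MKVLAAGIVG", HYPOTHETICAL_GROUPS, n=2)
```

A vector like `features` could then be passed to a support vector machine classifier; the kernel and parameters used in the paper are not given in the abstract, so none are assumed here.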