Prediction of transmembrane segments based on fuzzy cluster analysis of amino acids

Yong Deng, Qi Liu, Yixue Li
DOI: https://doi.org/10.3321/j.issn:0567-7351.2004.19.022
2004-01-01
Acta Chimica Sinica
Abstract: Transmembrane protein sequences are poorly conserved during evolution; even two homologous proteins have a low level of sequence identity. Consequently, the common practice of selecting training sequences by sequence identity cannot efficiently reduce sampling bias in transmembrane segment prediction. To address this problem, this paper presents a new prediction algorithm based on fuzzy cluster analysis of amino acids. The algorithm clusters the amino acids into groups according to the similarity of their distributions in different regions, then makes predictions from the distribution properties of each group rather than those of each individual amino acid. The results show that the new algorithm reduces, to some extent, the impact of training-sequence selection on the prediction results and thus improves prediction accuracy.
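The clustering step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which fuzzy clustering procedure the authors use, so standard fuzzy c-means is assumed here as a stand-in. The function names (`region_profiles`, `fuzzy_c_means`), the region labels (`M` for membrane, `i` for inside, `o` for outside), and the toy sequences are all hypothetical and chosen only for illustration; they are not data or code from the paper. The group-based prediction step is not sketched.

```python
import math
from collections import Counter, defaultdict


def region_profiles(sequences, annotations):
    """For each amino acid, return the fraction of its occurrences per region.

    `annotations` holds one label per residue (hypothetical labels here).
    """
    counts = defaultdict(Counter)
    regions = sorted({r for ann in annotations for r in ann})
    for seq, ann in zip(sequences, annotations):
        for aa, region in zip(seq, ann):
            counts[aa][region] += 1
    profiles = {}
    for aa, ctr in counts.items():
        total = sum(ctr.values())
        profiles[aa] = [ctr[r] / total for r in regions]
    return regions, profiles


def fuzzy_c_means(points, c, m=2.0, iters=100, eps=1e-9):
    """Standard fuzzy c-means (an assumed stand-in for the paper's method).

    Returns (centers, memberships); memberships[i][k] is the degree to which
    point i belongs to cluster k, and each row sums to 1.
    """
    n, dim = len(points), len(points[0])
    # Deterministic initialisation: take evenly spaced points as centers.
    centers = [list(points[(k * n) // c]) for k in range(c)]
    u = [[0.0] * c for _ in range(n)]
    for _ in range(iters):
        # Membership update: inverse-distance weighting with fuzzifier m.
        for i, p in enumerate(points):
            inv = [(math.dist(p, ctr) + eps) ** (-2.0 / (m - 1.0)) for ctr in centers]
            s = sum(inv)
            u[i] = [v / s for v in inv]
        # Center update: mean of the points weighted by membership ** m.
        for k in range(c):
            w = [u[i][k] ** m for i in range(n)]
            total = sum(w)
            centers[k] = [
                sum(w[i] * points[i][j] for i in range(n)) / total
                for j in range(dim)
            ]
    return centers, u


# Toy, made-up input purely for illustration.
sequences = ["LLIVAKRDLLIV", "KDELIVLLARKD"]
annotations = ["MMMMMoooMMMM", "iiiMMMMMMooo"]

regions, profiles = region_profiles(sequences, annotations)
names = sorted(profiles)
centers, memberships = fuzzy_c_means([profiles[a] for a in names], c=2)

# Hard assignment: each amino acid goes to its highest-membership cluster.
hard = {a: max(range(2), key=lambda k: memberships[i][k]) for i, a in enumerate(names)}
```

Clustering on per-region frequency profiles, rather than on raw sequence identity, mirrors the abstract's idea of grouping amino acids by how similarly they are distributed across regions. The number of clusters and the fuzzifier `m` are free parameters in this sketch.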