A novel method for protein function prediction based on sequence numerical features

Ang Yang, Renfa Li, Wen Zhu, Guangxue Yue
2012-01-01
Abstract: Compared with costly and time-consuming biological experiments, computational approaches to predicting protein function are easier and more cost-efficient. In this work, we propose a feature vector, constructed by extracting numerical features from sequences based on their hydrophobicity, polarity, and charge properties, together with a function possibility of a sequence. The feature vector and function possibility are then used to predict protein function with the k-nearest neighbors (KNN) algorithm. Because it incorporates both local and global sequence information, our method avoids some problems of sequence-similarity-based methods. Our experimental results show that our method is more efficient.
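As a rough illustration of the general idea (numerical features derived from residue properties, followed by KNN classification), a minimal sketch might look like the following. It is not the authors' implementation: the residue groupings, the composition-only features, the function names `features` and `knn_predict`, and the toy labels are all assumptions made for illustration, and the paper's function possibility and local sequence information are not modeled here.

```python
import math
from collections import Counter

# Illustrative residue groups (assumed for this sketch, not taken from the paper).
HYDROPHOBIC = set("AVLIMFWC")
POLAR = set("STNQYHKRDE")
POSITIVE = set("KR")
NEGATIVE = set("DE")
GROUPS = (HYDROPHOBIC, POLAR, POSITIVE, NEGATIVE)


def features(seq):
    """Return the fraction of residues in each property group (global composition)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return [sum(res in group for res in seq) / len(seq) for group in GROUPS]


def knn_predict(train, query_seq, k=3):
    """Predict a label by majority vote among the k nearest training sequences.

    `train` is a list of (sequence, label) pairs; distance is Euclidean
    distance between feature vectors.
    """
    query = features(query_seq)
    ranked = sorted((math.dist(features(seq), query), label) for seq, label in train)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy data with hypothetical labels, only to show the call pattern.
TRAIN = [
    ("KKRRKKRK", "binding"),
    ("RKKRRRKA", "binding"),
    ("AVLIMFWA", "membrane"),
    ("LLIVVAAF", "membrane"),
]
```

Calling `knn_predict(TRAIN, "KRKRKRKK")` classifies a new sequence by comparing its feature vector with those of the labeled sequences. A fuller treatment would presumably add position-dependent (local) features alongside these global composition fractions.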