Predicting protein-protein interactions based on the optimized feature subset

Zhan-chao LI, Zong DAI, Xiao-yong ZOU
DOI: https://doi.org/10.3969/j.issn.1004-1656.2014.09.021
2014-01-01
Abstract: Identifying protein-protein interactions provides useful information for elucidating protein functions and discovering drug targets. In this study, amino acid composition, dipeptide composition, conjoint triad, the composition, transition, and distribution descriptors, and normalized Moreau-Broto autocorrelation features are used to characterize protein-protein interactions. Minimum redundancy maximum relevance is employed to select an optimized feature subset, and a support vector machine is adopted to construct the model and predict protein-protein interactions in Saccharomyces. With the optimized subset, accuracies on the training set and test set are about 5% and 2% higher, respectively, than those obtained with dipeptide composition, showing the effectiveness of the method.
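As an illustration of two of the sequence descriptors named in the abstract, the following minimal sketch computes amino acid composition (20 values) and dipeptide composition (400 values) for a single sequence. The function names are hypothetical, and representing a protein pair by concatenating the two proteins' vectors is an assumption made here for illustration; the abstract does not state how pair features are combined.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(seq):
    """Return the fraction of each standard residue in seq (20 values)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    # Non-standard residues count toward the length but get no entry.
    return [seq.count(a) / len(seq) for a in AMINO_ACIDS]


def dipeptide_composition(seq):
    """Return the fraction of each ordered residue pair (400 values)."""
    seq = seq.upper()
    if len(seq) < 2:
        raise ValueError("sequence must contain at least two residues")
    # Overlapping dipeptides: positions (0,1), (1,2), ...
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    counts = {}
    for p in pairs:
        counts[p] = counts.get(p, 0) + 1
    return [counts.get(a + b, 0) / len(pairs)
            for a, b in product(AMINO_ACIDS, repeat=2)]


def pair_features(seq_a, seq_b):
    """Hypothetical pair encoding: concatenate both proteins' descriptors.

    Assumption for illustration only; the abstract does not specify
    how the two proteins' features are combined.
    """
    return (amino_acid_composition(seq_a) + dipeptide_composition(seq_a)
            + amino_acid_composition(seq_b) + dipeptide_composition(seq_b))
```

Under these assumptions each protein pair becomes an 840-dimensional vector, which could then be passed to a feature selection step and a classifier such as those described in the abstract.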