Protein Function Prediction Using Multi-label Learning and ISOMAP Embedding.

Huadong Liang,Dengdi Sun,Zhuanlian Ding,Meiling Ge
DOI: https://doi.org/10.1007/978-3-662-49014-3_23
2015-01-01
Abstract:As more and more high-throughput proteome data are collected, automated annotation of protein function has been one of the most challenging problems of the post-genomic era. To address this challenge, we propose a novel functional annotation framework incorporating manifold embedding and multi-label classification to predict protein function on protein-protein interaction (PPI) network. Unlike the existing approaches that depend on the original network, our method weights it by edge betweenness, and embeds simultaneously the annotated and unannotated proteins into an Euclidean metric space via isometric feature mapping (ISOMAP). Then, with these low-dimensional coordinates, the protein expressions are quantified and the functional assignment is transformed into a multi-label classification problem. The approach results in a set of feasible functional labels for each unannotated protein. We conduct extensive experiments on yeast PPI database to evaluate the performance of different multi-label learning methods. The results demonstrate that the proposed method is an effective tool for protein function prediction.
What problem does this paper attempt to address?