Mining physical protein-protein interactions by exploiting abundant features

Minlie Huang,Shilin Ding,Hongning Wang,Xiaoyan Zhu
2007-01-01
Abstract:It is a great challenge to mine protein-protein interactions from bioscience literature. From a general perspective, there are three sub-tasks to mine biologically meaningful knowledge: first, classify documents containing interactions or not and filter irrelevant ones; second, extract protein-protein interactions (or interacting protein pairs) from the documents; finally, extract detailed information about the interactions, such as experimental detection methods of interactions, and summarization sentences describing them. Particularly, it is the knowledge from the third sub-task that is really meaningful for biologists. In this paper, we present a method of mining physical protein-protein interactions by exploiting abundant features during our participation in the PPI task of BioCreAtIvE Challenge 2006. Several machine learning algorithms for classification and ranking, including SVM and probabilistic model, and abundant …
What problem does this paper attempt to address?