Fusing multiple methods for discovering implicit knowledge in biomedical literature

Ran Chen,Hongfei Lin,Zhihao Yang
2009-01-01
Journal of Information and Computational Science
Abstract:The amount of biomedical literatures grows exponentially in various public databases and new relationships are often implicit from existed information. The objective of the paper is to select an effective computational method to discover these implicit relationships from MEDLINE, such as new connections between disease and chemicals, drugs or genes. Three computational methods are compared for scoring and ranking the MeSH terms: z-score, TFIDF (Term Frequency Inverse Document Frequency) and PMI (pointwise mutual information). According to the characteristics of the three methods, a fusion formula is introduced for re-ranking and re-choosing the terms to improve the final outcome. We report on three sets of experiments: Alzheimer's disease, migraine disorders, schizophrenia, and use the information retrieval metrics to evaluate the performances. Our empirical results validate the effectiveness of fusion approach over each traditional text mining computational approach. 1548-7741/ Copyright © 2009 Binary Information Press.
What problem does this paper attempt to address?