A MeSH-based Biomedical Literature Mining Method for Exploring Associations Between Genes and Clinical Terms

Feng Ya-Ning,Jiao Meng-Ying,Duan Hui-Long,Deng Ning
DOI: https://doi.org/10.16476/j.pibb.2015.0129
2015-01-01
PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS
Abstract:The causes and progressions of cancers have close associations with the mutations of genes in our body, which lead to abnormal symptoms and detection indicators. Therefore, providing clinical decision support for early diagnosis and precise treatment of cancers is very urgent and necessary, which can be achieved by mining the associations between genes and clinical behaviors from conclusive biomedical literature data. A MeSH-based (Medical Subject Headings, MeSH) method was proposed for biomedical objects association mining in this paper. By using MeSH (which is provided in PubMed) to represent each object as a vector in the Vector Space Model and taking the citations between articles into consideration, we translated the associations mining into mathematical operating successfully. We finally obtained 203 genes and 462 associations related to colorectal cancer (CRC) after applying our method in the associations mining between genes and clinical behaviors of CRC. In order to analyze and verify the mining results, some bioinformatics tools, such as g:Profiler and KEGG were used for functions and pathway analysis of genes. The results show that this MeSH-based method works robust in the association mining. Besides removing the restriction of co-occurrence for the indirect associations mining, our proposed method also avoid complex grammatical analysis which lead to massive calculation.
What problem does this paper attempt to address?