Abstract:Objective: The amount of biomedical data in different disciplines is growing at an exponential rate. Integrating these significant knowledge sources to generate novel hypotheses for systems biology research is difficult. Traditional Chinese medicine (TCM) is a completely different discipline, and is a complementary knowledge system to modern biomedical science. This paper uses a significant TCM bibliographic literature database in China, together with MEDLINE, to help discover novel gene functional knowledge. Materials and methods: We present an integrative mining approach to uncover the functional gene relationships from MEDLINE and TCM bibliographic literature. This paper introduces TCM literature (about 50,000 records) as one knowledge source for constructing literature-based gene networks. We use the TCM diagnosis, TCM syndrome, to automatically congregate the related genes. The syndrome-gene relationships are discovered based on the syndrome-disease relationships extracted from TCM literature and the disease-gene relationships in MEDLINE. Based on the bubble-bootstrapping and relation weight computing methods, we have developed a prototype system called MeDisco/3S, which has name entity and relation extraction, and online analytical processing (OLAP) capabilities, to perform the integrative mining process. Results: We have got about 200,000 syndrome-gene relations, which could help generate syndrome-based gene networks, and help analyze the functional knowledge of genes from syndrome perspective. We take the gene network of Kidney-Yang Deficiency syndrome (KYD syndrome) and the functional analysis of some genes, such as CRH (corticotropin releasing hormone), PTH (parathyroid hormone), PRL (prolactin), BRCA1 (breast cancer 1, early onset) and BRCA2 (breast cancer 2, early onset), to demonstrate the preliminary results. The underlying hypothesis is that the related genes of the same syndrome will have some biological functional relationships, and will constitute a functional network. Conclusion: This paper presents an approach to integrate TCM literature and modern biomedical data to discover novel gene networks and functional knowledge of genes. The preliminary results show that the novel gene functional knowledge and gene networks, which are worthy of further investigation, could be generated by integrating the two complementary biomedical data sources. It will be a promising research field through integrative mining of TCM and modern life science literature.

Literature-Mining For Genes Based Natural Language Processing And Biomedical Ontology

Gene Related Mining of Biomedical Literatures

Literature Mining Associations of Diseases Using Gene Ontology

Ontology-based biomedical literature management for competitive analysis

Mining Disease-Specific Molecular Association Profiles from Biomedical Literature: A Case Study

Literature mining discerns latent disease–gene relationships

Mining Meaningful Topics from Massive Biomedical Literature

Building a literature knowledge base towards transparent biomedical AI

Recent advances in biomedical literature mining

Biotopic: A Topic-Driven Biological Literature Mining System

A Semantic-Based Approach for Mining Undiscovered Public Knowledge from Biomedical Literature.

A Semantic Approach for Mining Hidden Links from Complementary and Non-interactive Biomedical Literature

Integrative Mining of Traditional Chinese Medicine Literature and MEDLINE for Functional Gene Networks

Knowledge Discovery in Biomedical Literature:Survey and Prospect

From Biomedical Literature to Knowledge: Mining Protein-Protein Interactions

Research on Text Mining of Biomedical Field Based on Pubmed

Mining Biomedical Literature for Terms Related to Epidemiologic Exposures.

Extracting Relationship Both Gene2Disease and Gene2Gene from Biomedical Literatures

A Text Feature-Based Approach for Literature Mining of Lncrna-Protein Interactions

A MeSH-based Biomedical Literature Mining Method for Exploring Associations Between Genes and Clinical Terms

PubMed and beyond: biomedical literature search in the age of artificial intelligence