A computable biomedical knowledge system: Toward rapidly building candidate‐directed acyclic graphs

Yongmei Bai,Xuanyu Shi,Jian Du
DOI: https://doi.org/10.1111/jebm.12602
2024-04-02
Journal of Evidence-Based Medicine
Abstract:Aim It is essential for health researchers to have a systematic understanding of third‐party variables that influence both the exposure and outcome under investigation, as shown by a directed acyclic graph (DAG). The traditional construction of DAGs through literature review and expert knowledge often needs to be more systematic and consistent, leading to potential biases. We try to introduce an automatic approach to building network linking variables of interest. Methods Large‐scale text mining from medical literature was utilized to construct a conceptual network based on the Semantic MEDLINE Database (SemMedDB). SemMedDB is a PubMed‐scale repository of the "concept‐relation‐concept" triple format. Relations between concepts are categorized as Excitatory, Inhibitory, or General. Results To facilitate the use of large‐scale triple sets in SemMedDB, we have developed a computable biomedical knowledge (CBK) system (https://cbk.bjmu.edu.cn/), a website that enables direct retrieval of related publications and their corresponding triples without the necessity of writing SQL statements. Three case studies were elaborated to demonstrate the applications of the CBK system. Conclusions The CBK system is openly available and user‐friendly for rapidly capturing a set of influencing factors for a phenotype and building candidate DAGs between exposure‐outcome variables. It could be a valuable tool to reduce the exploration time in considering relationships between variables, and constructing a DAG. A reliable and standardized DAG could significantly improve the design and interpretation of observational health research.
medicine, general & internal
What problem does this paper attempt to address?