A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises

Carl Yang, Hejie Cui, Jiaying Lu, Shiyu Wang, Ran Xu, Wenjing Ma, Yue Yu, Shaojun Yu, Xuan Kan, Chen Ling, Tianfan Fu, Liang Zhao, Joyce Ho, Fei Wang
2024-08-04
Abstract: Healthcare knowledge graphs (HKGs) are valuable tools for organizing biomedical concepts and their relationships with interpretable structures. The recent advent of large language models (LLMs) has paved the way for building more comprehensive and accurate HKGs. This, in turn, can improve the reliability of generated content and enable better evaluation of LLMs. However, the challenges of HKGs, such as data heterogeneity and limited coverage, are not fully understood, highlighting the need for detailed reviews. This work provides the first comprehensive review of HKGs. It summarizes the pipeline and key techniques for HKG construction, as well as the common utilization approaches, i.e., model-free and model-based. The existing HKG resources are also organized based on the data types they capture and the application domains they cover, along with relevant statistical information (resource available at https://github.com/lujiaying/Awesome-HealthCare-KnowledgeBase). At the application level, we delve into the successful integration of HKGs across various health domains, ranging from fine-grained basic science research to high-level clinical decision support and public health. Lastly, the paper highlights the opportunities for HKGs in the era of LLMs. This work aims to serve as a valuable resource for understanding the potential and opportunities of HKGs in health research.
Artificial Intelligence, Computation and Language, Machine Learning, Social and Information Networks
What problem does this paper attempt to address?
The paper addresses how Healthcare Knowledge Graphs (HKGs) organize biomedical concepts and their relationships, and explores their potential in health-related applications. Specifically, its goals are:
1. **Comprehensive Overview**: Provide the first comprehensive review of HKGs, covering the processes and techniques of HKG construction as well as successful cases of their application in different health-related contexts.
2. **Technologies and Methods**: Discuss in detail the methods for constructing HKGs, including building from scratch and integrating existing data resources, and introduce the two main utilization approaches: model-free and model-based.
3. **Existing Resources**: Summarize the existing HKG resources and their application scopes so that researchers and healthcare professionals can make better use of them.
4. **Application Cases**: Explore the applications of HKGs in fields such as basic science research, drug development, clinical decision support, and public health.
5. **Future Opportunities**: Identify the opportunities and challenges that HKGs face in the era of large language models (LLMs).
Through this work, the paper aims to promote further research and development of HKGs in healthcare and to enhance their reliability and effectiveness in practical applications.
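To make the model-free utilization in goal 2 more concrete, the following is a minimal sketch, assuming an HKG can be represented as (head, relation, tail) triples and queried by direct lookup without any learned model. The triples, the relation names, and the helpers `build_index` and `query` are hypothetical and chosen only for illustration; they are not taken from the paper or from any specific HKG resource.

```python
from collections import defaultdict

# Toy triples for illustration only; not drawn from any HKG resource.
TRIPLES = [
    ("metformin", "treats", "type 2 diabetes"),
    ("aspirin", "treats", "headache"),
    ("aspirin", "interacts_with", "warfarin"),
    ("warfarin", "treats", "thrombosis"),
]


def build_index(triples):
    """Index tails by (head, relation) so lookups need no scan."""
    index = defaultdict(list)
    for head, relation, tail in triples:
        index[(head, relation)].append(tail)
    return index


def query(index, head, relation):
    """Model-free lookup: return all tails linked to head via relation."""
    return list(index.get((head, relation), []))


INDEX = build_index(TRIPLES)

if __name__ == "__main__":
    print(query(INDEX, "aspirin", "interacts_with"))
```

A model-based approach would instead feed the same triples into a learned model, for example to score links that are not stored explicitly; that step is beyond this sketch.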