Traditional Chinese Medicine Knowledge Graph Construction Based on Large Language Models

Yichong Zhang,Yongtao Hao
DOI: https://doi.org/10.3390/electronics13071395
IF: 2.9
2024-04-08
Electronics
Abstract:This study explores the use of large language models in constructing a knowledge graph for Traditional Chinese Medicine (TCM) to improve the representation, storage, and application of TCM knowledge. The knowledge graph, based on a graph structure, effectively organizes entities, attributes, and relationships within the TCM domain. By leveraging large language models, we collected and embedded substantial TCM–related data, generating precise representations transformed into a knowledge graph format. Experimental evaluations confirmed the accuracy and effectiveness of the constructed graph, extracting various entities and their relationships, providing a solid foundation for TCM learning, research, and application. The knowledge graph has significant potential in TCM, aiding in teaching, disease diagnosis, treatment decisions, and contributing to TCM modernization. In conclusion, this paper utilizes large language models to construct a knowledge graph for TCM, offering a vital foundation for knowledge representation and application in the field, with potential for future expansion and refinement.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
The paper aims to address the following issues: 1. **Constructing a Traditional Chinese Medicine (TCM) Knowledge Graph**: By utilizing large-scale language models (LLMs) to construct a knowledge graph in the field of Traditional Chinese Medicine (TCM), thereby improving the representation, storage, and application of TCM knowledge. The study mentions that traditional methods are inefficient and prone to errors when handling large amounts of data and complex relationships, whereas large-scale language models can automate the extraction of entities, attributes, and relationships, significantly reducing the workload of manual annotation and improving construction efficiency and accuracy. 2. **Enhancing Automation and Reducing Labor Costs**: Existing methods for constructing TCM knowledge graphs still rely heavily on manual intervention to correct errors and improve data quality, which increases human resource costs. This paper adopts advanced large-scale language models and fine-tunes them with domain expert knowledge to enhance the accuracy of the automatic extraction process, thereby reducing dependence on human resources. 3. **Achieving High-Precision Named Entity Recognition**: By using large-scale language models for named entity recognition and employing few-shot learning techniques to achieve high-precision entity recognition, the cost of manual annotation is significantly reduced, laying a foundation for constructing an accurate TCM knowledge graph. In summary, this paper is primarily dedicated to overcoming the challenges in the current construction of TCM knowledge graphs by leveraging large-scale language models, improving automation and accuracy, and reducing labor costs, thereby providing a solid foundation for the modern application of TCM knowledge.