A Deep Learning Model Incorporating Knowledge Representation Vectors and Its Application in Diabetes Prediction

He Xu, Qunli Zheng, Jingshu Zhu, Zuoling Xie, Haitao Cheng, Peng Li, Yimu Ji
DOI: https://doi.org/10.1155/2022/7593750
2022-08-12
Disease Markers
Abstract: Deep learning methods have become very effective for various disease prediction tasks and can even surpass human experts. However, their lack of interpretability and of medical expertise limits their clinical application. This paper combines knowledge representation learning with deep learning to construct a disease prediction model. The model first constructs a relationship graph between physical indicators and test values, based on the normal ranges of human physical examination indices. The test values of each physical examination index are then encoded by a knowledge representation learning model. Next, the patient's physical examination data are represented as vectors and fed into a deep learning model built from a self-attention mechanism and a convolutional neural network to perform disease prediction. The experimental results show that, when applied to diabetes prediction, the model achieves an accuracy of 97.18% and a recall of 87.55%, outperforming other machine learning methods (e.g., lasso, ridge, support vector machine, random forest, and XGBoost). Compared with random forest, the best-performing of these baselines, recall increases by 5.34%. It can therefore be concluded that injecting medical knowledge into deep learning through knowledge representation learning can be used in diabetes prediction for early detection and assisted diagnosis.
genetics & heredity; pathology; medicine, research & experimental; biotechnology & applied microbiology
What problem does this paper attempt to address?
This paper addresses a limitation of current deep learning methods in disease prediction: their lack of interpretability and of medical expertise, which hinders wide adoption in clinical diagnosis. Although deep learning methods have become very effective at predicting various diseases, and have even outperformed human experts, they are black boxes. Their internal decision-making is opaque, so patients and doctors cannot readily understand the basis for a model's decisions, which undermines trust. In addition, most deep learning models are data-driven and depend on large, high-quality data sets. Medical data, however, are characterized by uncertainty, heterogeneity, time dependence, sparsity, and irregularity, all of which make data quality hard to ensure. The paper therefore proposes a method that combines knowledge representation with deep learning. By integrating medical expertise into deep learning models, it aims to improve both model interpretability and data quality, with diabetes prediction as the application. The goal is early detection and auxiliary diagnosis of disease, together with better performance on metrics such as accuracy and recall.
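The first step described in the abstract, relating each physical indicator to a test value through its normal range, can be illustrated with a minimal sketch. This is not the authors' implementation: the indicator names, the numeric ranges, the three-level discretization (low / normal / high), and the relation name `has_level` are all assumptions made for illustration only, and the ranges are not clinical guidance. The resulting (head, relation, tail) triples are the kind of input a knowledge representation learning model would consume to learn embeddings.

```python
from typing import Dict, List, Tuple

# Hypothetical reference ranges, for illustration only
# (not taken from the paper, not clinical guidance).
NORMAL_RANGES: Dict[str, Tuple[float, float]] = {
    "fasting_glucose_mmol_l": (3.9, 6.1),
    "triglycerides_mmol_l": (0.45, 1.7),
}

Triple = Tuple[str, str, str]  # (head entity, relation, tail entity)


def level_of(indicator: str, value: float) -> str:
    """Discretize a raw test value against the indicator's normal range."""
    low, high = NORMAL_RANGES[indicator]
    if value < low:
        return "low"
    if value > high:
        return "high"
    return "normal"  # boundary values count as normal in this sketch


def record_to_triples(record: Dict[str, float]) -> List[Triple]:
    """Turn one patient's examination record into graph triples.

    Indicators without a known normal range are skipped. The relation
    name "has_level" is an assumed label, not one defined by the paper.
    """
    return [
        (name, "has_level", f"{name}:{level_of(name, value)}")
        for name, value in record.items()
        if name in NORMAL_RANGES
    ]


if __name__ == "__main__":
    patient = {"fasting_glucose_mmol_l": 7.8, "triglycerides_mmol_l": 1.2}
    for triple in record_to_triples(patient):
        print(triple)
```

Discretizing against the normal range is what carries the medical knowledge into the representation in this sketch: two raw values that fall on opposite sides of a range boundary map to different tail entities, even if they are numerically close.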