Abstract:BACKGROUND Artificial intelligence–based assistive diagnostic systems imitate the deductive reasoning process of a human physician in biomedical disease diagnosis and treatment decision making. While impressive progress in this area has been reported, most of the reported successes are applications of artificial intelligence in Western medicine. The application of artificial intelligence in traditional Chinese medicine has lagged mainly because traditional Chinese medicine practitioners need to perform syndrome differentiation as well as biomedical disease diagnosis before a treatment decision can be made. Syndrome, a concept unique to traditional Chinese medicine, is an abstraction of a variety of signs and symptoms. The fact that the relationship between diseases and syndromes is not one-to-one but rather many-to-many makes it very challenging for a machine to perform syndrome predictions. So far, only a handful of artificial intelligence–based assistive traditional Chinese medicine diagnostic models have been reported, and they are limited in application to a single disease-type. OBJECTIVE The objective was to develop an artificial intelligence–based assistive diagnostic system capable of diagnosing multiple types of diseases that are common in traditional Chinese medicine, given a patient’s electronic health record notes. The system was designed to simultaneously diagnose the disease and produce a list of corresponding syndromes. METHODS Unstructured freestyle electronic health record notes were processed by natural language processing techniques to extract clinical information such as signs and symptoms which were represented by named entities. Natural language processing used a recurrent neural network model called bidirectional long short-term memory network–conditional random forest. A convolutional neural network was then used to predict the disease-type out of 187 diseases in traditional Chinese medicine. A novel traditional Chinese medicine syndrome prediction method—an integrated learning model—was used to produce a corresponding list of probable syndromes. By following a majority-rule voting method, the integrated learning model for syndrome prediction can take advantage of four existing prediction methods (back propagation, random forest, extreme gradient boosting, and support vector classifier) while avoiding their respective weaknesses which resulted in a consistently high prediction accuracy. RESULTS A data set consisting of 22,984 electronic health records from Guanganmen Hospital of the China Academy of Chinese Medical Sciences that were collected between January 1, 2017 and September 7, 2018 was used. The data set contained a total of 187 diseases that are commonly diagnosed in traditional Chinese medicine. The diagnostic system was designed to be able to detect any one of the 187 disease-types. The data set was partitioned into a training set, a validation set, and a testing set in a ratio of 8:1:1. Test results suggested that the proposed system had a good diagnostic accuracy and a strong capability for generalization. The disease-type prediction accuracies of the top one, top three, and top five were 80.5%, 91.6%, and 94.2%, respectively. CONCLUSIONS The main contributions of the artificial intelligence–based traditional Chinese medicine assistive diagnostic system proposed in this paper are that 187 commonly known traditional Chinese medicine diseases can be diagnosed and a novel prediction method called an integrated learning model is demonstrated. This new prediction method outperformed all four existing methods in our preliminary experimental results. With further improvement of the algorithms and the availability of additional electronic health record data, it is expected that a wider range of traditional Chinese medicine disease-types could be diagnosed and that better diagnostic accuracies could be achieved.

A Patient-Similarity-based Model for Diagnostic Prediction

Deep Dynamic Patient Similarity Analysis: Model Development and Validation in ICU.

A Prediction Model Based on Machine Learning for Diagnosing the Early COVID-19 Patients

Interactive similar patient retrieval for visual summary of patient outcomes

Explainable Diagnosis Prediction through Neuro-Symbolic Integration

A Comorbidity Knowledge-Aware Model for Disease Prognostic Prediction

Artificial Intelligence–Based Traditional Chinese Medicine Assistive Diagnostic System: Validation Study (Preprint)

Artificial Intelligence–Based Traditional Chinese Medicine Assistive Diagnostic System: Validation Study

A General Framework for Diagnosis Prediction Via Incorporating Medical Code Descriptions

Enhancing Model Interpretability and Accuracy for Disease Progression Prediction via Phenotype-Based Patient Similarity Learning

A novel intelligent model for visualized inference of medical diagnosis: A case of TCM

A multiperiod hybrid decision support model for medical diagnosis and treatment based on similarities and three‐way decision theory

SCOPE: predicting future diagnoses in office visits using electronic health records

Leveraging Interpretable Feature Representations for Advanced Differential Diagnosis in Computational Medicine

A Visual Analytics System for Multi-model Comparison on Clinical Data Predictions

Patient similarity: methods and applications

Patient Similarity Analysis with Longitudinal Health Data

Simultaneous Imputation and Prediction with High-dimensional Data (SIP-HD): A Deep Learning Model for Disease Diagnosis

Efficient symptom inquiring and diagnosis via adaptive alignment of reinforcement learning and classification

Diagnose Like a Radiologist: Hybrid Neuro-Probabilistic Reasoning for Attribute-Based Medical Image Diagnosis