A doctor's diagnosis experience enhanced transformer model for automatic diagnosis
Fuxi Zhang, Guoming Sang, Zhi Liu, Hongfei Lin, Yijia Zhang
DOI: https://doi.org/10.1016/j.engappai.2024.108675
IF: 8
2024-06-01
Engineering Applications of Artificial Intelligence
Abstract: Automatic diagnosis, an important research direction in artificial intelligence engineering, has advanced significantly in recent years. In real diagnostic scenarios, after the patient reports their most obvious symptoms, the doctor often has to guide the patient to uncover further potential symptoms, and makes the final diagnosis only after obtaining sufficient evidence. Previous work has modeled this as a sequential decision-making process and trained an optimal policy using reinforcement learning (RL)-based methods, achieving a high diagnostic accuracy. However, RL-based methods only find symptom sequences that improve the reward through random trials, and fail to directly capture the inter-relationships between symptoms in the inquiry sequence. Moreover, doctors' diagnostic experience, especially symptoms that often co-occur and lie close to each other in a doctor's inquiry sequence, has not been well explored in previous implementations. To address this, we propose a model that strengthens the learning of doctors' diagnostic experience, built on a two-part transformer model that performs symptom inquiry and disease diagnosis separately. We introduce a Doctor Diagnosis Experience Reinforcement Module (DDEM) into the symptom inquiry part, reducing the model's sensitivity to the order of symptoms within the same symptom cluster. We then further reinforce the learning of this symptom distribution through corresponding reinforcement learning reward rules. In addition, we propose three new transition rules for switching from the symptom inquiry model to the disease diagnosis model, so that the two models collaborate in a manner that closely follows the diagnostic logic used by doctors. Experiments on three public real-world medical dialogue datasets demonstrate that the proposed model improves diagnostic accuracy by 2%, 1.4%, and 0.7%, respectively, and shows a clear advantage in symptom recall rate, highlighting its effectiveness.
automation & control systems; computer science, artificial intelligence; engineering, electrical & electronic; engineering, multidisciplinary
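
The abstract reports results on two metrics, diagnostic accuracy and symptom recall rate, without defining them. The sketch below shows one common way such metrics are computed for symptom-inquiry agents. It is an illustration under stated assumptions, not the paper's evaluation code: the function names, the input format, and the exact definition of recall (share of a patient's implicit symptoms that the agent asked about) are assumed here.

```python
from typing import Iterable, Sequence


def symptom_recall(inquired: Iterable[str], implicit: Iterable[str]) -> float:
    """Share of the patient's implicit symptoms that the agent inquired about.

    Assumed definition: |inquired ∩ implicit| / |implicit|.
    Returns 0.0 when the patient has no implicit symptoms.
    """
    implicit_set = set(implicit)
    if not implicit_set:
        return 0.0
    return len(implicit_set & set(inquired)) / len(implicit_set)


def diagnosis_accuracy(predicted: Sequence[str], actual: Sequence[str]) -> float:
    """Fraction of dialogues whose predicted disease matches the labeled one."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        return 0.0
    correct = sum(p == a for p, a in zip(predicted, actual))
    return correct / len(actual)


# Hypothetical example data, made up for illustration only.
recall = symptom_recall(
    inquired=["cough", "fever", "rash"],
    implicit=["fever", "headache"],
)
accuracy = diagnosis_accuracy(
    predicted=["flu", "cold"],
    actual=["flu", "flu"],
)
```

Under these assumed definitions, recall rewards an agent for asking about symptoms the patient actually has but did not volunteer, while accuracy measures only the final disease prediction.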