ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs

Zihao Zhao, Sheng Wang, Jinchen Gu, Yitao Zhu, Lanzhuju Mei, Zixu Zhuang, Zhiming Cui, Qian Wang, Dinggang Shen
2024-04-17
Abstract: The integration of Computer-Aided Diagnosis (CAD) with Large Language Models (LLMs) presents a promising frontier in clinical applications, notably in automating diagnostic processes akin to those performed by radiologists and providing consultations similar to a virtual family doctor. Despite this potential, current works face at least two limitations: (1) From the perspective of a radiologist, existing studies typically cover a restricted scope of imaging domains, failing to meet the diagnostic needs of different patients. In addition, the insufficient diagnostic capability of LLMs further undermines the quality and reliability of the generated medical reports. (2) Current LLMs lack the requisite depth of medical expertise, which makes them less effective as virtual family doctors because the advice they provide during patient consultations may be unreliable. To address these limitations, we introduce ChatCAD+, designed to be universal and reliable. Specifically, it features two main modules: (1) Reliable Report Generation and (2) Reliable Interaction. The Reliable Report Generation module is capable of interpreting medical images from diverse domains and generating high-quality medical reports via our proposed hierarchical in-context learning. Concurrently, the interaction module leverages up-to-date information from reputable medical websites to provide reliable medical advice. Together, these modules closely align with the expertise of human medical professionals, offering enhanced consistency and reliability for interpretation and advice. The source code is available at
Computer Science
What problem does this paper attempt to address?
The paper aims to address the following issues:

1. **Limitations of Multimodal Medical Image Diagnosis**: Current research combining large language models (LLMs) with computer-aided diagnosis (CAD) systems is usually limited to specific imaging domains (such as chest X-rays), so it lacks generality and fails to meet the diagnostic needs of different patients. In addition, the limited diagnostic capability of existing LLMs lowers the quality and reliability of the medical reports they generate.
2. **Insufficient Functionality of Virtual Family Doctors**: Current LLMs lack sufficient medical expertise when providing medical consultations, leading to potentially unreliable advice, which limits their effectiveness as virtual family doctors.

To address these issues, the research team proposed the ChatCAD+ system, which has two main modules:

- **Reliable Report Generation Module**: Interprets medical images from different domains and generates high-quality medical reports through hierarchical in-context learning.
- **Reliable Interaction Module**: Draws on up-to-date information from reputable medical websites to provide reliable medical advice, thereby improving the quality and reliability of interactions with patients.

By combining these two modules, ChatCAD+ aims to bring its diagnostic and consultation capabilities closer to the level of human medical professionals, improving the consistency and reliability of diagnostic reports and medical advice.
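
The summary above says that the interaction module grounds its advice in information retrieved from medical websites, but it does not describe how that retrieval works. The following is only a minimal sketch of the general retrieve-then-prompt idea, not the authors' implementation: the function names (`retrieve`, `build_prompt`), the bag-of-words cosine scoring, and the prompt wording are all assumptions made for illustration, and the sample passages are hypothetical stand-ins for website content.

```python
from collections import Counter
import math
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, passages, k=2):
    """Return up to k passages ranked by similarity to the query.

    Passages with zero overlap are dropped. The scoring scheme is an
    illustrative assumption, not the method used in ChatCAD+.
    """
    q = Counter(tokenize(query))
    scored = [
        (cosine_similarity(q, Counter(tokenize(p))), i)
        for i, p in enumerate(passages)
    ]
    # Highest score first; ties broken by original order.
    scored.sort(key=lambda s: (-s[0], s[1]))
    return [passages[i] for score, i in scored[:k] if score > 0]


def build_prompt(query, passages, k=2):
    """Assemble an LLM prompt that quotes the retrieved passages (hypothetical format)."""
    lines = ["Answer using only the reference passages below."]
    for n, p in enumerate(retrieve(query, passages, k), 1):
        lines.append(f"[{n}] {p}")
    lines.append(f"Question: {query}")
    return "\n".join(lines)


# Hypothetical reference passages standing in for medical website content.
PASSAGES = [
    "Pneumonia is an infection that inflames the air sacs in the lungs.",
    "A meniscus tear is a common knee injury.",
    "Periodontitis is a serious gum infection.",
]

if __name__ == "__main__":
    print(build_prompt("What is a knee meniscus tear?", PASSAGES, k=1))
```

The prompt produced this way would then be passed to an LLM. Constraining the model to quoted reference text is one common way to make advice traceable to a source, which is the kind of reliability the summary attributes to the interaction module.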