ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs

Zihao Zhao,Sheng Wang,Jinchen Gu,Yitao Zhu,Lanzhuju Mei,Zixu Zhuang,Zhiming Cui,Qian Wang,Dinggang Shen

2024-04-17

Abstract:The integration of Computer-Aided Diagnosis (CAD) with Large Language Models (LLMs) presents a promising frontier in clinical applications, notably in automating diagnostic processes akin to those performed by radiologists and providing consultations similar to a virtual family doctor. Despite the promising potential of this integration, current works face at least two limitations: (1) From the perspective of a radiologist, existing studies typically have a restricted scope of applicable imaging domains, failing to meet the diagnostic needs of different patients. Also, the insufficient diagnostic capability of LLMs further undermine the quality and reliability of the generated medical reports. (2) Current LLMs lack the requisite depth in medical expertise, rendering them less effective as virtual family doctors due to the potential unreliability of the advice provided during patient consultations. To address these limitations, we introduce ChatCAD+, to be universal and reliable. Specifically, it is featured by two main modules: (1) Reliable Report Generation and (2) Reliable Interaction. The Reliable Report Generation module is capable of interpreting medical images from diverse domains and generate high-quality medical reports via our proposed hierarchical in-context learning. Concurrently, the interaction module leverages up-to-date information from reputable medical websites to provide reliable medical advice. Together, these designed modules synergize to closely align with the expertise of human medical professionals, offering enhanced consistency and reliability for interpretation and advice. The source code is available at

Computer Science

What problem does this paper attempt to address?

The paper aims to address the following issues: 1. **Limitations of Multimodal Medical Image Diagnosis**: Current research combining large language models (LLMs) with computer-aided diagnosis (CAD) systems is usually limited to specific imaging fields (such as chest X-rays), lacking generality and reliability, and failing to meet the diagnostic needs of different patients. Additionally, the quality of medical reports generated by existing LLMs is not high, affecting their reliability and practicality. 2. **Insufficient Functionality of Virtual Family Doctors**: Current LLMs lack sufficient medical expertise when providing medical consultations, leading to potentially unreliable advice, which limits their effectiveness as virtual family doctors. To address these issues, the research team proposed the ChatCAD+ system, which has two main modules: - **Reliable Report Generation Module**: Capable of interpreting medical images from different fields and generating high-quality medical reports through a hierarchical context learning approach. - **Reliable Interaction Module**: Utilizes the latest medical website information to provide reliable medical advice, thereby enhancing the quality and reliability of interactions with patients. Through the synergistic work of these two modules, ChatCAD+ aims to achieve diagnostic and consultation capabilities closer to the level of human medical professionals, improving the consistency and reliability of diagnostic reports and medical advice.

ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs

ChatCAD: Interactive Computer-Aided Diagnosis on Medical Image using Large Language Models

TCMChat: A Generative Large Language Model for Traditional Chinese Medicine

An Interactive Task Analysis Framework and Interactive System Research for Computer Aided Diagnosis

ChatDoctor: A Medical Chat Model Fine-tuned on LLaMA Model using Medical Domain Knowledge

CopilotCAD: Empowering Radiologists with Report Completion Models and Quantitative Evidence from Medical Image Foundation Models

Exploring the Potential of Large Language Models in Radiological Imaging Systems: Improving User Interface Design and Functional Capabilities

ChatRadio-Valuer: A Chat Large Language Model for Generalizable Radiology Report Generation Based on Multi-institution and Multi-system Data

AI Hospital: Interactive Evaluation and Collaboration of LLMs As Intern Doctors for Clinical Diagnosis

MedXChat: A Unified Multimodal Large Language Model Framework towards CXRs Understanding and Generation

ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge

Improving Clinical Expertise in Large Language Models Using Electronic Medical Records

Enhancing Clinical Accuracy of Medical Chatbots with Large Language Models

Can large language models be new supportive tools in coronary computed tomography angiography reporting?

ECG-Chat: A Large ECG-Language Model for Cardiac Disease Diagnosis

LLM-Mini-CEX: Automatic Evaluation of Large Language Model for Diagnostic Conversation

AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator

Dia-LLaMA: Towards Large Language Model-driven CT Report Generation

Towards Accurate Differential Diagnosis with Large Language Models

MedChatZH: a Better Medical Adviser Learns from Better Instructions

MedChatZH: A tuning LLM for traditional Chinese medicine consultations