Integrating MedCLIP and Cross-Modal Fusion for Automatic Radiology Report Generation

Qianhao Han,Junyi Liu,Zengchang Qin,Zheng Zheng
2024-12-10
Abstract:Automating radiology report generation can significantly reduce the workload of radiologists and enhance the accuracy, consistency, and efficiency of clinical <a class="link-external link-http" href="http://documentation.We" rel="external noopener nofollow">this http URL</a> propose a novel cross-modal framework that uses MedCLIP as both a vision extractor and a retrieval mechanism to improve the process of medical report <a class="link-external link-http" href="http://generation.By" rel="external noopener nofollow">this http URL</a> extracting retrieved report features and image features through an attention-based extract module, and integrating them with a fusion module, our method improves the coherence and clinical relevance of generated <a class="link-external link-http" href="http://reports.Experimental" rel="external noopener nofollow">this http URL</a> results on the widely used IU-Xray dataset demonstrate the effectiveness of our approach, showing improvements over commonly used methods in both report quality and <a class="link-external link-http" href="http://relevance.Additionally" rel="external noopener nofollow">this http URL</a>, ablation studies provide further validation of the framework, highlighting the importance of accurate report retrieval and feature integration in generating comprehensive medical reports.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?