Instance-level Expert Knowledge and Aggregate Discriminative Attention for Radiology Report Generation

Shenshen Bu,Taiji Li,Zhiming Dai,Yuedong Yang
DOI: https://doi.org/10.1109/cvpr52733.2024.01346
2024-01-01
Abstract:Automatic radiology report generation can provide sub-stantial advantages to clinical physicians by effectively re-ducing their workload and improving efficiency. Despite the promising potential of current methods, challenges persist in effectively extracting and preventing degradation of prominent features, as well as enhancing attention on piv-otal regions. In this paper, we propose an Instance-level Expert Knowledge and Aggregate Discriminative Attention framework (EKAGen 1 1 https://github.com/hnjzbss/EKAGen) for radiology report generation. We convert expert reports into an embedding space and gener-ate comprehensive representations for each disease, which serve as Preliminary Knowledge Support (PKS). To prevent feature disruption, we select the representations in the em-bedding space with the smallest distances to P KS as Rec-tified Knowledge Support (RKS). Then, EKAGen diagnoses the diseases and retrieves knowledge from RKS, creating Instance-level Expert Knowledge (IEK) for each query image, boosting generation. Additionally, we introduce Ag-gregate Discriminative Attention Map (ADM), which uses weak supervision to create maps of discriminative regions that highlight pivotal regions. For training, we propose a Global Information Self-Distillation (GID) strategy, using an iteratively optimized model to distill global knowledge into EKAGen. Extensive experiments and analyses on IU X-Ray and MIMIC-CXR datasets demonstrate that EKAGen outperforms previous state-of-the-art methods.
What problem does this paper attempt to address?