Adaptively Multi-Objective Adversarial Training for Medical Image Report Generation.

Xuemiao Zhang,Junfei Liu
DOI: https://doi.org/10.1109/BIBM58861.2023.10386004
2023-01-01
Abstract:Medical Image Report Generation aims to automate the creation of medical reports from radiology images, alleviating the burden on radiologists. The challenge lies in producing text that aligns with the image content, highlights local anomalies, and adheres to professional norms. In this study, we restructure this task as a multi-objective optimization problem and introduce a novel adversarial training framework, MORGAN, with multiple discriminators to guide the generation of reports akin to those written by professional radiologists. MORGAN employs three discriminators to assess the alignment of image-text, focus on local anomalies, and adherence to medical norms when generating each token. These assessments serve as reinforcement learning rewards, guiding the generator to optimize these objectives in real time. We extensively test MORGAN on two prevalent chest-related disease datasets. The results demonstrate that MORGAN significantly enhances the generator’s performance on most metrics.
What problem does this paper attempt to address?