Generative Adversarial Network Using Multi-modal Guidance for Ultrasound Images Inpainting.

Ruiguo Yu,Jiachen Hu,Xi Wei,Mei Yu,Jialin Zhu,Jie Gao,Zhiqiang Liu,Xuewei Li
DOI: https://doi.org/10.1007/978-3-030-63830-6_29
2020-01-01
Abstract:Medical image inpainting not only helps computer-aided diagnosis systems to eliminate the interference of irrelevant information in medical images, but also helps doctors to prognosis and evaluate the operation by blocking and inpainting the lesion area. However, the existing diffusion-based or patch-based methods have poor performance on complex images with non-repeating structures, and the generate-based methods lack sufficient priori knowledge, which leads to the inability to generate repair content with reasonable structure and visual reality. This paper proposes a generative adversarial network via multi-modal guidance (MMG-GAN), which is composed of the multi-modal guided network and the fine inpainting network. The multi-modal guided network obtains the low-frequency structure, high-frequency texture and high-order semantic of original image through the structure reconstruction generator, texture refinement generator and semantic guidance generator. Utilizing the potential attention mechanism of convolution operation, the fine inpainting network adaptively fuses features to achieve realistic inpainting. By adding the multi-modal guided network, MMG-GAN realizes the inpainting content with reasonable structure, reliable texture and consistent semantic. Experimental results on Thyroid Ultrasound Image (TUI) dataset and TN-SCUI2020 dataset show that our method outperforms other state-of-the-art methods in terms of PSNR, SSIM, and relative l1 measures. Code and TUI dataset will be made publicly available.
What problem does this paper attempt to address?