MultiResUNet: Rethinking the U-Net Architecture for Multimodal Biomedical Image Segmentation

Nabil Ibtehaz, M. Sohel Rahman
DOI: https://doi.org/10.1016/j.neunet.2019.08.025
2019-02-12
Abstract: In recent years, Deep Learning has brought about a breakthrough in Medical Image Segmentation. U-Net is the most prominent deep network in this regard and has been the most popular architecture in the medical imaging community. Despite its outstanding overall performance in segmenting multimodal medical images, extensive experimentation on challenging datasets showed us that the classical U-Net architecture seems to be lacking in certain aspects. Therefore, we propose some modifications to improve upon the already state-of-the-art U-Net model. Following these modifications, we develop a novel architecture, MultiResUNet, as the potential successor to the successful U-Net architecture. We have compared our proposed architecture MultiResUNet with the classical U-Net on a vast repertoire of multimodal medical images. Although the improvements are slight for ideal images, a remarkable gain in performance has been attained for challenging images. We have evaluated our model on five different datasets, each with its own unique challenges, and have obtained relative improvements in performance of 10.15%, 5.07%, 2.63%, 1.41%, and 0.62%, respectively.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### The Problem the Paper Attempts to Solve

The paper aims to improve the classic U-Net architecture for multimodal medical image segmentation. Although U-Net performs well overall, it still has shortcomings on challenging datasets. The authors therefore propose a new architecture, MultiResUNet, which improves on the traditional U-Net through two architectural changes and supports the claim with a broad evaluation:

1. **Multi-resolution Analysis**: Introduces MultiRes blocks, enabling the network to better handle features at different scales.
2. **Semantic Gap Compensation**: Proposes "Res paths" to reduce the potential semantic gap between the encoder and decoder.
3. **Experimental Validation**: Conducts extensive experimental validation on multiple medical image datasets, including fluorescence microscopy images, electron microscopy images, dermoscopy images, endoscopy images, and magnetic resonance imaging (MRI).

With these changes, MultiResUNet outperforms the traditional U-Net, with the largest gains on challenging images.
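To make the two architectural ideas more concrete, here is a minimal NumPy sketch of a MultiRes-style block and a Res-path-style skip connection. It is an illustration under stated assumptions, not the authors' implementation: the chained 3x3 convolutions with a 1x1 shortcut follow the general design described in the paper as we understand it, while the helper names (`conv2d_same`, `multires_block`, `res_path`, `rand_w`), the channel counts, and the random weights are hypothetical. Batch normalization and training are omitted.

```python
import numpy as np

def conv2d_same(x, w):
    """Zero-padded 'same' convolution (cross-correlation, as in DL libraries).
    x: (H, W, C_in); w: (k, k, C_in, C_out) with odd k."""
    k = w.shape[0]
    p = k // 2
    xp = np.pad(x, ((p, p), (p, p), (0, 0)))
    H, W, _ = x.shape
    out = np.zeros((H, W, w.shape[3]))
    for i in range(k):
        for j in range(k):
            # (H, W, C_in) @ (C_in, C_out) -> (H, W, C_out)
            out += xp[i:i + H, j:j + W] @ w[i, j]
    return out

def relu(x):
    return np.maximum(x, 0.0)

def multires_block(x, w3_list, w1):
    """Chain several 3x3 convolutions and concatenate every intermediate
    output, so the result mixes small and larger receptive fields.
    A 1x1 convolution of the input is added as a residual shortcut."""
    feats, h = [], x
    for w in w3_list:
        h = relu(conv2d_same(h, w))
        feats.append(h)
    multi = np.concatenate(feats, axis=-1)
    return relu(multi + conv2d_same(x, w1))

def res_path(x, stages):
    """Process encoder features with residual conv stages before they
    reach the decoder, instead of copying them across unchanged."""
    for w3, w1 in stages:
        x = relu(conv2d_same(x, w3) + conv2d_same(x, w1))
    return x

# Hypothetical sizes and random weights, chosen only for the demo.
rng = np.random.default_rng(0)

def rand_w(k, c_in, c_out):
    return 0.1 * rng.standard_normal((k, k, c_in, c_out))

x = rng.standard_normal((8, 8, 3))          # toy 8x8 image, 3 channels
branch = [2, 4, 6]                          # assumed per-conv channel split
w3_list, c_in = [], 3
for c_out in branch:
    w3_list.append(rand_w(3, c_in, c_out))
    c_in = c_out
w1 = rand_w(1, 3, sum(branch))              # shortcut matches concat width

y = multires_block(x, w3_list, w1)
z = res_path(y, [(rand_w(3, 12, 12), rand_w(1, 12, 12)) for _ in range(2)])
print(y.shape, z.shape)                     # (8, 8, 12) (8, 8, 12)
```

In a full network, a block like `multires_block` would stand in for the paired convolutions at each U-Net level, and `res_path` would sit on each encoder-to-decoder skip connection. Plain loops are used here for readability; a real implementation would rely on a deep learning framework's convolution layers.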