Normalization in Training U-Net for 2-D Biomedical Semantic Segmentation

Xiao-Yun Zhou,Guang-Zhong Yang
DOI: https://doi.org/10.1109/lra.2019.2896518
IF: 5.2
2019-01-01
IEEE Robotics and Automation Letters
Abstract:Two-dimensional (2-D) biomedical semantic segmentation is important for robotic vision in surgery. Segmentation methods based on deep convolutional neural network (DCNN) can out-perform conventional methods in terms of both accuracy and levels of automation. One common issue in training a DCNN for biomedical semantic segmentation is the internal covariate shift where the training of convolutional kernels is encumbered by the distribution change of input features, hence both the training speed and performance are decreased. Batch normalization (BN) is the first proposed method for addressing internal covariate shift and is widely used. Instance normalization (IN) and layer normalization (LN) have also been proposed. Group normalization (GN) is proposed more recently and has not yet been applied to 2-D biomedical semantic segmentation (GN was used in 3-D biomedical semantic segmentation in [P.-Y. Kao, T. Ngo, A. Zhang, J. Chen, and B. Manjunath, Brain tumor segmentation and tractographic feature extraction from structural MR images for overall survival prediction 2018, arXiv:1807.07716], however, no specific validations on GN were given). Most DCNNs for biomedical semantic segmentation adopt BN as the normalization method by default, without reviewing its performance. In this letter, four normalization methods-BN, IN, LN, and GN are compared in details, specifically for 2-D biomedical semantic segmentation. U-Net is adopted as the basic DCNN structure. Three datasets regarding the right ventricle, aorta, and left ventricle are used for the validation. The results show that detailed subdivision of the feature map, i.e., GN with a large group number or IN, achieves higher accuracy. This accuracy improvement mainly comes from better model generalization. Codes are uploaded and maintained at Xiao-Yun Zhou's Github.
What problem does this paper attempt to address?