New Building Extraction Method Based on Semantic Segmentation

LONG Lihong, ZHU Yuting, YAN Jingwen, LIU Jingjin, WANG Zongyue
DOI: https://doi.org/10.11834/jrs.20211029
2023-01-01
Journal of Remote Sensing
Abstract: Semantic segmentation of high-resolution remote sensing images has important theoretical and practical value in aerial image analysis. However, because building semantics are rich and image backgrounds are complex in high-resolution remote sensing images, traditional segmentation methods are prone to blurred edges, loss of detail, and low resolution. An end-to-end convolutional neural network called Dilated-UNet (D-UNet) is proposed to address the fuzzy boundaries and information loss that arise in semantic segmentation of high-resolution satellite images. First, the U-Net structure is improved by adding a four-channel multiscale dilated convolution module built with a division technique. Each channel uses a different dilation rate, so the module captures semantic information at multiple scales and extracts richer detail. Second, a joint loss function combining cross entropy and the Dice coefficient is designed to achieve the desired segmentation effect. The model is evaluated and tested on the Inria aerial image dataset. Experimental results show that the proposed method can effectively segment urban buildings at the pixel level from high-resolution remote sensing images, and that its segmentation accuracy is higher than that of the other methods compared. The proposed D-UNet delivers automatic, high-accuracy building segmentation from high-resolution remote sensing images, which makes it a useful tool for practical application scenarios.
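The joint loss mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the binary (building versus background) setting, the weighting parameter `alpha`, the `smooth` term in the Dice loss, and the function names. It uses only the Python standard library and operates on flat lists of per-pixel probabilities.

```python
import math


def cross_entropy_loss(probs, labels, eps=1e-7):
    """Mean binary cross entropy over pixels (assumed binary building mask)."""
    total = 0.0
    for p, y in zip(probs, labels):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(probs)


def dice_loss(probs, labels, smooth=1.0):
    """1 - soft Dice coefficient; `smooth` is an assumed stabilising term."""
    intersection = sum(p * y for p, y in zip(probs, labels))
    dice = (2.0 * intersection + smooth) / (sum(probs) + sum(labels) + smooth)
    return 1.0 - dice


def joint_loss(probs, labels, alpha=0.5):
    """Weighted sum of the two losses; the weight `alpha` is hypothetical."""
    ce = cross_entropy_loss(probs, labels)
    return alpha * ce + (1.0 - alpha) * dice_loss(probs, labels)


# Toy 6-pixel mask: 1 = building, 0 = background.
labels = [0, 1, 1, 0, 1, 0]
good = [0.1, 0.9, 0.9, 0.1, 0.9, 0.1]  # confident and correct
bad = [0.9, 0.1, 0.1, 0.9, 0.1, 0.9]   # confident and wrong

print(joint_loss(good, labels), joint_loss(bad, labels))
```

The cross-entropy term penalises each pixel independently, while the Dice term measures overlap between the predicted and true masks as a whole, which is one common motivation for combining the two when buildings occupy a small share of the image.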