Diamond-Unet: A Novel Semantic Segmentation Network Based on U-Net Network and Transformer for Deep Space Rock Images

Guocheng Li,Bobo Xi,Yufei He,Tie Zheng,Yunsong Li,Changbin Xue,Jocelyn Chanussot
DOI: https://doi.org/10.1109/lgrs.2024.3397870
IF: 5.343
2024-05-22
IEEE Geoscience and Remote Sensing Letters
Abstract:Extracting rock objects from the surface of celestial bodies in deep space exploration environments is crucial for self-service path planning, navigation of detectors, and regional information evaluation. Most existing image semantic segmentation frameworks decrease the spatial resolution of the feature maps as networks deepen, resulting in limitations in detecting small targets and the inability to accurately segment boundary regions. In this letter, we propose a novel semantic segmentation network based on U-Net network and Transformer for deep space rock images, referred to as Diamond-Unet. This model integrates overcomplete and undercomplete branches and incorporates a global–local feature extraction (GLFE) module based on Transformer and convolutional neural network (CNN) technologies to effectively capture discriminative information. Furthermore, an innovative feature cross-fusion path (FCFP) is introduced to enhance information exchange between the dual-branch networks, enabling the capture of both fine-grained details and coarse-grained semantics in the full-scale image segmentation architecture. Experimental results demonstrate that the Diamond-Unet achieves the mean intersection over union (MIoU) scores of 79.32% and 93.43% on two public datasets, which are superior to the compared methods.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?