MATNet: multiattention Transformer network for cropland semantic segmentation in remote sensing images

Zixuan Zhang Liang Huang Bo-Hui Tang Weipeng Le Meiqi Wang Jiapei Cheng Qiang Wu a Faculty of Land Resources Engineering,Kunming University of Science and Technology,Kunming,People' s Republic of People's Republic of Chinab Key Laboratory of Plateau Remote Sensing,Yunnan Provincial Department of Education,Kunming University of Science and Technology,Kunming,People's Republic of China
DOI: https://doi.org/10.1080/17538947.2024.2392845
IF: 4.606
2024-08-19
International Journal of Digital Earth
Abstract:Remote sensing image semantic segmentation methods have become the main approach for extracting cropland information. However, in the mountainous regions of southwestern China, croplands exhibit narrow and fragmented shapes, as well as complex planting patterns, making it difficult for traditional semantic segmentation methods to accurately delineate fine-grained cropland boundaries. To address these challenges, a multiattention Transformer network named MATNet is proposed in this paper, for fine-grained extraction of cropland at the parcel level in complex scenes. MATNet built upon the fusion of CNN encoder and Transformer decoder. In the encoder, spatial and channel reconstruction units are introduced, reducing information redundancy in the convolutional layers. The Transformer decoder incorporates multiple attention mechanisms, this design feature enhances the attention window's perception of local content and improves the model's ability to extract features from fine-grained cropland parcels through optimized computationnal al location. Taking the experimental results of the Dali cropland dataset as an illustration, MATNet achieved the highest values across five evaluation metrics, including mIoU. Specifically, the Recall, F1, and mIoU scores were 94.68%, 94.69%, and 89.92%, respectively. Compared with six other advanced models, MATNet consistently performed best in terms of extracting fine-grained cropland parcels.
geography, physical,remote sensing
What problem does this paper attempt to address?