Abstract:With the development of deep learning and remote sensing technologies in recent years, many semantic segmentation methods based on convolutional neural networks (CNNs) have been applied to road extraction. However, previous deep learning-based road extraction methods primarily used RGB imagery as an input and did not take advantage of the spectral information contained in hyperspectral imagery. These methods can produce discontinuous outputs caused by objects with similar spectral signatures to roads. In addition, the images obtained from different Earth remote sensing sensors may have different spatial resolutions, enhancing the difficulty of the joint analysis. This work proposes the Multiscale Fusion Attention Network (MSFANet) to overcome these problems. Compared to traditional road extraction frameworks, the proposed MSFANet fuses information from different spectra at multiple scales. In MSFANet, multispectral remote sensing data is used as an additional input to the network, in addition to RGB remote sensing data, to obtain richer spectral information. The Cross-source Feature Fusion Module (CFFM) is used to calibrate and fuse spectral features at different scales, reducing the impact of noise and redundant features from different inputs. The Multiscale Semantic Aggregation Decoder (MSAD) fuses multiscale features and global context information from the upsampling process layer by layer, reducing information loss during the multiscale feature fusion. The proposed MSFANet network was applied to the SpaceNet dataset and self-annotated images from Chongzhou, a representative city in China. Our MSFANet performs better over the baseline HRNet by a large margin of +6.38 IoU and +5.11 F1-score on the SpaceNet dataset, +3.61 IoU and +2.32 F1-score on the self-annotated dataset (Chongzhou dataset). Moreover, the effectiveness of MSFANet was also proven by comparative experiments with other studies.

Gradient-Guided Multi-Scale Focal Attention Network for Remote Sensing Scene Classification

A lightweight and stochastic depth residual attention network for remote sensing scene classification

Remote Sensing Scene Image Classification Model Based on Multi-Scale Features and Attention Mechanism

Enhanced Multi-Scale Feature Adaptive Fusion Sparse Convolutional Network for Large-Scale Scenes Semantic Segmentation

Remote Sensing Scene Classification by Gated Bidirectional Network

Multilayer Feature Fusion Network With Spatial Attention and Gated Mechanism for Remote Sensing Scene Classification

Class-level Prototype Guided Multi-Scale Feature Learning for Remote Sensing Scene Classification with Limited Labels

DBGA-Net: Dual-Branch Global–Local Attention Network for Remote Sensing Scene Classification

SGMNet: Scene Graph Matching Network for Few-Shot Remote Sensing Scene Classification

GCSANet: A Global Context Spatial Attention Deep Learning Network for Remote Sensing Scene Classification

Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation

Progressive Feature Fusion Framework Based on Graph Convolutional Network for Remote Sensing Scene Classification

MSFANet: Multiscale Fusion Attention Network for Road Segmentation of Multispectral Remote Sensing Data

Multi-scale Feature Aggregation Network for Salient Object Detection in Optical Remote Sensing Images

Multiscale Attention Fusion Graph Network for Remote Sensing Building Change Detection

Frequency and spatial based multi-layer context network (FSCNet) for remote sensing scene classification

MGFN: A Multi-Granularity Fusion Convolutional Neural Network for Remote Sensing Scene Classification

MGML: Multigranularity Multilevel Feature Ensemble Network for Remote Sensing Scene Classification

Cross-Attention-Driven Adaptive Graph Relational Network for Multilabel Remote Sensing Scene Classification

SGMFNet: a remote sensing image object detection network based on spatial global attention and multi-scale feature fusion

Multi-Label Remote Sensing Image Scene Classification by Combining a Convolutional Neural Network and a Graph Neural Network