Abstract:Abstract. The urban road network detection and extraction have significant applications in many domains, such as intelligent transportation and navigation, urban planning, and automatic driving. Although manual annotation methods can provide accurate road network maps, their low efficiency with high-cost consumption are insufficient for the current tasks. Traditional methods based on spectral or geometric information rely on shallow features and often struggle with low semantic segmentation accuracy in complex remote sensing backgrounds. In recent years, deep convolutional neural networks (CNN) have provided robust feature representations to distinguish complex terrain objects. However, these CNNs ignore the fusion of global-local contexts and are often confused with other types of features, especially buildings. In addition, conventional convolution operations use a fixed template paradigm to aggregate local feature information. The road features present complex linear-shape geometric relationships, which brings some obstacles to feature construction. To address the above issues, we proposed a hybrid network structure that combines the advantages of CNN and transformer models. Specifically, a multiscale deformable convolution module has been developed to capture local road context information adaptively. The Transformer model is introduced into the encoder to enhance semantic information to build the global context. Meanwhile, the CNN features are fused with the transformer features. Finally, the model outputs a road extraction prediction map in high spatial resolution. Quantitative analysis and visual expression confirm that the proposed model can effectively and automatically extract road features from complex remote sensing backgrounds, outperforming state-of-the-art methods with IOU by 86.5% and OA by 97.4%.

Hsgnet: A Road Extraction Network Based On Global Perception Of High-Order Spatial Information

A Saliency-Aware Deep Network for Narrow Road Extraction of High-Resolution Remote Sensing Imagery

Automatic Road Extraction from High-Resolution Remote Sensing Images Using a Method Based on Densely Connected Spatial Feature-Enhanced Pyramid

GA-Net: A geometry prior assisted neural network for road extraction

A Global Context-aware and Batch-independent Network for road extraction from VHR satellite imagery

A novel small-signal modeling and simulation technique in SiGe: C HBT for ultra high frequency applications

Global–Local Information Fusion Network for Road Extraction: Bridging the Gap in Accurate Road Segmentation in China

AGD-Linknet: A Road Semantic Segmentation Model for High Resolution Remote Sensing Images Integrating Attention Mechanism, Gated Decoding Block and Dilated Convolution

C2Net: Road Extraction via Context Perception and Cross Spatial-Scale Feature Interaction

Road Extraction from High-Resolution Remote Sensing Images via Local and Global Context Reasoning

TransRoadNet: A Novel Road Extraction Method for Remote Sensing Images via Combining High-Level Semantic Feature and Context

DEGANet: Road Extraction Using Dual-Branch Encoder With Gated Attention Mechanism

Multiscale Global Attention Network With Edge Perceptron for Automatic Road Extraction From Remote Sensing Imagery

DA-RoadNet: A Dual-Attention Network for Road Extraction From High Resolution Satellite Imagery

Rse-net: Road-shape enhanced neural network for Road extraction in high resolution remote sensing image

Fine-Grained Extraction of Road Networks via Joint Learning of Connectivity and Segmentation

Road Extraction With Satellite Images and Partial Road Maps

DCTNET: HYBRID NETWORK MODEL FUSING WITH MULTISCALE DEFORMABLE CNN AND TRANSFORMER STRUCTURE FOR ROAD EXTRACTION FROM GAOFEN SATELLITE REMOTE SENSING IMAGE

Bi-HRNet: A Road Extraction Framework from Satellite Imagery Based on Node Heatmap and Bidirectional Connectivity

AGF-Net: adaptive global feature fusion network for road extraction from remote-sensing images

A new two-step road extraction method in high resolution remote sensing images