Abstract: Automatic and accurate semantic segmentation of high-resolution remote-sensing images plays an important role in aerial image analysis. Dense semantic segmentation requires that a semantic label be assigned to each pixel in the image. Recently, convolutional neural networks (CNNs) have proven to be powerful tools for image classification, and they have been adopted by the remote-sensing community. However, many limitations remain when modern CNN architectures are applied directly to remote-sensing images, such as gradient explosion as network depth increases, over-fitting with limited labeled remote-sensing data, and special differences between remote-sensing images and natural images. In this paper, we present a novel architecture, referred to as DFCN, that combines the ideas of dense connections and fully convolutional networks to automatically provide fine-grained semantic segmentation maps. In addition, we improve DFCN with multi-scale filters that widen the network and increase the richness and diversity of the extracted information, making the network more powerful and expressive than a naive convolution layer allows. Furthermore, we investigate a multi-modal network that incorporates digital surface models (DSMs) into the DFCN structure, and we propose dual-path densely convolutional networks in which the encoder consists of two paths that extract features from spectral data and DSM data, respectively, and then fuse them. Finally, we test the proposed models through comprehensive experimental evaluations on two remote-sensing benchmark datasets and compare them with other deep networks. The results demonstrate the effectiveness of the proposed approaches, which achieve competitive performance compared with current state-of-the-art methods.
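
To make the dense-connection and dual-path ideas in the abstract more concrete, the following is a minimal sketch that only tracks channel counts; it performs no convolutions and is not the authors' implementation. It assumes the usual dense-connectivity rule, in which each layer receives the concatenation of the block input and all earlier layer outputs. The growth rate, layer counts, input widths, and the choice of channel-wise concatenation as the fusion step are all hypothetical, since the abstract does not specify them.

```python
from typing import List


def dense_block_layer_inputs(in_channels: int, growth_rate: int, num_layers: int) -> List[int]:
    """Input channel count seen by each layer of a dense block.

    Layer i receives the block input concatenated with the outputs of
    the i earlier layers, each of which adds `growth_rate` channels.
    """
    return [in_channels + i * growth_rate for i in range(num_layers)]


def dense_block_output(in_channels: int, growth_rate: int, num_layers: int) -> int:
    """Channels leaving the block: the input plus every layer's new feature maps."""
    return in_channels + num_layers * growth_rate


def fuse_by_concat(spectral_channels: int, dsm_channels: int) -> int:
    """Assumed fusion rule (not stated in the abstract): channel-wise concatenation."""
    return spectral_channels + dsm_channels


# Hypothetical sizes chosen purely for illustration.
per_layer_inputs = dense_block_layer_inputs(in_channels=48, growth_rate=12, num_layers=4)
spectral_out = dense_block_output(in_channels=48, growth_rate=12, num_layers=4)
dsm_out = dense_block_output(in_channels=16, growth_rate=12, num_layers=4)
fused = fuse_by_concat(spectral_out, dsm_out)

print(per_layer_inputs, spectral_out, dsm_out, fused)
```

The point of the sketch is that dense connections make the input width of each layer grow linearly with depth inside a block, and that a two-path encoder yields a wider fused representation than either the spectral or the DSM path alone.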

Learning to Segment Objects of Various Sizes in VHR Aerial Images

High-Resolution Aerial Imagery Semantic Labeling With Dense Pyramid Network

Semantic Segmentation of Large-Size VHR Remote Sensing Images Using a Two-Stage Multiscale Training Architecture

High-Resolution Remote Sensing Image Semantic Segmentation Method Based on Improved Encoder-Decoder Convolutional Neural Network

Dense Pyramid Network for Semantic Segmentation of High Resolution Aerial Imagery

Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

Semantic Labeling Of High Resolution Aerial Imagery And Lidar Data With Fine Segmentation Network

Dual-Path Geometry-Aware Network for Semantic Segmentation of High-Resolution Aerial Images

Small Object Segmentation Using Dilated Convolutions With Increasing-Decreasing Dilation

Contextual Pyramid Attention Network for Building Segmentation in Aerial Imagery

Densely Based Multi-Scale and Multi-Modal Fully Convolutional Networks for High-Resolution Remote-Sensing Image Semantic Segmentation

Semantic Segmentation of Very-High-Resolution Remote Sensing Images via Deep Multi-Feature Learning

Cascaded CNN and Global–Local Attention Transformer Network-Based Semantic Segmentation for High-Resolution Remote Sensing Image

An Object-Aware Network Embedding Deep Superpixel for Semantic Segmentation of Remote Sensing Images

Building Segmentation From Airborne VHR Images Using Mask R-CNN

A Dual-Path and Lightweight Convolutional Neural Network for High-Resolution Aerial Image Segmentation

A Lightweight CNN–Transformer Network With Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation

Classification of Very-High-Spatial-Resolution Aerial Images Based on Multiscale Features with Limited Semantic Information

Few-Shot Aerial Image Semantic Segmentation Leveraging Pyramid Correlation Fusion

AF2: Adaptive Focus Framework for Aerial Imagery Segmentation