Abstract: Semantic segmentation of oblique unmanned aerial vehicle (UAV) images serves as a foundation for many modern urban applications, such as road scene monitoring and semantic 3D modeling. However, objects in UAV images can vary greatly in size and undergo severe perspective distortion because of the oblique viewing angle. Existing general segmentation models designed for ground and remote sensing images rarely consider these challenges specific to UAV images. As a result, they struggle to learn discriminative representations for reasoning about the extremely large and extremely small objects in UAV images at the same time. In this paper, we propose a dense context distillation network (DCDNet) to learn distortion-robust feature representations for semantic segmentation of UAV images. The basic DCDNet is built as a dual-branch encoder–decoder architecture. To accomplish dense context distillation, DCDNet is first equipped with several cross-scale context selectors at different encoding stages, which densely and selectively gather useful context from low- to high-level dual-scale feature maps. Joint supervision is then applied to reinforce the learning of shallower features, distilling more of the low-level context that is vital to reasoning about small or thin structures. Finally, a multi-scale feature aggregator is incorporated to adaptively fuse the long-range context during decoding, combining the complementary strengths of the dense context captured from feature maps of different levels. With dense context distillation, DCDNet is better able to supply objects of different scales with the context they need for learning and prediction. Extensive experiments on the challenging UAVid dataset demonstrate that DCDNet adapts well to oblique UAV images, achieving state-of-the-art segmentation performance with an mIoU of 72.38%.
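
The mIoU reported in the abstract is the mean intersection over union across classes. As a rough illustration of how this metric is commonly computed, the sketch below builds a confusion matrix from flattened label maps and averages the per-class IoU. The function names `confusion_matrix` and `mean_iou` are hypothetical, and skipping classes that appear in neither the prediction nor the ground truth is an assumption; the exact evaluation protocol used on UAVid is not described in this abstract.

```python
from typing import List, Sequence


def confusion_matrix(y_true: Sequence[int], y_pred: Sequence[int],
                     num_classes: int) -> List[List[int]]:
    """Count pixels; rows index the ground-truth class, columns the predicted class."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for t, p in zip(y_true, y_pred):
        matrix[t][p] += 1
    return matrix


def mean_iou(y_true: Sequence[int], y_pred: Sequence[int],
             num_classes: int) -> float:
    """Average the per-class IoU = TP / (TP + FP + FN) over the classes present."""
    matrix = confusion_matrix(y_true, y_pred, num_classes)
    ious = []
    for c in range(num_classes):
        tp = matrix[c][c]
        fp = sum(matrix[r][c] for r in range(num_classes)) - tp
        fn = sum(matrix[c]) - tp
        union = tp + fp + fn
        # Assumption: a class absent from both maps is skipped, not scored as 0.
        if union > 0:
            ious.append(tp / union)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Toy flattened label maps with three classes (illustrative data only).
    truth = [0, 0, 1, 1, 2, 2]
    pred = [0, 1, 1, 1, 2, 0]
    print(f"mIoU = {mean_iou(truth, pred, 3):.4f}")
```

In the toy example the per-class IoU values are 1/3, 2/3, and 1/2. Because every class contributes equally to the average regardless of how many pixels it covers, errors on small or thin objects weigh on the score as much as errors on large ones.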

Aerial image semantic segmentation using DCNN predicted distance maps

Optimizing Spatial Relationships in GCN to Improve the Classification Accuracy of Remote Sensing Images

A Dual-Path and Lightweight Convolutional Neural Network for High-Resolution Aerial Image Segmentation

Improving Semantic Image Segmentation with a Probabilistic Superpixel-Based Dense Conditional Random Field

Densely Based Multi-Scale and Multi-Modal Fully Convolutional Networks for High-Resolution Remote-Sensing Image Semantic Segmentation

Semantic Segmentation of Aerial Imagery Via Split-Attention Networks with Disentangled Nonlocal and Edge Supervision

Semantic Segmentation of Aerial Image Using Fully Convolutional Network

Semantic Segmentation for High-Resolution Aerial Imagery Using Multi-Skip Network and Markov Random Fields

A Top-Down Manner-Based DCNN Architecture for Semantic Image Segmentation

High-Resolution Aerial Imagery Semantic Labeling With Dense Pyramid Network

An Aerial Image Segmentation Approach Based on Enhanced Multi-Scale Convolutional Neural Network

Dense Convolutional Networks for Semantic Segmentation

A Relation-Augmented Fully Convolutional Network for Semantic Segmentation in Aerial Scenes

Learning Contextual Information For Indoor Semantic Segmentation

Semantic Image Segmentation with Deep Convolutional Nets and Fully Connected CRFs

Gated Convolutional Neural Network for Semantic Segmentation in High-Resolution Images

Remote Sensing Image Semantic Segmentation Method Based on a Deep Convolutional Neural Network and Multiscale Feature Fusion

Real-Time High-Performance Semantic Image Segmentation of Urban Street Scenes

Semantic Labeling of Very High-Resolution Imagery by Leveraging Contextual Information with Optimized Non-Local Neural Network

Dense Context Distillation Network for Semantic Parsing of Oblique UAV Images

DCANet: Dense Context-Aware Network for Semantic Segmentation