Abstract: Traditional pyramid pooling modules improve semantic segmentation by capturing multi-scale feature information. However, their shallow structure fails to fully extract contextual information, and the fused multi-scale features lack distinctiveness, which weakens the discriminability of the final segmentation. To address these issues, we propose FCPFNet, a network that uses a global contextual prior to extract detailed information from deep features. Specifically, we introduce a novel deep feature aggregation module that extracts semantic information from the output feature map of each layer by deeply aggregating context information, which expands the effective perception range. Additionally, we propose an Efficient Pyramid Pooling Module (EPPM) that captures distinctive features by exchanging information between different sub-features and then performs multi-scale fusion. EPPM is integrated as a branch within the network to compensate for the information lost in downsampling operations. Furthermore, to preserve rich image detail while maintaining a large receptive field for more contextual information, EPPM concatenates the input feature map with the output feature map of the pyramid pooling module to obtain more comprehensive global contextual information. Experiments show that the proposed method achieves competitive performance on the challenging scene segmentation datasets Pascal VOC 2012, Cityscapes and COCO-Stuff, with mIoU of 81.0%, 78.8% and 40.1%, respectively.
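The concatenation step described in the abstract can be illustrated with a minimal sketch of generic pyramid pooling on a single-channel feature map. This is not the authors' EPPM: the exchange of information between sub-features is omitted, the bin sizes `(1, 2)` and all function names are hypothetical, and nearest-neighbour upsampling stands in for whatever interpolation the paper actually uses.

```python
from typing import List, Sequence

Grid = List[List[float]]  # one channel, H x W


def adaptive_avg_pool(fmap: Grid, bins: int) -> Grid:
    """Average-pool an H x W map into a bins x bins grid of regions."""
    h, w = len(fmap), len(fmap[0])
    pooled = []
    for i in range(bins):
        # Region bounds: floor for the start, ceil for the end.
        r0, r1 = (i * h) // bins, ((i + 1) * h + bins - 1) // bins
        row = []
        for j in range(bins):
            c0, c1 = (j * w) // bins, ((j + 1) * w + bins - 1) // bins
            cells = [fmap[r][c] for r in range(r0, r1) for c in range(c0, c1)]
            row.append(sum(cells) / len(cells))
        pooled.append(row)
    return pooled


def upsample_nearest(pooled: Grid, h: int, w: int) -> Grid:
    """Resize a bins x bins map back to H x W (nearest neighbour, an assumption)."""
    bins = len(pooled)
    return [[pooled[(r * bins) // h][(c * bins) // w] for c in range(w)]
            for r in range(h)]


def pyramid_pool_concat(fmap: Grid, bin_sizes: Sequence[int] = (1, 2)) -> List[Grid]:
    """Return the input map plus one upsampled pooled map per pyramid level."""
    h, w = len(fmap), len(fmap[0])
    channels = [fmap]  # keep the input map so fine detail is preserved
    for bins in bin_sizes:
        channels.append(upsample_nearest(adaptive_avg_pool(fmap, bins), h, w))
    return channels  # "concatenation" along a channel axis


fmap = [[1.0, 2.0, 3.0, 4.0],
        [5.0, 6.0, 7.0, 8.0],
        [9.0, 10.0, 11.0, 12.0],
        [13.0, 14.0, 15.0, 16.0]]
stacked = pyramid_pool_concat(fmap)
```

Here `stacked` holds three channels: the original map, a global-average channel (coarsest context), and a 2 x 2 regional-average channel. Keeping the input alongside the pooled maps is what lets later layers see both fine detail and wide context.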

Semantic segmentation based on double pyramid network with improved global attention mechanism

SAFPN: a Full Semantic Feature Pyramid Network for Object Detection

Encoder-decoder with double spatial pyramid for semantic segmentation.

GPNet: Gated pyramid network for semantic segmentation

Chemical signalling in the nervous system.

DPNet: Dual-Pyramid Semantic Segmentation Network Based on Improved Deeplabv3 Plus

PCANet: Pyramid convolutional attention network for semantic segmentation

Semantic segmentation based on enhanced gated pyramid network with lightweight attention module

FCPFNet: Feature Complementation Network with Pyramid Fusion for Semantic Segmentation

High-Resolution Aerial Imagery Semantic Labeling With Dense Pyramid Network

Attention Guided Global Enhancement and Local Refinement Network for Semantic Segmentation

Adaptive multi-scale dual attention network for semantic segmentation

DARSegNet: A Real-Time Semantic Segmentation Method Based on Dual Attention Fusion Module and Encoder-Decoder Network

An Attention-Fused Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

A Method of Image Semantic Segmentation Based on PSPNet

Semantic Image Segmentation with Improved Position Attention and Feature Fusion

Semantic Segmentation With Attention Mechanism for Remote Sensing Images

Dense Pyramid Network for Semantic Segmentation of High Resolution Aerial Imagery.

DPANET: Dual Pooling Attention Network for Semantic Segmentation

Ppednet: Pyramid Pooling Encoder-Decoder Network For Real-Time Semantic Segmentation

PPNet: Pooling Position Attention Network for Semantic Segmentation