Abstract:Existing semantic segmentation networks perform well in accuracy by spending much computation. However, for practical applications, not only high segmentation accuracy but also high inference speed is required. To solve the problem of the difficult balance between accuracy and speed, we propose a new real-time semantic segmentation network (FBRNet). To extract multi-scale semantic information more quickly, we propose a lightly weighted reinforced atrous spatial pyramid pooling module (arASPP) based on the attention mechanism, which can extract richer and more advanced features with less computation than the original ASPP. To eliminate the semantic gap between high- and low-level features, we propose a new feature fusion module (CSFM), in which a shuffling mechanism is introduced to enhance robustness, and a parallel contextual information enhancement module and detail information enhancement module are built to facilitate the information exchange between high- and low-level features, achieving the effect of improving the model feature representation. Finally, we also introduce high-level features, fusing Laplace convolution and spatial attention mechanisms, and design the edge feature reinforcement module (LABRM) to eliminate the noise of low-level features and compensate for the model's segmentation effect target boundary. In the Cityscapes validation set and test set, FBRNet achieves 77.63% and 75.3% mIoU, and 101.9 FPS on a single tesla-T4 GPU, also achieved 72.4% mIoU and 89.8 FPS on the CamVid dataset and 55.2% mIoU and 100.8 FPS on the BDD100K dataset, which is a better balance of accuracy and speed compared with existing networks. The code is available at https://github.com/little5570/FBRNet.

LRFNet: An Occlusion Robust Fusion Network for Semantic Segmentation with Light Field

NLFNet: Non-Local Fusion Towards Generalized Multimodal Semantic Segmentation Across RGB-Depth, Polarization, and Thermal Images

LRNet: lightweight attention-oriented residual fusion network for light field salient object detection

End-to-End Semantic Segmentation Utilizing Multi-scale Baseline Light Field

LFFNet: lightweight feature-enhanced fusion network for real-time semantic segmentation of road scenes

LMFNet: An Efficient Multimodal Fusion Approach for Semantic Segmentation in High-Resolution Remote Sensing

Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation

Feature Fusion Detector for Semantic Cognition of Remote Sensing

Semantic Segmentation With Light Field Imaging and Convolutional Neural Networks

Residual Spatial Fusion Network for RGB-Thermal Semantic Segmentation

Mask-R-FCN: A Deep Fusion Network for Semantic Segmentation.

DeOccNet: Learning to See Through Foreground Occlusions in Light Fields

LRNNet: A Light-Weighted Network with Efficient Reduced Non-Local Operation for Real-Time Semantic Segmentation

EFRNet: A Lightweight Network with Efficient Feature Fusion and Refinement for Real-Time Semantic Segmentation

ARFNet: Attention-Oriented Refinement and Fusion Network for Light Field Salient Object Detection

FRNet: Frustum-Range Networks for Scalable LiDAR Segmentation

Deep Feature Selection-And-Fusion for RGB-D Semantic Segmentation

Light field super-resolution using complementary-view feature attention

FBRNet: a feature fusion and border refinement network for real-time semantic segmentation

Semi-Supervised Semantic Segmentation for Light Field Images Using Disparity Information

MFVNet: a deep adaptive fusion network with multiple field-of-views for remote sensing image semantic segmentation