Abstract:Semantic segmentation plays a crucial role in practical applications, such as autonomous driving and robot navigation. However, prevalent semantic segmentation networks suffer from two primary challenges: oversized networks with redundant parameters that hinder network inference speed and excessively lightweight network structures that sacrifice semantic segmentation accuracy. Therefore, it is essential to design a semantic segmentation network that strikes a balance between accuracy and inference speed. We propose the asymmetric residual bottleneck module, which incorporates dilated convolution, depth-wise separable asymmetric convolution, channel attention mechanism, and a channel shuffle unit. By utilizing these components, model parameters are effectively reduced, and inference speed is accelerated. Furthermore, a feature aggregation module is designed to integrate features from feature maps with various resolutions, thereby enhancing segmentation accuracy. Based on these advancements, an efficient and lightweight real-time semantic segmentation network called efficiently lightweight asymmetrical network (ELANet) is proposed. Experimental results of the Cityscapes and CamVid datasets demonstrate that ELANet strikes a favorable balance between speed and accuracy. Notably, without any pretrained model and postprocessing scheme, ELANet achieves an impressive mean intersection over union of 72.5% on the Cityscapes test dataset with only 0.82 million parameters, operating at an inference speed of 173.5 frames per second on a single NVIDIA GTX 3090 GPU, with a 512×1024 input image. These findings underscore ELANet’s tremendous potential for real-time applications.

Senet: Spatial Information Enhancement for Semantic Segmentation Neural Networks

EHANet: Efficient Hybrid Attention Network Towards Real-time Semantic Segmentation

Spatial-Assistant Encoder-Decoder Network for Real Time Semantic Segmentation

Real-Time Semantic Segmentation via Spatial-Detail Guided Context Propagation

Real-time Semantic Segmentation in Traffic Scene Using Cross Stage Partial-based Encoder–decoder Network

BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation

Edge-guided Nonlinear Dynamic Convolution Network for Lightweight Semantic Segmentation

EANET: Efficient Attention-Augmented Network for Real-Time Semantic Segmentation.

ELANet: an efficiently lightweight asymmetrical network for real-time semantic segmentation

DESENet: a bilateral network with detail-enhanced semantic encoder for real-time semantic segmentation

Efficient Dense Modules of Asymmetric Convolution for Real-Time Semantic Segmentation

DARSegNet: A Real-Time Semantic Segmentation Method Based on Dual Attention Fusion Module and Encoder-Decoder Network

DSANet: Dilated Spatial Attention for Real-Time Semantic Segmentation in Urban Street Scenes.

Esnet: Edge-Based Segmentation Network For Real-Time Semantic Segmentation In Traffic Scenes

Semantics Recalibration and Detail Enhancement Network for Real‐time Semantic Segmentation

BiSeNet V3: Bilateral Segmentation Network with Coordinate Attention for Real-time Semantic Segmentation

Multiple Resolutions Detail Enhancement Network for Real-Time Image Semantic Segmentation.

Real-time Semantic Segmentation Via Region and Pixel Context Network.

A Lightweight Network for Fast Semantic Segmentation.

NDNet: Narrow While Deep Network for Real-Time Semantic Segmentation