Abstract:In recent years, intelligent driving navigation and security monitoring have made considerable progress with the help of deep Convolutional Neural Networks (CNNs). As one of the state-of-the-art perception approaches, semantic segmentation unifies distinct detection tasks widely desired by both autonomous driving and security monitoring. Currently, semantic segmentation shows remarkable efficiency and reliability in standard scenarios such as daytime scenes with favorable illumination conditions. However, in face of adverse conditions such as the nighttime, semantic segmentation loses its accuracy significantly. One of the main causes of the problem is the lack of sufficient annotated segmentation datasets of nighttime scenes. In this paper, we propose a framework to alleviate the accuracy decline when semantic segmentation is taken to adverse conditions by using Generative Adversarial Networks (GANs). To bridge the daytime and nighttime image domains, we made key observation that compared to datasets in adverse conditions, there are considerable amount of segmentation datasets in standard conditions such as BDD and our collected ZJU datasets. Our GAN-based nighttime semantic segmentation framework includes two methods. In the first method, GANs were used to translate nighttime images to the daytime, thus semantic segmentation can be performed using robust models already trained on daytime datasets. In another method, we use GANs to translate different ratio of daytime images in the dataset to the nighttime but still with their labels. In this sense, synthetic nighttime segmentation datasets can be generated to yield models prepared to operate at nighttime conditions robustly. In our experiment, the later method significantly boosts the performance at the nighttime evidenced by quantitative results using Intersection over Union (IoU) and Pixel Accuracy (Acc). We show that the performance varies with respect to the proportion of synthetic nighttime images in the dataset, where the sweet spot corresponds to most robust performance across the day and night. The proposed framework not only makes contribution to the optimization of visual perception in intelligent vehicles, but also can be applied to diverse navigational assistance systems.

Using Image Priors to Improve Scene Understanding

NLFNet: Non-Local Fusion Towards Generalized Multimodal Semantic Segmentation Across RGB-Depth, Polarization, and Thermal Images

LEARNING SHAPE PRIORS BY PAIRWISE COMPARISON FOR ROBUST SEMANTIC SEGMENTATION

See Clearer at Night: Towards Robust Nighttime Semantic Segmentation Through Day-Night Image Conversion

Research of improving semantic image segmentation based on a feature fusion model

Neural Map Prior for Autonomous Driving

Learning High-level Prior with Convolutional Neural Networks for Semantic Segmentation

Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic injection and scene fidelity

Dual-modal Prior Semantic Guided Infrared and Visible Image Fusion for Intelligent Transportation System

Visual Semantic Navigation using Scene Priors

LIF-Seg: LiDAR and Camera Image Fusion for 3D LiDAR Semantic Segmentation

Enhancing Feature Fusion with Spatial Aggregation and Channel Fusion for Semantic Segmentation

FuseSeg: Semantic Segmentation of Urban Scenes Based on RGB and Thermal Data Fusion

A Multi-phase Camera-LiDAR Fusion Network for 3D Semantic Segmentation with Weak Supervision

Neural Scene Flow Prior

Learning 3D Scene Priors with 2D Supervision

Boosting Real-Time Driving Scene Parsing with Shared Semantics

2DPASS: 2D Priors Assisted Semantic Segmentation on LiDAR Point Clouds

RGB and LiDAR Fusion-based 3D Semantic Segmentation for Autonomous Driving

Multi-scale Semantic Prior Features Guided Deep Neural Network for Urban Street-view Image

Improved 3D Semantic Segmentation Model Based on RGB Image and LiDAR Point Cloud Fusion for Automantic Driving