Abstract:In recent years, intelligent driving navigation and security monitoring have made considerable progress with the help of deep Convolutional Neural Networks (CNNs). As one of the state-of-the-art perception approaches, semantic segmentation unifies distinct detection tasks widely desired by both autonomous driving and security monitoring. Currently, semantic segmentation shows remarkable efficiency and reliability in standard scenarios such as daytime scenes with favorable illumination conditions. However, in face of adverse conditions such as the nighttime, semantic segmentation loses its accuracy significantly. One of the main causes of the problem is the lack of sufficient annotated segmentation datasets of nighttime scenes. In this paper, we propose a framework to alleviate the accuracy decline when semantic segmentation is taken to adverse conditions by using Generative Adversarial Networks (GANs). To bridge the daytime and nighttime image domains, we made key observation that compared to datasets in adverse conditions, there are considerable amount of segmentation datasets in standard conditions such as BDD and our collected ZJU datasets. Our GAN-based nighttime semantic segmentation framework includes two methods. In the first method, GANs were used to translate nighttime images to the daytime, thus semantic segmentation can be performed using robust models already trained on daytime datasets. In another method, we use GANs to translate different ratio of daytime images in the dataset to the nighttime but still with their labels. In this sense, synthetic nighttime segmentation datasets can be generated to yield models prepared to operate at nighttime conditions robustly. In our experiment, the later method significantly boosts the performance at the nighttime evidenced by quantitative results using Intersection over Union (IoU) and Pixel Accuracy (Acc). We show that the performance varies with respect to the proportion of synthetic nighttime images in the dataset, where the sweet spot corresponds to most robust performance across the day and night. The proposed framework not only makes contribution to the optimization of visual perception in intelligent vehicles, but also can be applied to diverse navigational assistance systems.

TrafficScene: A Multi-modal Dataset Including Light Field for Semantic Segmentation of Traffic Scenes

See Clearer at Night: Towards Robust Nighttime Semantic Segmentation Through Day-Night Image Conversion

Semantic Segmentation With Light Field Imaging and Convolutional Neural Networks

TrafficCAM: A Versatile Dataset for Traffic Flow Segmentation

3D Scene Reconstruction with Sparse LiDAR Data and Monocular Image in Single Frame

SFNet-N: An Improved SFNet Algorithm for Semantic Segmentation of Low-Light Autonomous Driving Road Scenes

End-to-End Semantic Segmentation Utilizing Multi-scale Baseline Light Field

A New Parallel Intelligence Based Light Field Dataset for Depth Refinement and Scene Flow Estimation

The Fieldscapes Dataset for Semantic Field Scene Understanding

ME-Seg&DLS-Net: A Dataset and a Network for Autonomous Driving Based on Multi-Element Semantic Segmentation of Pavement

A Dataset for Lane Instance Segmentation in Urban Environments

RoScenes: A Large-scale Multi-view 3D Dataset for Roadside Perception

Performance Evaluation of Deep Learning Networks for Semantic Segmentation of Traffic Stereo-Pair Images

Traffic Scene Parsing through the TSP6K Dataset

The ParallelEye Dataset: Constructing Large-Scale Artificial Scenes for Traffic Vision Research

Semantic Segmentation for Urban-Scene Images

MSeg3D: Multi-modal 3D Semantic Segmentation for Autonomous Driving

Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges

Road Environment Semantic Segmentation with Deep Learning from MLS Point Cloud Data

LiDAR-based Panoptic Segmentation via Dynamic Shifting Network

Revisiting Multi-modal 3D Semantic Segmentation in Real-world Autonomous Driving