Abstract:In recent years, intelligent driving navigation and security monitoring have made considerable progress with the help of deep Convolutional Neural Networks (CNNs). As one of the state-of-the-art perception approaches, semantic segmentation unifies distinct detection tasks widely desired by both autonomous driving and security monitoring. Currently, semantic segmentation shows remarkable efficiency and reliability in standard scenarios such as daytime scenes with favorable illumination conditions. However, in face of adverse conditions such as the nighttime, semantic segmentation loses its accuracy significantly. One of the main causes of the problem is the lack of sufficient annotated segmentation datasets of nighttime scenes. In this paper, we propose a framework to alleviate the accuracy decline when semantic segmentation is taken to adverse conditions by using Generative Adversarial Networks (GANs). To bridge the daytime and nighttime image domains, we made key observation that compared to datasets in adverse conditions, there are considerable amount of segmentation datasets in standard conditions such as BDD and our collected ZJU datasets. Our GAN-based nighttime semantic segmentation framework includes two methods. In the first method, GANs were used to translate nighttime images to the daytime, thus semantic segmentation can be performed using robust models already trained on daytime datasets. In another method, we use GANs to translate different ratio of daytime images in the dataset to the nighttime but still with their labels. In this sense, synthetic nighttime segmentation datasets can be generated to yield models prepared to operate at nighttime conditions robustly. In our experiment, the later method significantly boosts the performance at the nighttime evidenced by quantitative results using Intersection over Union (IoU) and Pixel Accuracy (Acc). We show that the performance varies with respect to the proportion of synthetic nighttime images in the dataset, where the sweet spot corresponds to most robust performance across the day and night. The proposed framework not only makes contribution to the optimization of visual perception in intelligent vehicles, but also can be applied to diverse navigational assistance systems.

Semantic scene understanding on mobile device with illumination invariance for the visually impaired

Scene Text Detection and Recognition System for Visually Impaired People in Real World

Robustifying Semantic Cognition of Traversability Across Wearable RGB-depth Cameras

Unifying Terrain Awareness Through Real-Time Semantic Segmentation

See Clearer at Night: Towards Robust Nighttime Semantic Segmentation Through Day-Night Image Conversion

Unifying Visual Localization and Scene Recognition for People with Visual Impairment

Semantic perception of curbs beyond traversability for real-world navigation assistance systems

An Environmental Perception and Navigational Assistance System for Visually Impaired Persons Based on Semantic Stixels and Sound Interaction

A New Approach of Point Cloud Processing and Scene Segmentation for Guiding the Visually Impaired

A Scalable Real-time Semantic Segmentation Network for Autonomous Driving

Visual Localizer: Outdoor Localization Based on ConvNet Descriptor and Global Optimization for Visually Impaired Pedestrians

Unifying Terrain Awareness for the Visually Impaired through Real-Time Semantic Segmentation

Indoor Navigation Assistance System for Visually Impaired with Semantic Segmentation using EdgeTPU

Lightweight Semantic Segmentation Network for Semantic Scene Understanding on Low-Compute Devices

Light-Deeplabv3+: a lightweight real-time semantic segmentation method for complex environment perception

Rapid Detection of Blind Roads and Crosswalks by Using a Lightweight Semantic Segmentation Network

Real-Time Semantic Segmentation via Spatial-Detail Guided Context Propagation

Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots

Panoptic Lintention Network: Towards Efficient Navigational Perception for the Visually Impaired

A Wearable Navigation Device for Visually Impaired People Based on the Real-Time Semantic Visual SLAM System

AsymFormer: Asymmetrical Cross-Modal Representation Learning for Mobile Platform Real-Time RGB-D Semantic Segmentation