Abstract:A self-driving car is a hot research topic in the field of the intelligent transportation system, which can greatly alleviate traffic jams and improve travel efficiency. Scene classification is one of the key technologies of self-driving cars, which can provide the basis for decision-making in self-driving cars. In recent years, deep learning-based solutions have achieved good results in the problem of scene classification. However, some problems should be further studied in the scene classification methods, such as how to deal with the similarities among different categories and the differences among the same category. To deal with these problems, an improved deep network-based scene classification method is proposed in this article. In the proposed method, an improved faster region with convolutional neural network features (RCNN) network is used to extract the features of representative objects in the scene to obtain local features, where a new residual attention block is added to the Faster RCNN network to highlight local semantics related to driving scenarios. In addition, an improved Inception module is used to extract global features, where a mixed Leaky ReLU and ELU function is presented, to reduce the possible redundancy of the convolution kernel and enhance the robustness. Then, the local features and the global features are fused to realize the scene classification. Finally, a private dataset is built from the public datasets for the specialized application of scene classification in the self-driving field, and the proposed method is tested on the proposed dataset. The experimental results show that the accuracy of the proposed method can reach 94.76%, which is higher than the state-of-the-art methods.

Deep Integration: A Multi-Label Architecture for Road Scene Recognition

Vehicle-Related Scene Understanding Using Deep Learning

A system of vision sensor based deep neural networks for complex driving scene analysis in support of crash risk assessment and prevention

Research on Road Scene Understanding of Autonomous Vehicles Based on Multi-Task Learning

Scene recognition under special traffic conditions based on deep multi‐task learning

A Scene Understanding Network Based on Driving Scene

Dense-ACSSD for End-to-end Traffic Scenes Recognition

An Improved Deep Network-Based Scene Classification Method for Self-Driving Cars

MultiScene: A Large-scale Dataset and Benchmark for Multi-scene Recognition in Single Aerial Images

A Deep Model for Joint Object Detection and Semantic Segmentation in Traffic Scenes.

Driving Assistance System Based on Deep Learning and Traditional Vision

Lightweight Deep Learning for Road Environment Recognition

Enhancing scene understanding based on deep learning for end-to-end autonomous driving

Learning from Maps: Visual Common Sense for Autonomous Driving

Multi-Modal Sensor Fusion-Based Deep Neural Network for End-to-End Autonomous Driving With Scene Understanding

An End-to-End Multi-Task Learning Model for Drivable Road Detection via Edge Refinement and Geometric Deformation

LiDAR-Based Multi-Task Road Perception Network for Autonomous Vehicles

A Hierarchical Deep Architecture and Mini-Batch Selection Method For Joint Traffic Sign and Light Detection

Deep Neural Network for Structural Prediction and Lane Detection in Traffic Scene.

Multi-Task Learning for Automotive Foggy Scene Understanding via Domain Adaptation to an Illumination-Invariant Representation