A Novel Semantic Information Perception Architecture for Extreme Targets Detection in Complex Traffic Scenarios

Ang Li, Ziwei Wang, Fanxun Wang, Zhichao Liu, Guodong Yin, Ruiqi Fang, Keke Geng
DOI: https://doi.org/10.1109/tiv.2024.3450201
IF: 8.2
2024-01-01
IEEE Transactions on Intelligent Vehicles
Abstract: Semantic perception information is essential for high-level tasks such as behavior prediction and path planning in autonomous vehicles. Traditional algorithms, when applied directly from computer vision to complex traffic scenarios, often struggle to adapt to certain challenging targets. These include two typical kinds of extreme targets: the first comprises targets with partially missing features, commonly found in congested traffic scenes with occlusions or appearing suddenly in camera blind spots; the second comprises targets with indistinct features, typically encountered in low-light conditions or at night. A mixed multi-scale attention (MMSA) module is designed to balance parameter counts across scales and strengthen the network's capacity to learn incomplete features. Infrared imaging and a fusion algorithm are introduced into the detection algorithm's pre-processing stage, enhancing the scene's thermal information under insufficient lighting. We conduct extensive experiments on the well-lit KITTI dataset, the bimodal LLVIP dataset, and a self-collected dataset. The results show that the mixed multi-scale attention module plays a significant role in detecting some specific objects, and that the fusion module helps improve network performance, especially in poorly lit scenarios. Our code and models pre-trained on the KITTI dataset will be available at https://github.com/vehicleAngLi/MMSANet.