Abstract:In the realm of intelligent transportation, pedestrian detection has witnessed significant advancements. However, it continues to grapple with challenging issues, notably the detection of pedestrians in complex lighting scenarios. Conventional visible light mode imaging is profoundly affected by varying lighting conditions. Under optimal daytime lighting, visibility is enhanced, leading to superior pedestrian detection outcomes. Conversely, under low-light conditions, visible light mode imaging falters due to the inadequate provision of pedestrian target information, resulting in a marked decline in detection efficacy. In this context, infrared light mode imaging emerges as a valuable supplement, bolstering pedestrian information provision. This paper delves into pedestrian detection and tracking algorithms within a multi-modal image framework grounded in deep learning methodologies. Leveraging the YOLOv4 algorithm as a foundation, augmented by a channel stack fusion module, a novel multi-modal pedestrian detection algorithm tailored for intelligent transportation is proposed. This algorithm capitalizes on the fusion of visible and infrared light mode image features to enhance pedestrian detection performance amidst complex road environments. Experimental findings demonstrate that compared to the Visible-YOLOv4 algorithm, renowned for its high performance, the proposed Double-YOLOv4-CSE algorithm exhibits a notable improvement, boasting a 5.0% accuracy rate enhancement and a 6.9% reduction in logarithmic average missing rate. This research's goal is to ensure that the algorithm can run smoothly even on a low configuration 1080 Ti GPU and to improve the algorithm's coverage at the application layer, making it affordable and practical for both urban and rural areas. This addresses the broader research problem within the scope of smart cities and remote ends with limited computational power.

Deep Learning-Based Pedestrian Detection Using RGB Images and Sparse LiDAR Point Clouds

Spatio-Contextual Deep Network Based Multimodal Pedestrian Detection For Autonomous Driving

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection

An Efficient 3D Pedestrian Detector with Calibrated RGB Camera and 3D LiDAR.

3D Sensor Based Pedestrian Detection by Integrating Improved HHA Encoding and Two-Branch Feature Fusion

Pseudo-Image and Sparse Points: Vehicle Detection with 2D LiDAR Revisited by Deep Learning-Based Methods

Research on Multi-Modal Pedestrian Detection and Tracking Algorithm Based on Deep Learning

Fusion of Multispectral Data Through Illumination-aware Deep Neural Networks for Pedestrian Detection

Spatio-contextual deep network-based multimodal pedestrian detection for autonomous driving

Pedestrian Detection by Fusion of RGB and Infrared Images in Low-Light Environment

Real-Time Pedestrian Detection for Driver Assistance Systems Based on Deep Learning

A Dual-Modality Pedestrian Detection Method Based on Multi-Scale Feature Fusion

Lightweight Cross-Modal Multispectral Pedestrian Detection Based on Spatial Reweighted Attention Mechanism

Box-level Segmentation Supervised Deep Neural Networks for Accurate and Real-time Multispectral Pedestrian Detection

Three-Dimensional Pedestrian Detection by Fusing Image Semantics and Point Cloud Spatial Visibility Features

Deep Learning-Based Pedestrian Detection Combined with Semantics

Robust Pedestrian Detection Based on Multi-Spectral Image Fusion and Convolutional Neural Networks

A Real-Time Lidar And Vision Based Pedestrian Detection System For Unmanned Ground Vehicles

A Pedestrian Detection and Tracking Framework for Autonomous Cars: Efficient Fusion of Camera and LiDAR Data

Multispectral Pedestrian Detection Based on Deep Convolutional Neural Networks.