Abstract:In the realm of intelligent transportation, pedestrian detection has witnessed significant advancements. However, it continues to grapple with challenging issues, notably the detection of pedestrians in complex lighting scenarios. Conventional visible light mode imaging is profoundly affected by varying lighting conditions. Under optimal daytime lighting, visibility is enhanced, leading to superior pedestrian detection outcomes. Conversely, under low-light conditions, visible light mode imaging falters due to the inadequate provision of pedestrian target information, resulting in a marked decline in detection efficacy. In this context, infrared light mode imaging emerges as a valuable supplement, bolstering pedestrian information provision. This paper delves into pedestrian detection and tracking algorithms within a multi-modal image framework grounded in deep learning methodologies. Leveraging the YOLOv4 algorithm as a foundation, augmented by a channel stack fusion module, a novel multi-modal pedestrian detection algorithm tailored for intelligent transportation is proposed. This algorithm capitalizes on the fusion of visible and infrared light mode image features to enhance pedestrian detection performance amidst complex road environments. Experimental findings demonstrate that compared to the Visible-YOLOv4 algorithm, renowned for its high performance, the proposed Double-YOLOv4-CSE algorithm exhibits a notable improvement, boasting a 5.0% accuracy rate enhancement and a 6.9% reduction in logarithmic average missing rate. This research's goal is to ensure that the algorithm can run smoothly even on a low configuration 1080 Ti GPU and to improve the algorithm's coverage at the application layer, making it affordable and practical for both urban and rural areas. This addresses the broader research problem within the scope of smart cities and remote ends with limited computational power.

A Deep Top-Down Framework Towards Generalisable Multi-View Pedestrian Detection

A Novel Approach to Design the Fast Pedestrian Detection for Video Surveillance System

3D Random Occlusion and Multi-Layer Projection for Deep Multi-Camera Pedestrian Localization

Towards Accurate Dense Pedestrian Detection Via Occlusion-Prediction Aware Label Assignment and Hierarchical-Nms.

See Extensively While Focusing on the Core Area for Pedestrian Detection.

Pedestrian Detection with Multi-View Convolution Fusion Algorithm

Unsupervised Multi-view Pedestrian Detection

A Top-View Multiple People Tracking System Based on Newest YOLOv5 and DeepSort Using Depth Data

Multi-view and Multi-Plane Data Fusion for Effective Pedestrian Detection in Intelligent Visual Surveillance

A Methodology Review on Multi-view Pedestrian Detection

Multi-Grained Deep Feature Learning for Pedestrian Detection

Research on Multi-Modal Pedestrian Detection and Tracking Algorithm Based on Deep Learning

Multi-View Pedestrian Recognition Using Shared Dictionary Learning with Group Sparsity

An End-to-end Tracking Framework Via Multi-View and Temporal Feature Aggregation

Multiview Detection with Cardboard Human Modeling

Pedestrian Detection Using Multi-Channel Visual Feature Fusion by Learning Deep Quality Model.

Robust Multiple Cameras Pedestrian Detection with Multi-View Bayesian Network.

Locality guided cross-modal feature aggregation and pixel-level fusion for multispectral pedestrian detection

Deep Pedestrian Detection Using Contextual Information and Multi-level Features

A Boosted Multi-Task Model for Pedestrian Detection with Occlusion Handling.

Contour Information-Guided Multi-Scale Feature Detection Method for Visible-Infrared Pedestrian Detection