Abstract:In the realm of intelligent transportation, pedestrian detection has witnessed significant advancements. However, it continues to grapple with challenging issues, notably the detection of pedestrians in complex lighting scenarios. Conventional visible light mode imaging is profoundly affected by varying lighting conditions. Under optimal daytime lighting, visibility is enhanced, leading to superior pedestrian detection outcomes. Conversely, under low-light conditions, visible light mode imaging falters due to the inadequate provision of pedestrian target information, resulting in a marked decline in detection efficacy. In this context, infrared light mode imaging emerges as a valuable supplement, bolstering pedestrian information provision. This paper delves into pedestrian detection and tracking algorithms within a multi-modal image framework grounded in deep learning methodologies. Leveraging the YOLOv4 algorithm as a foundation, augmented by a channel stack fusion module, a novel multi-modal pedestrian detection algorithm tailored for intelligent transportation is proposed. This algorithm capitalizes on the fusion of visible and infrared light mode image features to enhance pedestrian detection performance amidst complex road environments. Experimental findings demonstrate that compared to the Visible-YOLOv4 algorithm, renowned for its high performance, the proposed Double-YOLOv4-CSE algorithm exhibits a notable improvement, boasting a 5.0% accuracy rate enhancement and a 6.9% reduction in logarithmic average missing rate. This research's goal is to ensure that the algorithm can run smoothly even on a low configuration 1080 Ti GPU and to improve the algorithm's coverage at the application layer, making it affordable and practical for both urban and rural areas. This addresses the broader research problem within the scope of smart cities and remote ends with limited computational power.

DELTA: Integrating Multimodal Sensing with Micromobility for Enhanced Sidewalk and Pedestrian Route Understanding

APE: An Open and Shared Annotated Dataset for Learning Urban Pedestrian Path Networks

Leveraging 3D LiDAR Sensors to Enable Enhanced Urban Safety and Public Health: Pedestrian Monitoring and Abnormal Activity Detection

Understanding Pedestrian Movement Using Urban Sensing Technologies: The Promise of Audio-based Sensors

When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset

Diving Deeper Into Pedestrian Behavior Understanding: Intention Estimation, Action Prediction, and Event Risk Assessment

Towards Rich, Portable, and Large-Scale Pedestrian Data Collection

Urban Pedestrian Routes' Accessibility Assessment Using Geographic Information System Processing and Deep Learning-Based Object Detection

TBD Pedestrian Data Collection: Towards Rich, Portable, and Large-Scale Natural Pedestrian Data

Sidewalk Measurements from Satellite Images: Preliminary Findings

Pedestrian Origin-Destination Estimation Based on Multi-Camera Person Re-Identification

uB-VisioGeoloc: An image sequences dataset of pedestrian navigation including geolocalised-inertial information and spatial sound rendering of the urban environment's obstacles

InCrowd-VI: A Realistic Visual-Inertial Dataset for Evaluating SLAM in Indoor Pedestrian-Rich Spaces for Human Navigation

Unveiling pedestrian injury risk factors through integration of urban contexts using multimodal deep learning

Research on Multi-Modal Pedestrian Detection and Tracking Algorithm Based on Deep Learning

Multi-object urban dataset: A resource for detecting pedestrians, traffic and motorbikes

Pedestrian Detection for Autonomous Vehicles Using Virtual-to-Real Augmentation

AV-PedAware: Self-Supervised Audio-Visual Fusion for Dynamic Pedestrian Awareness

Pedestrian Attribute Recognition: A New Benchmark Dataset and A Large Language Model Augmented Framework

Graph-SIM: A Graph-based Spatiotemporal Interaction Modelling for Pedestrian Action Prediction

A Pedestrian Detection Algorithm Based on Score Fusion for Multi-LiDAR Systems