Abstract:Real-time detection and tracking of pedestrian targets is a critical aspect of autonomous driving. However, due to complex road conditions, bad weather and other factors, traditional image detection algorithms are difficult to accurately detect pedestrians on the road, which leads to traffic accidents. To solve the above problems, this paper proposes an improved pedestrian detection and tracking algorithm using YOLOv8n and DeepSort. In the detection stage, the RC-YOLOv8n model is designed. Considering the high real-time requirement in driverless technology, to ensure that the system can quickly respond to a variety of complex road conditions and emergencies, the lightweight RepViT model is introduced to achieve efficient and accurate real-time detection. Meanwhile, the C2f-DCN (Deformable Convolutional Networks) module is proposed to improve the detection accuracy and speed of the model by using deformable convolution. In the tracking phase, the CA-DeepSort model is proposed. The model replaces the pedestrian re-recognition network in DeepSort with the CANet36 network, which captures the spatial location information of pedestrians more accurately by introducing the coordinate attention mechanism, thus improving the fineness and recognition of the feature representation. The CA coordinate attention mechanism is an attention mechanism that enhances the ability of convolutional neural networks to capture spatial and channel information. In addition, Wise-IoU is introduced to replace the traditional IoU for matching, which comprehensively takes into account the shape and position features of the target object, thus improving the matching accuracy. After conducting experiments on different datasets and analysing the experimental data, it is concluded that the RC-YOLOv8n algorithm improves the mAP by 1.6%. When combined with the CA-DeepSort algorithm for tracking, the MOTA reaches 73.2%, which is 12.8% higher than the original algorithm, and the MOTP is 81.9%, which is 1.4% higher than the original algorithm. These improvements significantly enhance the pedestrian detection and tracking performance of driverless vehicles.

Multi ROI and Multi Map Networks for Accurate and Efficient Pedestrian Detection

Towards Accurate Dense Pedestrian Detection Via Occlusion-Prediction Aware Label Assignment and Hierarchical-Nms.

Pedestrian Detection Method Based on Improved YOLOv5s for Densely Occluded Scenarios

Multi-Scale Feature Pyramid Network: A Heavily Occluded Pedestrian Detection Network Based on ResNet

Multi-scale cross-layer fusion and center position network for pedestrian detection

Deep Convolutional Neural Networks For Pedestrian Detection With Skip Pooling

Pedestrian Detection Using Multi-Channel Visual Feature Fusion by Learning Deep Quality Model.

A Unified Multi-Task Learning Architecture for Fast and Accurate Pedestrian Detection

Attention-Guided Region Proposal Network for Pedestrian Detection

Pedestrian Detection Based on Hand-crafted Features and Multi-layer Feature Fused-ResNet Model

Fast Pedestrian Detection with Attention-Enhanced Multi-Scale RPN and Soft-Cascaded Decision Trees

MM_Fast_RCNN_ResNet: Construction of Multimodal Faster RCNN Inception and ResNet V2 for Pedestrian Tracking and detection

Too Far to See? Not Really! --- Pedestrian Detection with Scale-aware Localization Policy

End-to-End Network for Pedestrian Detection, Tracking and Re-Identification in Real-Time Surveillance System

Research on Pedestrian Tracking Technology for Autonomous Driving Scenarios

Pedestrian Detection with Multi-View Convolution Fusion Algorithm

Deep Pedestrian Detection Using Contextual Information and Multi-level Features

Accurate Pedestrian Detection by Human Pose Regression

A Multiscale Parallel Pedestrian Recognition Algorithm Based on YOLOv5

Count- and Similarity-Aware R-CNN for Pedestrian Detection

Multi-Scale Infrared Pedestrian Detection Based on Deep Attention Mechanism