Abstract:In order to address the challenges of identifying, detecting, and tracking moving objects in video surveillance, this paper emphasizes image-based dynamic entity detection. It delves into the complexities of numerous moving objects, dense targets, and intricate backgrounds. Leveraging the You Only Look Once (YOLOv3) algorithm framework, this paper proposes improvements in image segmentation and data filtering to address these challenges. These enhancements form a novel multi-object detection algorithm based on an improved YOLOv3 framework, specifically designed for video applications. Experimental validation demonstrates the feasibility of this algorithm, with success rates exceeding 60% for videos such as "jogging", "subway", "video 1", and "video 2". Notably, the detection success rates for "jogging" and "video 1" consistently surpass 80%, indicating outstanding detection performance. Although the accuracy slightly decreases for "Bolt" and "Walking2", success rates still hover around 70%. Comparative analysis with other algorithms reveals that this method's tracking accuracy surpasses that of particle filters, Discriminative Scale Space Tracker (DSST), and Scale Adaptive Multiple Features (SAMF) algorithms, with an accuracy of 0.822. This indicates superior overall performance in target tracking. Therefore, the improved YOLOv3-based multi-object detection and tracking algorithm demonstrates robust filtering and detection capabilities in noise-resistant experiments, making it highly suitable for various detection tasks in practical applications. It can address inherent limitations such as missed detections, false positives, and imprecise localization. These improvements significantly enhance the efficiency and accuracy of target detection, providing valuable insights for researchers in the field of object detection, tracking, and recognition in video surveillance.

A New Local-Main-Gradient-Orientation HOG and Contour Differences Based Algorithm for Object Classification

A Method for Vehicle Detection Based on Local Gradients Vector

An HOG-CT Human Detector with Histogram-Based Search.

High Efficient Moving Object Extraction and Classification in Traffic Video Surveillance

An Enhanced Histogram of Oriented Gradients for Pedestrian Detection

Research on Recognition and Classification of Moving Objects in Mixed Traffic Based on Video Detection

Pedestrian detection using improved Histogram of Oriented Gradients

Fast Pedestrian Detection And Tracking Based On Vibe Combined Hog-Svm Scheme

Local Attention Sequence Model for Video Object Detection

Vision-Based Moving Objects Detection with Background Modeling

Beyond HOG: Learning Local Parts for Object Detection.

Object Recognition Using Words Model of Optimal Size in Histograms of Oriented Gradients

Object Detection Based on LHOG Feature Matching

Hybrid harris hawk-arithmetic optimization with deep learning-driven object detection and classification for surveillance video analysis

See the Difference: Direct Pre-Image Reconstruction and Pose Estimation by Differentiating HOG

Real-time human detection based on gentle MILBoost with variable granularity HOG-CSLBP

A novel hierarchical framework for human head-shoulder detection

Multi-Object Vehicle Detection and Tracking Algorithm Based on Improved YOLOv8 and ByteTrack

A Cascade Svm Approach For Head-Shoulder Detection Using Histograms Of Oriented Gradients

Image convolution techniques integrated with YOLOv3 algorithm in motion object data filtering and detection

Multi-target Classification Method Based on the Fusion of HOG and SURF Features