Abstract: Object detection and tracking are pivotal tasks in machine learning, particularly in computer vision. Despite significant advances in object detection frameworks, challenges persist in real-world tracking scenarios, including object interactions, occlusions, and background interference. Many algorithms have been proposed for these tasks; however, most struggle to perform well under disturbances and in uncertain environments. This research proposes a novel approach that integrates the You Only Look Once (YOLO) architecture for object detection with a robust filter for target tracking, addressing disturbances and uncertainties. The YOLO architecture, known for its real-time object detection capabilities, is employed for initial object detection and centroid location. In combination with the detection framework, the sliding innovation filter, a novel robust filter, is implemented and postulated to improve tracking reliability in the face of disturbances. Specifically, the sliding innovation filter estimates the optimal centroid location in each frame and updates the object's trajectory. Target tracking traditionally relies on estimation theory techniques such as the Kalman filter; the sliding innovation filter is introduced as a robust alternative that is particularly suitable when a priori information about system dynamics and noise is limited. Experimental simulations in a surveillance scenario demonstrate that the sliding innovation filter-based tracking approach outperforms existing Kalman-based methods, especially in the presence of disturbances. Overall, this research contributes a practical and effective approach to object detection and tracking in real-world, dynamic environments. The comparative analysis with traditional filters provides practical insights and lays the groundwork for future work on multi-object detection and tracking in diverse applications.
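
The per-frame centroid update described in the abstract can be illustrated with a minimal sketch. The abstract gives no equations, so everything below is an assumption for illustration: the random-walk motion model, the noise variances, the boundary layer width `delta`, and the names `bbox_centroid`, `AxisSIF`, and `CentroidTracker` are all hypothetical. The gain follows the form commonly given for the sliding innovation filter, a saturated ratio of the absolute innovation to a boundary layer width, which for a directly measured coordinate reduces to a scalar between 0 and 1. The detector is not included; its bounding boxes are taken as plain `(x1, y1, x2, y2)` tuples.

```python
from dataclasses import dataclass


def bbox_centroid(box):
    """Centroid of an (x1, y1, x2, y2) bounding box, e.g. from a detector."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)


def sat(value: float) -> float:
    """Clamp a non-negative ratio to the range [0, 1]."""
    return max(0.0, min(1.0, value))


@dataclass
class AxisSIF:
    """Scalar sliding innovation filter for one centroid coordinate.

    Assumed model (not taken from the source):
        x_k = x_{k-1} + w_k   (random walk)
        z_k = x_k + v_k       (coordinate measured directly)
    """
    x: float              # state estimate (pixels)
    p: float = 100.0      # estimate variance (assumed initial value)
    q: float = 1.0        # process noise variance (assumed)
    r: float = 4.0        # measurement noise variance (assumed)
    delta: float = 10.0   # sliding boundary layer width in pixels (assumed)

    def step(self, z: float) -> float:
        # Predict: under a random walk the estimate carries over unchanged.
        x_pred = self.x
        p_pred = self.p + self.q
        # Innovation: measured coordinate minus predicted coordinate.
        innov = z - x_pred
        # SIF gain: depends on the innovation size, not on the covariance.
        k = sat(abs(innov) / self.delta)
        # Update the estimate and its variance (scalar Joseph form).
        self.x = x_pred + k * innov
        self.p = (1.0 - k) ** 2 * p_pred + k ** 2 * self.r
        return self.x


class CentroidTracker:
    """Tracks one object's centroid with an independent filter per axis."""

    def __init__(self, cx: float, cy: float, delta: float = 10.0):
        self.fx = AxisSIF(cx, delta=delta)
        self.fy = AxisSIF(cy, delta=delta)

    def update(self, cx: float, cy: float):
        """Feed one detected centroid; return the filtered (x, y)."""
        return self.fx.step(cx), self.fy.step(cy)
```

In this sketch, with `delta` set to 10 pixels, an innovation of 5 pixels receives a gain of 0.5, while any innovation of 10 pixels or more receives a gain of 1 and the estimate moves fully onto the measurement. Unlike the Kalman gain, this gain needs no tuned noise covariances, which is one way to read the abstract's remark that the filter suits cases where a priori information about dynamics and noise is limited.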

Filtering Empty Video Frames for Efficient Real-Time Object Detection

A Fast Filtering Mechanism to Improve Efficiency of Large-Scale Video Analytics

Accelerating real‐time object detection in high‐resolution video surveillance

Real-Time and Accurate Object Detection in Compressed Video by Long Short-term Feature Aggregation

Exploiting Detected Visual Objects for Frame-Level Video Filtering

FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics

Tracking Assisted Faster Video Object Detection

Practical Video Object Detection via Feature Selection and Aggregation

YOLOv4-5D: An Effective and Efficient Object Detector for Autonomous Driving

High-precision real-time autonomous driving target detection based on YOLOv8

Object Detection and Tracking with YOLO and the Sliding Innovation Filter

Generalized Haar Filter based Deep Networks for Real-Time Object Detection in Traffic Scene

A Low-Latency Object Detection Algorithm for the Edge Devices of IoV Systems

Image convolution techniques integrated with YOLOv3 algorithm in motion object data filtering and detection

Video frame feeding approach for validating the performance of an object detection model in real-world conditions

Online Visual Multi-Object Tracking via Labeled Random Finite Set Filtering

Road User Detection in Videos

Efficient One-stage Video Object Detection by Exploiting Temporal Consistency

Real-time Traffic Object Detection for Autonomous Driving

FPGA-Based Vehicle Detection and Tracking Accelerator

Real-Time Traffic Light Detection with Adaptive Background Suppression Filter