Accelerating Object Detection with YOLOv4 for Real-Time Applications

K. Senthil Kumar,K.M.B. Abdullah Safwan
2024-10-18
Abstract:Object Detection is related to Computer Vision. Object detection enables detecting instances of objects in images and videos. Due to its increased utilization in surveillance, tracking system used in security and many others applications have propelled researchers to continuously derive more efficient and competitive algorithms. However, problems emerges while implementing it in real-time because of their dynamic environment and complex algorithms used in object detection. In the last few years, Convolution Neural Network (CNN) have emerged as a powerful tool for recognizing image content and in computer vision approach for most problems. In this paper, We revived begins the brief introduction of deep learning and object detection framework like Convolutional Neural Network(CNN), You only look once - version 4 (YOLOv4). Then we focus on our proposed object detection architectures along with some modifications. The traditional model detects a small object in images. We have some modifications to the model. Our proposed method gives the correct result with accuracy.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to improve the efficiency and accuracy of object detection in real - time applications, especially in dynamic environments such as unmanned aerial vehicles (UAVs). Specifically, the paper focuses on the following aspects: 1. **Challenges in real - time object detection**: - The images captured by UAVs are of poor quality, including noise and blurring, which makes it difficult to detect small objects. - Most on - board cameras have a low resolution and it is difficult to identify small objects. - Real - time object detection requires rapid and accurate identification and tracking of objects in dynamic environments. 2. **Limitations of existing methods**: - Traditional object detection algorithms face limitations in computational resources and speed when dealing with complex environments and real - time applications. - Problems such as dynamic backgrounds, illumination changes, non - rigid objects, non - linear motion, and occlusion make object detection and tracking more difficult. 3. **Proposed method**: - The paper introduces the YOLOv4 model, which is a single - stage object detector and can achieve a faster inference speed while maintaining high precision. - YOLOv4 significantly improves its performance on the COCO dataset through a series of improvement techniques (such as DropBlock regularization, data augmentation, Mish activation function, cross - stage partial connection (CSP), self - adversarial training (SAT), weighted residual connection (WRC), etc.), reaching an AP (average precision) of 43.5% and 65 FPS (frames per second). 4. **Specific application scenarios**: - UAVs need efficient real - time object detection in tasks such as surveillance, fire detection, reconnaissance, mapping, and search and rescue. - Fields such as autonomous driving, video surveillance, and UAV navigation have strict requirements for real - time performance and accuracy, and YOLOv4 can meet these requirements. In summary, this paper aims to solve the challenges of efficient real - time object detection in dynamic and complex environments by improving the YOLOv4 model, especially for the problems of low - resolution images and small - object detection in UAV applications.