Yolo Versions Architecture: Review

Rusul Hussein Hasan,Rasha Majid Hassoo,Inaam Salman Aboud
DOI: https://doi.org/10.31695/ijasre.2023.9.11.7
2023-01-01
International Journal of Advances in Scientific Research and Engineering
Abstract:Deep learning techniques are used across a wide range of fields for several applications. In recent years, deep learning-based object detection from aerial or terrestrial photos has gained popularity as a study topic. The goal of object detection in computer vision is to anticipate the presence of one or more objects, along with their classes and bounding boxes. The YOLO (You Only Look Once) modern object detector can detect things in real-time with accuracy and speed. A neural network from the YOLO family of computer vision models makes one-time predictions about the locations of bounding rectangles andclassification probabilities for an image. In layman's terms, it is a technique for instantly identifying and recognizing items in images.This article, will be focusing on comparing the main differences among the YOLO version's Architecture, and will discuss its evolution from YOLO to YOLOv8, its network architecture, newfeatures, and applications. Itstarts by looking at the basic ideas and design of the first YOLO model, which laid the groundwork for the following improvements in the YOLO family. In additionally, this article will provide a step-by-step guide on how to use the YOLO version architecture, Understanding the primary drivers, feature development, constraints, and even relationships for the versions is crucial as the YOLO versions advance.Researchers interested in object detection, especially beginning researchers, would find this paper useful and enlightening
What problem does this paper attempt to address?