A review of object detection: Datasets, performance evaluation, architecture, applications and current trends

Wei Chen, Jinjin Luo, Fan Zhang, Zijian Tian
DOI: https://doi.org/10.1007/s11042-023-17949-4
IF: 2.577
2024-01-14
Multimedia Tools and Applications
Abstract: Object detection is one of the most important and challenging branches of computer vision; its main task is to classify and localize objects in images or videos. Object detection technology has developed over more than 20 years, from early traditional detection methods to current deep learning methods, and the improvements in detection accuracy and speed stem from the rapid development of deep learning. Traditional object detection techniques have many limitations, whereas using convolutional neural networks as the main framework for object detection can extract features efficiently and reduce the complexity of manual feature extraction. To provide a comprehensive and in-depth understanding of the state of object detection, and drawing on the related domestic and international literature, this paper reviews the research background of object detection, introduces the problems and dilemmas faced by traditional object detection algorithms, and analyzes the current mainstream object detection algorithms. It surveys the relevant algorithms from three perspectives (anchor-based, anchor-free, and Transformer-based) and summarizes their structure, performance, advantages, and disadvantages in detail. It also introduces the commonly used datasets and performance evaluation metrics for object detection, as well as the applications of object detection in industry, transportation, medicine, and other fields. Finally, based on current research hotspots and the development trends of related technologies, future research directions for object detection are discussed.
computer science, information systems, theory & methods, engineering, electrical & electronic, software engineering