A Survey of Deep Learning-based Object Detection

Licheng Jiao, Fan Zhang, Fang Liu, Shuyuan Yang, Lingling Li, Zhixi Feng, Rong Qu
DOI: https://doi.org/10.1109/ACCESS.2019.2939201
2019-10-11
Abstract: Object detection is one of the most important and challenging branches of computer vision. It has been widely applied in people's lives, for example in security monitoring and autonomous driving, with the purpose of locating instances of semantic objects of a certain class. With the rapid development of deep learning networks for detection tasks, the performance of object detectors has greatly improved. To give a thorough understanding of the current state of the object detection pipeline, this survey first analyzes the methods of existing typical detection models and describes the benchmark datasets. Afterwards, and primarily, we provide a comprehensive and systematic overview of a variety of object detection methods, covering one-stage and two-stage detectors. Moreover, we list traditional and new applications. Some representative branches of object detection are analyzed as well. Finally, we discuss the architecture for exploiting these object detection methods to build an effective and efficient system, and point out a set of development trends to better follow state-of-the-art algorithms and further research.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main goal of this paper is to comprehensively review deep learning-based object detection techniques and analyze the current state-of-the-art methods and future research trends. Specifically:

1. **Research Background and Objectives**: Object detection, as an important branch of computer vision, has received increasing attention in recent years, especially in fields such as surveillance security and autonomous driving. With the development of deep convolutional neural networks, object detection technology has made significant progress.
2. **Main Contributions**:
   - Systematically analyzed existing typical detection models and their benchmark datasets.
   - Provided a comprehensive overview of various object detection methods (including single-stage and two-stage detectors) and detailed their architectures and performance.
   - Listed traditional and emerging application areas and conducted an in-depth analysis of some representative branches.
   - Discussed how to use these detection methods to build efficient and effective systems and pointed out future development trends.
3. **Main Content**:
   - **Two-Stage Detectors**: Such as Faster R-CNN, these detectors have high accuracy in object localization and recognition.
   - **Single-Stage Detectors**: Such as YOLO, SSD, these detectors achieve high-speed inference, suitable for real-time devices.
   - **Backbone Networks**: Discussed the advantages and disadvantages of different types of backbone networks (such as ResNet, MobileNet, etc.) in feature extraction.
   - **Classic Baselines**: Detailed the development history and improvements of important models such as R-CNN, Fast R-CNN, Faster R-CNN.
   - **Experimental Results**: Compared the performance of different models on multiple datasets through experiments, showcasing their strengths and weaknesses in terms of accuracy and speed.

In summary, this paper aims to provide readers with a comprehensive review of deep learning-based object detection technology, helping to understand the latest advancements and development directions in this field.