Deep models for multi-view 3D object recognition: a review

Mona Alzahrani,Muhammad Usman,Salma Kammoun Jarraya,Saeed Anwar,Tarek Helmy
DOI: https://doi.org/10.1007/s10462-024-10941-w
IF: 9.588
2024-10-13
Artificial Intelligence Review
Abstract:This review paper focuses on the progress of deep learning-based methods for multi-view 3D object recognition. It covers the state-of-the-art techniques in this field, specifically those that utilize 3D multi-view data as input representation. The paper provides a comprehensive analysis of the pipeline for deep learning-based multi-view 3D object recognition, including the various techniques employed at each stage. It also presents the latest developments in CNN-based and transformer-based models for multi-view 3D object recognition. The review discusses existing models in detail, including the datasets, camera configurations, view selection strategies, pre-trained CNN architectures, fusion strategies, and recognition performance. Additionally, it examines various computer vision applications that use multi-view classification. Finally, it highlights future directions, factors impacting recognition performance, and trends for the development of multi-view 3D object recognition method.
computer science, artificial intelligence
What problem does this paper attempt to address?