Abstract: A point cloud is a set of points defined in a 3D metric space. Point clouds have become one of the most significant data formats for 3D representation and are gaining popularity as acquisition devices become more widely available and as applications grow in areas such as robotics, autonomous driving, and augmented and virtual reality. Deep learning is now the most powerful tool for data processing in computer vision and is becoming the preferred technique for tasks such as classification, segmentation, and detection. However, deep learning techniques are mainly applied to data with a structured grid, whereas point clouds are unstructured, which makes processing them directly with deep learning very challenging. This paper reviews recent state-of-the-art deep learning techniques, focusing mainly on raw point cloud data. The initial work on deep learning directly with raw point cloud data did not model local regions; subsequent approaches therefore model local regions through sampling and grouping. More recently, several approaches have been proposed that not only model the local regions but also explore the correlation between points in those regions. From the survey, we conclude that approaches that model local regions and take into account the correlation between points in the local regions perform better. Unlike existing reviews, this paper provides a general structure for learning with raw point clouds and compares various methods based on that structure. This work also introduces the popular 3D point cloud benchmark datasets and discusses the application of deep learning in popular 3D vision tasks, including classification, segmentation, and detection.

Deep models for multi-view 3D object recognition: a review

Multi-view stereo in the Deep Learning Era: A comprehensive review

Deep learning for 3D object recognition: A survey

Deep Learning for 3D Reconstruction, Augmentation, and Registration: A Review Paper

Variable-Viewpoint Representations for 3D Object Recognition

Multi-view dual attention network for 3D object recognition

Single-View 3D reconstruction: A Survey of deep learning methods

Deep Learning for Multi-View Stereo via Plane Sweep: A Survey

Multi-View Stereo Representation Revist: Region-Aware MVSNet

Multi-view Moments Embedding Network for 3D Shape Recognition

A review on deep learning techniques for 3D sensed data classification

3D Point Cloud for Objects and Scenes Classification, Recognition, Segmentation, and Reconstruction: A Review

Multi-view Convolutional Neural Networks for 3D Shape Recognition

Learning Disentangled Representation for Multi-View 3D Object Recognition

ReINView: Re-interpreting Views for Multi-view 3D Object Recognition

Learning the Global Descriptor for 3-D Object Recognition Based on Multiple Views Decomposition

Multi-View 3D Object Retrieval with Deep Embedding Network

Unsupervised Multi-View CNN for Salient View Selection and 3D Interest Point Detection

Review: Deep Learning on 3D Point Clouds