Abstract:3D shape retrieval has attracted much attention and become a hot topic in computer vision field recently.With the development of deep learning, 3D shape retrieval has also made great progress and many view-based methods have been introduced in recent years. However, how to represent 3D shapes better is still a challenging problem. At the same time, the intrinsic hierarchical associations among views still have not been well utilized. In order to tackle these problems, in this paper, we propose a multi-loop-view convolutional neural network (MLVCNN) framework for 3D shape retrieval. In this method, multiple groups of views are extracted from different loop directions first. Given these multiple loop views, the proposed MLVCNN framework introduces a hierarchical view-loop-shape architecture, i.e., the view level, the loop level, and the shape level, to conduct 3D shape representation from different scales. In the view-level, a convolutional neural network is first trained to extract view features. Then, the proposed Loop Normalization and LSTM are utilized for each loop of view to generate the loop-level features, which considering the intrinsic associations of the different views in the same loop. Finally, all the loop-level descriptors are combined into a shape-level descriptor for 3D shape representation, which is used for 3D shape retrieval. Our proposed method has been evaluated on the public 3D shape benchmark, i.e., ModelNet40. Experiments and comparisons with the state-of-the-art methods show that the proposed MLVCNN method can achieve significant performance improvement on 3D shape retrieval tasks. Our MLVCNN outperforms the state-of-the-art methods by the mAP of 4.84% in 3D shape retrieval task. We have also evaluated the performance of the proposed method on the 3D shape classification task where MLVCNN also achieves superior performance compared with recent methods.

MSDCNN: A multiscale dilated convolution neural network for fine-grained 3D shape classification

Multiscale 3-D-2-D Mixed CNN and Lightweight Attention-Free Transformer for Hyperspectral and LiDAR Classification

Multi-view SoftPool Attention Convolutional Networks for 3D Model Classification.

3D shape classification based on convolutional neural networks fusing multi-view information

Dilated Multi-scale Fusion for Point Cloud Classification and Segmentation

Deformable convolutional networks for multi‐view 3D shape classification

Hybrid Convolutional Network Combining Multiscale 3D Depthwise Separable Convolution and CBAM Residual Dilated Convolution for Hyperspectral Image Classification

Multi-Scale Dense Networks for Hyperspectral Remote Sensing Image Classification

Multiscale Information Fusion for Hyperspectral Image Classification Based on Hybrid 2D-3D CNN

Multi-view dual attention network for 3D object recognition

MV-C3D: A Spatial Correlated Multi-View 3D Convolutional Neural Networks

A New Multiscale Multiattention Convolutional Neural Network for Fine-Grained Surface Defect Detection

DTV-CNN: Neural network based on depth and thickness views for efficient 3D shape classification

MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition

A Multispectral and Multiangle 3-D Convolutional Neural Network for the Classification of ZY-3 Satellite Images Over Urban Areas

MSMHSA-DeepLab V3+: An Effective Multi-Scale, Multi-Head Self-Attention Network for Dual-Modality Cardiac Medical Image Segmentation

Large-Scale 3D Scene Classification With Multi-View Volumetric CNN

Latent-MVCNN: 3D Shape Recognition Using Multiple Views from Pre-defined or Random Viewpoints

Unsupervised Multi-View CNN for Salient View Selection and 3D Interest Point Detection

MLVCNN: Multi-Loop-View Convolutional Neural Network for 3D Shape Retrieval

Remote Sensing Scene Classification Based on Multi-Structure Deep Features Fusion