Abstract:3D shape retrieval has attracted much attention and become a hot topic in computer vision field recently.With the development of deep learning, 3D shape retrieval has also made great progress and many view-based methods have been introduced in recent years. However, how to represent 3D shapes better is still a challenging problem. At the same time, the intrinsic hierarchical associations among views still have not been well utilized. In order to tackle these problems, in this paper, we propose a multi-loop-view convolutional neural network (MLVCNN) framework for 3D shape retrieval. In this method, multiple groups of views are extracted from different loop directions first. Given these multiple loop views, the proposed MLVCNN framework introduces a hierarchical view-loop-shape architecture, i.e., the view level, the loop level, and the shape level, to conduct 3D shape representation from different scales. In the view-level, a convolutional neural network is first trained to extract view features. Then, the proposed Loop Normalization and LSTM are utilized for each loop of view to generate the loop-level features, which considering the intrinsic associations of the different views in the same loop. Finally, all the loop-level descriptors are combined into a shape-level descriptor for 3D shape representation, which is used for 3D shape retrieval. Our proposed method has been evaluated on the public 3D shape benchmark, i.e., ModelNet40. Experiments and comparisons with the state-of-the-art methods show that the proposed MLVCNN method can achieve significant performance improvement on 3D shape retrieval tasks. Our MLVCNN outperforms the state-of-the-art methods by the mAP of 4.84% in 3D shape retrieval task. We have also evaluated the performance of the proposed method on the 3D shape classification task where MLVCNN also achieves superior performance compared with recent methods.

Double weighting convolutional neural net‐works for multi‐view 3D shape recognition

Hamming Embedding Sensitivity Guided Fusion Network for 3D Shape Representation.

GVCNN: Group-View Convolutional Neural Networks for 3D Shape Recognition

Multi-view Convolutional Neural Networks for 3D Shape Recognition

View-based weight network for 3D object recognition

Learning View-Based Graph Convolutional Network for Multi-View 3D Shape Analysis

Multi-view dual attention network for 3D object recognition

3D shape classification based on convolutional neural networks fusing multi-view information

Multi-view Moments Embedding Network for 3D Shape Recognition

LIMAN: Local Information based Multi Attention Network for 3D Shape Recognition

MANet: Multimodal Attention Network based Point- View fusion for 3D Shape Recognition

MHSAN: Multi-view hierarchical self-attention network for 3D shape recognition

A Unified Feature Representation and Learning Framework for 3D Shape

Latent-MVCNN: 3D Shape Recognition Using Multiple Views from Pre-defined or Random Viewpoints

Deformable convolutional networks for multi‐view 3D shape classification

MLVCNN: Multi-Loop-View Convolutional Neural Network for 3D Shape Retrieval

Multi-view SoftPool Attention Convolutional Networks for 3D Model Classification.

OVPT: Optimal Viewset Pooling Transformer for 3D Object Recognition.

Joint Multi-view 2D Convolutional Neural Networks for 3D Object Classification

Learning Disentangled Representation for Multi-View 3D Object Recognition.

Multiple Discrimination and Pairwise CNN for view-based 3D object retrieval