Abstract:Sketch-based 3D shape retrieval (SBSR) can be approached by learning domain-invariant descriptors or ranking metrics from sketches and 2D view images of 3D shapes rendered through numerous viewpoints. However, determining the most appropriate viewpoints that convey discriminative geometric features to benefit the task of SBSR became an essential yet not fully explored area. Existing works extract 3D features from multi-view images observed through pre-defined viewpoints to match 2D sketches. Those methods, however, fail to dynamically select viewpoints by considering the SBSR task. In this work, we introduce a fully differentiable viewpoint learning paradigm driven by the downstream SBSR task, which supports the task-aware and sketch-dependent dynamic viewpoint determination process. We naturally integrate this task-specific and sketch-dependent viewpoint learning process into a meta-learning framework to develop a novel Dynamic Viewer (DV) module for SBSR. DV module comprises a Meta View Learner (MVL) block and a View Generator (VG) block. Specifically, as the first part of the DV module, the MVL block learns to initiate the necessary network parameters of the VG block. Then, the VG block that serves as the second part learns the best viewpoints to render 2D images. To learn the optimal viewpoints for SBSR, we further introduce a view mining loss that aims to maximize the similarity of feature-level information among rendered 2D views and the query sketch. Further, we adopt a variational autoencoder (VAE) to retrieve 3D shapes by setting the newly rendered images and query sketch as inputs. As evidenced by the comprehensive experimental results conducted on popular SBSR datasets, the proposed framework has been demonstrated to outperform recent methods in both category-level sketch-based and fine-grained SBSR.

Fast Best Viewpoint Selection with Geometry-Enhanced Multiple Views and Cross-Modal Distillation

Structure-aware Viewpoint Selection for Volume Visualization

Monocular Viewpoints Estimation for Generic Objects in the Wild

Distilling Sub-Space Structure Across Views for Cardiac Indices Estimation

Multi-View Stereo Representation Revist: Region-Aware MVSNet

Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection

Perceptual-Based Automatic Viewpoint Selection

Similarity Voting Based Viewpoint Selection for Volumes

View-based 3D object retrieval with discriminative views.

A classification-based approach for best view selection of 3D models

Retrieval-Specific View Learning for Sketch-to-Shape Retrieval

MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds

Learning to Select Camera Views: Efficient Multiview Understanding at Few Glances

ProbIBR: Fast Image-Based Rendering with Learned Probability-Guided Sampling

BEVDistill: Cross-Modal BEV Distillation for Multi-View 3D Object Detection

Deep Learning-Based Viewpoint Recommendation in Volume Visualization

VISTA: Boosting 3D Object Detection via Dual Cross-VIew SpaTial Attention

Which Viewpoint Shows it Best? Language for Weakly Supervising View Selection in Multi-view Videos

ViewActive: Active viewpoint optimization from a single image

Efficient Virtual View Selection for 3D Hand Pose Estimation

Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection